We aim to evaluate Large Language Models (LLMs) for embodied decision-making. While many works leverage LLMs for decision-making in embodied environments, a systematic understanding of their ...
Release 1.0.24: Add new benchmark result CSV files for inference timing on all models with RTX Pro 6000, 5090, and 4090 cards with PyTorch 2.9.1. Fix moved module error in ...