EBench Docs
Run Evaluation
1. Start the server
    python ray_eval_server.py --host 0.0.0.0 --port 8087 --no_save_process

Or with a local Isaac Sim installation:

    /isaac-sim/python.sh ray_eval_server.py --host 0.0.0.0 --port 8087 --no_save_process

2. Submit a task
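Before submitting, it can help to confirm that the server port is reachable from the client machine. The following is a minimal sketch, not part of EBench: it assumes the server accepts plain TCP connections on the `--port` value, and `server_reachable` is a hypothetical helper name. It only verifies that the port opens, not that the evaluation server is healthy.

```python
import socket


def server_reachable(host: str, port: int, timeout: float = 2.0) -> bool:
    """Return True if a TCP connection to (host, port) can be opened."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts, and unresolvable hosts.
        return False


if __name__ == "__main__":
    # Assumption: the server runs on this machine with the port from the
    # start command (8087).
    print(server_reachable("127.0.0.1", 8087))
```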
From the client environment, submit a benchmark job:

    gmp submit ebench/mobile_manip/test --run_id my_first_run

Available task settings:
| Task setting | Description |
|---|---|
| `ebench/mobile_manip/<split>` | Pick-and-place with mobile base |
| `ebench/table_top_manip/<split>` | Dexterous tabletop tasks |
| `ebench/generalist/<split>` | Mixed tasks across categories |
Splits: val_train, val_unseen, test
Submit all tasks at once with `gmp submit ebench --run_id full_run`.
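To submit a chosen subset rather than everything, the task setting names can be generated from the table and the split list. The sketch below only builds and prints the command lines; it does not run them. It assumes every category is available in every split, and the per-task `run_id` naming scheme is hypothetical.

```python
from itertools import product

# Categories and splits as listed in the table and split list above.
CATEGORIES = ["mobile_manip", "table_top_manip", "generalist"]
SPLITS = ["val_train", "val_unseen", "test"]


def submit_commands(run_prefix: str) -> list[list[str]]:
    """Build one `gmp submit` argument list per (category, split) pair."""
    commands = []
    for category, split in product(CATEGORIES, SPLITS):
        task = f"ebench/{category}/{split}"
        # Hypothetical naming scheme: one run_id per task setting.
        run_id = f"{run_prefix}_{category}_{split}"
        commands.append(["gmp", "submit", task, "--run_id", run_id])
    return commands


for argv in submit_commands("my_run"):
    print(" ".join(argv))
```

Filtering `CATEGORIES` or `SPLITS` before calling `submit_commands` yields the subset to submit.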
3. Connect your model
Quick connectivity check with the built-in baseline:

    gmp eval -a r5a -g lift2 --worker_ids 0

For your own model, see Integrate Your Own Model.
4. Check results
    gmp status

Results are saved to saved/eval_results/<task>/<run_id>/.
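The result files for a run can be located programmatically from that path pattern. This is a sketch under assumptions: it treats `<task>` as the task setting string passed to `gmp submit`, which the page does not confirm, and it makes no claim about the file names or formats inside the directory. `result_dir` and `list_result_files` are hypothetical helper names.

```python
from pathlib import Path


def result_dir(task: str, run_id: str, root: str = "saved/eval_results") -> Path:
    """Return the directory <root>/<task>/<run_id>."""
    return Path(root) / task / run_id


def list_result_files(task: str, run_id: str, root: str = "saved/eval_results") -> list[Path]:
    """List all files under the run's result directory (empty if it is missing)."""
    base = result_dir(task, run_id, root)
    if not base.is_dir():
        return []
    return sorted(p for p in base.rglob("*") if p.is_file())


# Assumption: <task> matches the task setting used at submission time.
for path in list_result_files("ebench/mobile_manip/test", "my_first_run"):
    print(path)
```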
When running server and client on different machines, pass --host <ip> --port <port> to all gmp commands. See the GMP CLI reference for all options.
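When scripting several commands against a remote server, the two flags can be appended in one place. A minimal sketch: `with_remote` is a hypothetical helper, the address is a placeholder, and placing the flags at the end of the command is an assumption to check against the GMP CLI reference.

```python
import shlex


def with_remote(argv: list[str], host: str, port: int) -> list[str]:
    """Return a copy of argv with --host and --port appended."""
    return [*argv, "--host", host, "--port", str(port)]


# Placeholder address; replace with the server machine's IP and port.
for argv in (["gmp", "status"], ["gmp", "submit", "ebench", "--run_id", "full_run"]):
    print(shlex.join(with_remote(argv, "192.0.2.10", 8087)))
```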