Skip to content
English

EBench Docs

GMP CLI

Install the genmanip-client package in your client environment:

Terminal window
pip install -e standalone_tools/packages/genmanip_client/
gmp --help
CommandPurpose
gmp submitSubmit or reconnect benchmark tasks on the eval server.
gmp statusInspect progress and metrics for the current run.
gmp evalRun client workers and interact with server episodes.
gmp plotPost-process episode outputs into visualization artifacts.
gmp cleanRemove generated caches, logs, eval outputs, and temporary leftovers.
gmp visualizeBrowse eval results and replay episodes in the Rerun viewer.

Benchmark family + split:

Terminal window
gmp submit ebench/mobile_manip/test --run_id mobile_test
gmp submit ebench/table_top_manip/val_unseen --run_id tabletop_val_unseen
gmp submit ebench/generalist/val_train --run_id generalist_val_train

Benchmark alias:

Terminal window
gmp submit ebench --run_id full_benchmark

Supported task-setting paths:

Task settings:

  • mobile_manip
  • table_top_manip
  • generalist

Splits:

  • val_train
  • val_unseen
  • test
Terminal window
gmp status --host 127.0.0.1 --port 8087
gmp submit ebench --run_id history_id
gmp status
Terminal window
gmp eval -a r5a -g lift2 --worker_ids 0 --frame_save_interval 10
gmp eval --worker_ids 0,1 --chunk_size 8 --host 127.0.0.1 --port 8087

For custom model integration, see Integrate Your Own Model.

Terminal window
gmp plot client_results/<benchmark>/<run_id>/<task>/<seed>

Use gmp clean to remove generated artifacts from local runs.

Preview what would be removed:

Terminal window
gmp clean --dry-run

Remove generated mesh cache, eval results, logs, and leftover lock/tmp files:

Terminal window
gmp clean

Also remove downloaded benchmark package cache:

Terminal window
gmp clean --all

gmp visualize starts a local HTTPS viewer for browsing runs, task success rates, and per-episode replays.

Install the visualize extra:

Terminal window
pip install -e "standalone_tools/packages/genmanip_client/[visualize]"

Basic usage:

Terminal window
gmp visualize
gmp visualize --port 55088

Cache management:

Terminal window
gmp visualize --flush-cache --dry-run
gmp visualize --flush-cache

Notes:

  • gmp visualize expects evaluation outputs under saved/eval_results/.
  • The viewer uses HTTPS and may show a one-time browser certificate warning.
  • The current rerun-sdk path used by visualize requires Python 3.11+.
  • --run_id: identifies and resumes a run.
  • --host, --port: eval server target (default is local 127.0.0.1:8087).
  • --worker_ids: worker allocation in gmp eval.
  • --frame_save_interval: client-side frame saving frequency.
  • --chunk_size: action chunk length when your model predicts chunked actions.