EBench Docs
GMP CLI

Install
Install the genmanip-client package in your client environment:
pip install -e standalone_tools/packages/genmanip_client/
gmp --help

Core commands
| Command | Purpose |
|---|---|
| gmp submit | Submit or reconnect benchmark tasks on the eval server. |
| gmp status | Inspect progress and metrics for the current run. |
| gmp eval | Run client workers and interact with server episodes. |
| gmp plot | Post-process episode outputs into visualization artifacts. |
| gmp clean | Remove generated caches, logs, eval outputs, and temporary leftovers. |
| gmp visualize | Browse eval results and replay episodes in the Rerun viewer. |

Submit, status, and eval

gmp submit
Benchmark family + split:
gmp submit ebench/mobile_manip/test --run_id mobile_test
gmp submit ebench/table_top_manip/val_unseen --run_id tabletop_val_unseen
gmp submit ebench/generalist/val_train --run_id generalist_val_train
Benchmark alias:
gmp submit ebench --run_id full_benchmark
Supported task-setting paths:
Task settings:
- mobile_manip
- table_top_manip
- generalist
Splits:
- val_train
- val_unseen
- test
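Judging by the examples above, the task settings and splits combine into paths of the form ebench/<task_setting>/<split>. The following is a minimal sketch that enumerates all nine combinations and prints the matching gmp submit commands. The function name list_submit_commands and the run_id scheme (<task_setting>_<split>) are assumptions made for illustration; run_id values appear to be free-form. The script only prints commands and does not contact the eval server.

```shell
#!/bin/sh
# Print one `gmp submit` command per task setting and split.
# The run_id naming below is illustrative, not a gmp convention.
list_submit_commands() {
  for setting in mobile_manip table_top_manip generalist; do
    for split in val_train val_unseen test; do
      echo "gmp submit ebench/${setting}/${split} --run_id ${setting}_${split}"
    done
  done
}

list_submit_commands
```

Reviewing the printed list first makes it easy to pick the one path and run_id you actually want to submit.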

gmp status
gmp status --host 127.0.0.1 --port 8087
gmp submit ebench --run_id history_id
gmp status

gmp eval
gmp eval -a r5a -g lift2 --worker_ids 0 --frame_save_interval 10
gmp eval --worker_ids 0,1 --chunk_size 8 --host 127.0.0.1 --port 8087
For custom model integration, see Integrate Your Own Model.

Clean, plot, and visualize

gmp plot
gmp plot client_results/<benchmark>/<run_id>/<task>/<seed>

gmp clean
Use gmp clean to remove generated artifacts from local runs.
Preview what would be removed:
gmp clean --dry-run
Remove generated mesh cache, eval results, logs, and leftover lock/tmp files:
gmp clean
Also remove downloaded benchmark package cache:
gmp clean --all

gmp visualize
gmp visualize starts a local HTTPS viewer for browsing runs, task success rates, and per-episode replays.
Install the visualize extra:
pip install -e "standalone_tools/packages/genmanip_client/[visualize]"
Basic usage:
gmp visualize
gmp visualize --port 55088
Cache management:
gmp visualize --flush-cache --dry-run
gmp visualize --flush-cache
Notes:
- gmp visualize expects evaluation outputs under saved/eval_results/.
- The viewer uses HTTPS and may show a one-time browser certificate warning.
- The current rerun-sdk path used by visualize requires Python 3.11+.
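Given the Python 3.11+ note above, it can be worth checking the interpreter version before installing the visualize extra. A minimal sketch in POSIX shell follows. The helper name meets_min_python is hypothetical, and the script assumes python3 is the interpreter of your client environment, which may not hold for every setup.

```shell
#!/bin/sh
# Return 0 if a "major.minor[.patch]" version string is at least 3.11.
meets_min_python() {
  major=${1%%.*}      # text before the first dot
  rest=${1#*.}        # text after the first dot
  minor=${rest%%.*}   # minor component, without any patch suffix
  [ "$major" -gt 3 ] || { [ "$major" -eq 3 ] && [ "$minor" -ge 11 ]; }
}

# Assumption: `python3` is the client environment's interpreter.
# Falls back to 0.0 if python3 is not on PATH.
current=$(python3 -c 'import sys; print("%d.%d" % sys.version_info[:2])' 2>/dev/null || echo "0.0")

if meets_min_python "$current"; then
  echo "Python $current meets the 3.11+ requirement"
else
  echo "Python $current is below 3.11; gmp visualize may not work"
fi
```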

Common options
- --run_id: identifies and resumes a run.
- --host, --port: eval server target (default is local 127.0.0.1:8087).
- --worker_ids: worker allocation in gmp eval.
- --frame_save_interval: client-side frame saving frequency.
- --chunk_size: action chunk length when your model predicts chunked actions.
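To illustrate how these options fit together, here is a small sketch that assembles the commands for one session from shared variables. The variable names HOST, PORT, and RUN_ID and the function plan_run are hypothetical. The submit, eval, status ordering is an assumption about a typical session rather than something this page prescribes. The script only prints the commands, so nothing is sent to the eval server.

```shell
#!/bin/sh
# Hypothetical variables; HOST and PORT mirror the documented defaults.
HOST=127.0.0.1
PORT=8087
RUN_ID=mobile_test

# Print the commands for one run, reusing the same server target.
plan_run() {
  echo "gmp submit ebench/mobile_manip/test --run_id $RUN_ID"
  echo "gmp eval --worker_ids 0,1 --chunk_size 8 --host $HOST --port $PORT"
  echo "gmp status --host $HOST --port $PORT"
}

plan_run
```

Keeping the server target in one place avoids pointing gmp eval and gmp status at different hosts by accident.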