Skip to content

An All-in-one robot manipulation learning suite for policy models training and evaluation on various datasets and benchmarks.

License

Notifications You must be signed in to change notification settings

InternRobotics/InternManip

Repository files navigation

demo

Gradio Demo doc GitHub star chart GitHub Issues Discord

InternManip

An All-in-one Robot Manipulation Learning Suite for Polcy Models Training and Evaluation on Various Datasets and Benchmarks.

🏠 Highlights

InternManip provides the infrastructure for reproducing & developing the state-of-the-art robot manipulation policies, standardizing πŸ—„οΈdataset formats, βš™οΈmodel interfaces, and πŸ“evaluation protocols.

Available Content

Policy Models Training Datasets Benchmarks

What can you do with InternManip?

  • πŸ”„ Reproduce state-of-the-art policy models on popular robot manipulation datasets.
  • πŸ“Š Train new policies with heterogeneous policy architecture: end2end model (VLA, Action Expert) & agent framework.
  • 🌍 Flexible policy deployment in any third-party benchmarks via a client-server setup.

What's included?

  • βœ… Unified dataset format & loaders for 4+ datasets.
  • βœ… 5 pre-integrated policy models for training & evaluation.
  • βœ… Standard training workflow and server-client evaluation engine.

Why InternManip?

  • πŸ™…πŸ»β€β™‚οΈ Stop re-implementing baselines.
  • πŸ™…πŸ» Stop struggling with dataset formats.
  • πŸ’‘ Focus on policy innovation, not infrastructure.

πŸ”₯ News

  • [2025/07] We are hosting πŸ†IROS 2025 Grand Challenge, stay tuned at official website.
  • [2025/07] Try the SOTA models on GenManip at Gradio Demo.
  • [2025/07] InternManip v0.1.0 released, change log.

πŸ“‹ Table of Contents

πŸš€ Getting Started

Prerequisites

  • Ubuntu 20.04, 22.04
  • CUDA 12.4
  • GPU: The GPU requirements for model running and simulation are different, as shown in the table below:
GPU Model Training & Inference Simulation
CALVIN Simpler-Env Genmanip
NVIDIA RTX Series βœ… βœ… βœ… βœ…
NVIDIA V/A/H100 βœ… βœ… βœ… ❌

Note

We provide a flexible installation tool for users who want to use InternManip for different purposes. Users can choose to install the training and inference environment, and the individual simulation environment independently.

Installation

We provide the installation guide here. You can install locally or use docker and verify the installation easily.

πŸ“š Documentation & Tutorial (WIP)

We provide detailed docs for the basic usage of different modules supported in InternManip. Here are some shortcuts to common scenarios:

Welcome to try and post your suggestions!

πŸ“¦ Benchmarks & Baselines (WIP)

InternManip offers implementations of multiple manipulation policy modelsβ€”GR00T-N1, GR00T-N1.5, Pi-0, DP-CLIP, and ACT-CLIPβ€”as well as curated datasets including GenManip, Simpler-Env, and CALVIN, all organized in the standardized LeRobot format.

The available ${MODEL}, ${DATASET}, ${BENCHMARK} and their results are summarized in the following tables:

CALVIN (ABC-D) Benchmark

Model Dataset/Benchmark Score (Main Metric) Model Weights
gr00t_n1 calvin_abcd
gr00t_n1_5 calvin_abcd
pi0 calvin_abcd
dp_clip calvin_abcd
act_clip calvin_abcd

Simpler-Env Benchmark

Model Dataset/Benchmark Success Rate Model Weights
gr00t_n1 google_robot
gr00t_n1_5 google_robot
pi0 google_robot
dp_clip google_robot
act_clip google_robot
gr00t_n1 bridgedata_v2
gr00t_n1_5 bridgedata_v2
pi0 bridgedata_v2
dp_clip bridgedata_v2
act_clip bridgedata_v2

Genmanip Benchmark

Model Dataset/Benchmark Success Rate Model Weights
gr00t_n1 genmanip_v1
gr00t_n1_5 genmanip_v1
pi0 genmanip_v1
dp_clip genmanip_v1
act_clip genmanip_v1

Please refer to the benchmark documentation for more details on how to run the benchmarks and reproduce the results.

πŸ”§ Support

Join our WeChat support group or Discord for any help.

πŸ‘₯ Contribute

If you would like to contribute to InternManip, please check out our contribution guide. For example, raising issues, fixing bugs in the framework, and adapting or adding new policies and data to the framework.

πŸ”— Citation

If you find our work helpful, please cite:

@misc{internmanip2025,
    title = {InternManip: An All-in-one Robot Manipulation Learning Suite for Polcy Models Training and Evaluation on Various Datasets and Benchmarks},
    author = {InternManip Contributors},
    howpublished={\url{https://github.com/InternRobotics/InternManip}},
    year = {2025}
}
@inproceedings{gao2025genmanip,
    title={GENMANIP: LLM-driven Simulation for Generalizable Instruction-Following Manipulation},
    author={Gao, Ning and Chen, Yilun and Yang, Shuai and Chen, Xinyi and Tian, Yang and Li, Hao and Huang, Haifeng and Wang, Hanqing and Wang, Tai and Pang, Jiangmiao},
    booktitle={Proceedings of the Computer Vision and Pattern Recognition Conference},
    pages={12187--12198},
    year={2025}
}
@inproceedings{grutopia,
    title={GRUtopia: Dream General Robots in a City at Scale},
    author={Wang, Hanqing and Chen, Jiahe and Huang, Wensi and Ben, Qingwei and Wang, Tai and Mi, Boyu and Huang, Tao and Zhao, Siheng and Chen, Yilun and Yang, Sizhe and Cao, Peizhou and Yu, Wenye and Ye, Zichao and Li, Jialun and Long, Junfeng and Wang, ZiRui and Wang, Huiling and Zhao, Ying and Tu, Zhongying and Qiao, Yu and Lin, Dahua and Pang Jiangmiao},
    year={2024},
    booktitle={arXiv},
}

πŸ“ TODO List

  • Release the baseline methods, checkpoints and benchmark data.
  • Release the guidance and tutorials.
  • Polish APIs and related codes.
  • Support closed-loop evaluation.
  • Release the technical report.
  • Support online interactive training.

πŸ“„ License

InternManip's assets and codes are MIT licensed. The open-sourced data are under the Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License Creative Commons License. Other datasets (including Calvin and Simpler) inherit their own distribution licenses.

πŸ‘ Acknowledgements

  • CALVIN: A synthetic benchmark for training and evaluating robotic manipulation policies.
  • Simpler-Env: A real-sim consistent manipulation benchmark for evaluating robotic manipulation policies.
  • Isaac GR00T: This codebase is developed on top of the Isaac GR00T framework, with substantial restructuring and customization to better suit our experimental needs.
  • LeRobot: The data format used in this project largely follows the conventions of LeRobot.
  • InternUtopia (Previously GRUtopia): The evaluation on GenManip relies on the InternUtopia platform.
  • Isaac Lab: We use some utilities from Orbit (Isaac Lab) for driving articulated joints in Isaac Sim.

About

An All-in-one robot manipulation learning suite for policy models training and evaluation on various datasets and benchmarks.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published