Tengbo Yu *, Guanxing Lu *, Zaijia Yang *, Haoyuan Deng, Season Si Chen, Jiwen Lu, Wenbo Ding, Guoqiang Hu, Yansong Tang †, Ziwei Wang
Multi-task robotic bimanual manipulation is becoming increasingly popular as it enables sophisticated tasks that require diverse dual-arm collaboration patterns. Compared to unimanual manipulation, bimanual tasks pose challenges in understanding the multi-body spatiotemporal dynamics. An existing method, ManiGaussian, pioneered encoding spatiotemporal dynamics into the visual representation via a Gaussian world model for single-arm settings, but it ignores the interaction between multiple embodiments in dual-arm systems, leading to a significant performance drop. In this paper, we propose ManiGaussian++, an extension of the ManiGaussian framework that improves multi-task bimanual manipulation by digesting multi-body scene dynamics through a hierarchical Gaussian world model. Specifically, we first generate task-oriented Gaussian Splatting from intermediate visual features, which aims to differentiate the acting and stabilizing arms for multi-body spatiotemporal dynamics modeling. We then build a hierarchical Gaussian world model with a leader-follower architecture, where the multi-body spatiotemporal dynamics are mined for the intermediate visual representation via future scene prediction. The leader predicts the Gaussian Splatting deformation caused by motions of the stabilizing arm, from which the follower generates the physical consequences resulting from the movement of the acting arm. As a result, our method significantly outperforms current state-of-the-art bimanual manipulation techniques by 20.2% on 10 simulated tasks, and achieves a 60% average success rate on 9 challenging real-world tasks.
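As a toy illustration of the leader-follower idea described above (not the actual implementation; the networks, feature dimensions, and variable names below are all made up for the sketch), the leader first predicts the Gaussian deformation induced by the stabilizing arm, and the follower then conditions on that deformed scene to model the acting arm's physical consequences:

```python
import numpy as np

rng = np.random.default_rng(0)

def toy_net(in_dim, out_dim):
    """A random linear map with tanh, standing in for a learned network (toy only)."""
    W = rng.standard_normal((in_dim, out_dim)) * 0.01
    return lambda x: np.tanh(x @ W)

N, D = 1024, 14           # hypothetical: N Gaussian primitives, D features each
A = 8                     # hypothetical per-arm action dimension

leader = toy_net(D + A, D)    # leader: deformation from the stabilizing arm
follower = toy_net(D + A, D)  # follower: consequence of the acting arm

gaussians = rng.standard_normal((N, D))  # current Gaussian scene features
a_stab = rng.standard_normal(A)          # stabilizing-arm action
a_act = rng.standard_normal(A)           # acting-arm action

# Leader stage: predict the deformation caused by the stabilizing arm.
x = np.concatenate([gaussians, np.tile(a_stab, (N, 1))], axis=1)
gaussians_mid = gaussians + leader(x)

# Follower stage: predict the acting arm's effect on the already-deformed scene,
# giving the predicted future scene used for the reconstruction objective.
y = np.concatenate([gaussians_mid, np.tile(a_act, (N, 1))], axis=1)
gaussians_next = gaussians_mid + follower(y)

print(gaussians_next.shape)  # (1024, 14)
```

The key point is the ordering: the follower never sees the raw scene, only the leader's prediction, so the acting arm's dynamics are conditioned on the stabilizing arm's motion.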
- Jun. 2025: Codebase for simulated experiments is released!
- Jun. 2025: Our paper is accepted by IROS2025!
NOTE: ManiGaussian++ is mainly built upon the Perceiver-Actor^2 repo by Markus Grotz et al.
See INSTALL.md for installation instructions.
The following steps are structured in order.

First, generate the demonstrations:

```bash
bash scripts/gen_demonstrations_nerf.sh
```
We use wandb to log training curves and visualizations. Log in to wandb before running the scripts:

```bash
wandb login
```
To train our ManiGaussian++ without the deformation predictor, task-oriented Gaussian, and hierarchical Gaussian world model, run:

```bash
bash scripts/train_bimanual.sh ManiGaussian_BC2 0,1 12345 ${exp_name}
```
To train our ManiGaussian++ without the task-oriented Gaussian and hierarchical Gaussian world model, run:

```bash
bash scripts/train_bimanual_dyn.sh ManiGaussian_BC2 0,1 12345 ${exp_name}
```
To train our ManiGaussian++ without the hierarchical Gaussian world model, run:

```bash
bash scripts/train_LF_MASK_IN_NERF.sh ManiGaussian_BC2 0,1 12345 ${exp_name}
```
To train the full ManiGaussian++, run:

```bash
bash scripts/train_LF_MASK_IN_NERF_HIER.sh ManiGaussian_BC2 0,1 12345 ${exp_name}
```
To evaluate a checkpoint, run:

```bash
bash scripts/eval.sh ManiGaussian_BC2 ${exp_name} 0
```

This repository is released under the MIT license.
Our code is built upon ManiGaussian, AnyBimanual, Perceiver-Actor^2, PerAct, RLBench, and CLIP. We thank all these authors for their nicely open-sourced code and their great contributions to the community.
If you find this repository helpful, please consider citing:
```bibtex
@misc{yu2025manigaussiangeneralroboticbimanual,
  title={ManiGaussian++: General Robotic Bimanual Manipulation with Hierarchical Gaussian World Model},
  author={Tengbo Yu and Guanxing Lu and Zaijia Yang and Haoyuan Deng and Season Si Chen and Jiwen Lu and Wenbo Ding and Guoqiang Hu and Yansong Tang and Ziwei Wang},
  year={2025},
  eprint={2506.19842},
  archivePrefix={arXiv},
  primaryClass={cs.RO},
  url={https://arxiv.org/abs/2506.19842},
}
```
