GitHub - marmotlab/ORION-multi-agent-navigation: ORION: Option-Regularized Deep Reinforcement Learning for Cooperative Multi-Agent Online Navigation

ORION: Option-Regularized Deep Reinforcement Learning for Cooperative Multi-Agent Online Navigation

🔹 ORION is an efficient RL planner for multi-agent navigation in partially known environments.

🔹 ORION enables real-time, decentralized cooperation by coordinating individual target-reaching and team-level online uncertainty reduction via option-based networks and dual-stage navigation strategy.

🔹 ORION's paper can be found here.

Environment Setup

We use conda/mamba to manage the environment.

conda create -n orion python=3.10 -y
conda activate orion

pip install torch torchvision
pip install opencv-python scikit-image imageio pandas
pip install matplotlib tensorboard
pip install ray wandb

Clone this repository and navigate to the directory.

git clone https://github.com/marmotlab/ORION-multi-agent-navigation.git
cd ORION-multi-agent-navigation

Datasets and Checkpoints

Training datasets are provided in:

maps_priori/
maps_GT/

Evaluation datasets are provided in:

maps_priori_test_new_{n}/
maps_GT_test_new_{n}/

where {n} denotes the number of agents in the team.

The training set consists of simple maps with 3 agents only.
During evaluation, ORION scales to larger teams (3, 4, 5, and 10 agents) and more complex environments without additional training.

We also provide a pretrained checkpoint. As ORION is a decentralized multi-agent navigation planner, the same checkpoint can be directly applied to different team sizes.

Examples of training (left) and evaluation (right) maps.

Training and Evaluation

For training, configure the parameters in parameter.py, then run:

python driver.py

For evaluation, configure the parameters in test_parameter.py, then run:

python test_driver.py

Inline comments are provided in both files to facilitate parameter configuration.

Roadmap

✅ ORION paper released: https://arxiv.org/abs/2601.01155
✅ Training and evaluation code released
⏳ ROS-based implementation (coming soon)

Credit

If you find this work helpful, please consider citing:

@article{shizhe2026orion,
  title={ORION: Option-Regularized Deep Reinforcement Learning for Cooperative Multi-Agent Online Navigation},
  author={Shizhe, Zhang and Jingsong, Liang and Zhitao, Zhou and Shuhan, Ye and Yizhuo, Wang and Derek, Tan Ming Siang and Jimmy, Chiun and Yuhong, Cao and Guillaume, Sartoretti},
  journal={arXiv preprint arXiv:2601.01155},
  year={2026}
}

ORION is inspired by following works, and we thank them for their contributions!

Context-Aware Deep Reinforcement Learning for Autonomous Robotic Navigation in Unknown Area, CoRL 2023
The Option-Critic Architecture, AAAI 2017
ARiADNE ROS Planner, ICRA 2023/RA-L 2024
CMU Development environment
Octomap

Authors

Shizhe Zhang*, Jingsong Liang*, Zhitao Zhou, Shuhan Ye, Yizhuo Wang, Derek Ming Siang Tan, Jimmy Chiun, Yuhong Cao, Guillaume Sartoretti

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
imgs		imgs
maps_GT		maps_GT
maps_GT_test_new_10		maps_GT_test_new_10
maps_GT_test_new_3		maps_GT_test_new_3
maps_GT_test_new_4		maps_GT_test_new_4
maps_GT_test_new_5		maps_GT_test_new_5
maps_priori		maps_priori
maps_priori_test_new_10		maps_priori_test_new_10
maps_priori_test_new_3		maps_priori_test_new_3
maps_priori_test_new_4		maps_priori_test_new_4
maps_priori_test_new_5		maps_priori_test_new_5
model		model
.DS_Store		.DS_Store
LICENSE		LICENSE
README.md		README.md
agent.py		agent.py
driver.py		driver.py
env.py		env.py
model.py		model.py
multi_agent_worker.py		multi_agent_worker.py
node_manager.py		node_manager.py
node_manager_GT_for_reward.py		node_manager_GT_for_reward.py
node_manager_GroundTruth.py		node_manager_GroundTruth.py
parameter.py		parameter.py
quads.py		quads.py
runner.py		runner.py
sensor.py		sensor.py
test_driver.py		test_driver.py
test_parameter.py		test_parameter.py
test_worker.py		test_worker.py
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ORION: Option-Regularized Deep Reinforcement Learning for Cooperative Multi-Agent Online Navigation

Environment Setup

Datasets and Checkpoints

Training and Evaluation

Roadmap

Credit

Authors

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

ORION: Option-Regularized Deep Reinforcement Learning for Cooperative Multi-Agent Online Navigation

Environment Setup

Datasets and Checkpoints

Training and Evaluation

Roadmap

Credit

Authors

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages