SciAgent: A Unified Multi-Agent System for Generalistic Scientific Reasoning

Xuchen Li^1,3, Ruitao Wu^1,4, Xuanbo Liu^1,4, Xukai Wang^1,3, Jinbo Hu⁵, Zhixin Bai^1,6, Bohan Zeng⁵, Hao Liang^1,5, Leheng Chen^1,5, Mingrui Chen^1,3, Haitian Zhong^1,3, Xuanlin Yang^1,5, Xu-Yao Zhang³, Liu Liu⁴, Jia Li⁴, Kaiqi Huang³, Jiahao Xu², Haitao Mi², Wentao Zhang^1,5, Bin Dong^1,5

¹Zhongguancun Academy ²Tencent AI Lab ³Institute of Automation, Chinese Academy of Sciences ⁴Beihang University ⁵Peking University ⁶Nanjing University

Official implementation of SciAgent: A Unified Multi-Agent System for Generalistic Scientific Reasoning

We compare SciAgent's performance (striped blue bars) with the highest, average, and lowest gold medalist scores across five competitions: IMO25, IMC25, IPhO25, CPhO25, and IPhO24. SciAgent reaches gold-medal performance in all five and surpasses the average gold medalist score in each; on IMC25 and CPhO25 it matches or exceeds the highest human gold medalist score.

Abstract: Recent advances in large language models have enabled AI systems to achieve expert-level performance on domain-specific scientific tasks, yet these systems remain narrow and handcrafted. We introduce SciAgent, a unified multi-agent system designed for generalistic scientific reasoning—the ability to adapt reasoning strategies across disciplines and difficulty levels. SciAgent organizes problem solving as a hierarchical process: a Coordinator Agent interprets each problem’s domain and complexity, dynamically orchestrating specialized Worker Systems, each composed of interacting reasoning Sub-agents for symbolic deduction, conceptual modeling, numerical computation, and verification. These agents collaboratively assemble and refine reasoning pipelines tailored to each task. Across mathematics and physics Olympiads (IMO, IMC, IPhO, CPhO), SciAgent consistently attains or surpasses human gold-medalist performance, demonstrating both domain generality and reasoning adaptability. Additionally, SciAgent has been tested on the International Chemistry Olympiad (IChO) and selected problems from the Humanity’s Last Exam (HLE) benchmark, further confirming the system’s ability to generalize across diverse scientific domains. This work establishes SciAgent as a concrete step toward generalistic scientific intelligence—AI systems capable of coherent, cross-disciplinary reasoning at expert levels.

Overview

Conceptual contribution: We introduce generalistic scientific reasoning as a new paradigm for AI in science, emphasizing adaptability across domains and modalities.
Architectural innovation: We propose a Coordinator–Worker–Sub-agents hierarchy in which the Coordinator performs domain-adaptive routing and the Worker Systems self-assemble internal multi-agent pipelines.
Dynamic reasoning mechanism: We demonstrate self-assembling, feedback-driven reasoning loops that integrate symbolic deduction, conceptual modeling, and quantitative computation.
Empirical validation: We show that SciAgent achieves gold-medal-level performance on IMO 2025, IMC 2025, IPhO 2024/2025, and CPhO 2025, and maintains strong generalization on IChO 2025 and the Humanity’s Last Exam benchmark—providing evidence of reasoning transfer rather than narrow specialization.

SciAgent consists of a hierarchical multi-agent framework with a Coordinator Agent that routes problems to domain-specific Worker Systems. Each Worker System—Math, Physics, Chemistry, and General Exam—contains multiple Sub-agents (e.g., Generator, Reviewer, Image Analyser) collaborating through adaptive reasoning loops. The right panel summarizes key design principles: hierarchical meta-reasoning, modularity, and adaptive assembly.
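Since the code has not been released yet, the following is only a minimal sketch of how the Coordinator, Worker System, and Sub-agent hierarchy described above could be wired together. It is not the authors' implementation. Every name in it (`Coordinator`, `WorkerSystem`, `Review`, `keyword_classifier`, the toy generator and reviewer, the `max_rounds` limit, and the fallback to a `"general"` worker) is a hypothetical stand-in, and the Sub-agents are plain functions rather than LLM calls.

```python
from dataclasses import dataclass
from typing import Callable, Dict


@dataclass
class Review:
    """Verdict returned by a (hypothetical) Reviewer sub-agent."""
    accepted: bool
    feedback: str = ""


class WorkerSystem:
    """Hypothetical Worker System: a Generator and a Reviewer in a feedback loop."""

    def __init__(
        self,
        generate: Callable[[str, str], str],
        review: Callable[[str, str], Review],
        max_rounds: int = 3,  # assumed limit, not taken from the paper
    ) -> None:
        self.generate = generate
        self.review = review
        self.max_rounds = max_rounds

    def solve(self, problem: str) -> str:
        feedback = ""
        draft = ""
        for _ in range(self.max_rounds):
            draft = self.generate(problem, feedback)
            verdict = self.review(problem, draft)
            if verdict.accepted:
                break
            feedback = verdict.feedback  # feed the critique into the next draft
        return draft


class Coordinator:
    """Hypothetical Coordinator: classifies a problem and routes it to a worker."""

    def __init__(
        self,
        classify: Callable[[str], str],
        workers: Dict[str, WorkerSystem],
    ) -> None:
        self.classify = classify
        self.workers = workers

    def route(self, problem: str) -> str:
        domain = self.classify(problem)
        # Assumed behaviour: unknown domains fall back to the general worker.
        return domain if domain in self.workers else "general"

    def solve(self, problem: str) -> str:
        return self.workers[self.route(problem)].solve(problem)


# --- Toy stand-ins for the LLM-backed components (illustration only) ---

def keyword_classifier(problem: str) -> str:
    text = problem.lower()
    if "prove" in text or "integer" in text:
        return "math"
    if "force" in text or "velocity" in text:
        return "physics"
    return "general"


def toy_generate(problem: str, feedback: str) -> str:
    return "revised answer" if feedback else "first draft"


def toy_review(problem: str, draft: str) -> Review:
    if draft.startswith("revised"):
        return Review(accepted=True)
    return Review(accepted=False, feedback="show the intermediate steps")


def build_demo() -> Coordinator:
    workers = {
        name: WorkerSystem(toy_generate, toy_review)
        for name in ("math", "physics", "general")
    }
    return Coordinator(keyword_classifier, workers)
```

In this toy setup, `build_demo().solve("Prove that n^2 is even")` is routed to the math worker, whose first draft is rejected by the reviewer and then revised once. In a real system the classifier, generator, and reviewer would each be model calls, and a Worker System could hold more Sub-agents (for example an Image Analyser) than the two shown here.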

TODO 🚀

All code releases will be made by November 16.

[ ] Release the code of SciAgent framework.
[ ] Release the code of Math Olympiad Worker System.
[ ] Release the code of Physics Olympiad Worker System.
[ ] Release the code of Chemistry Olympiad Worker System.
[ ] Release the code of General Exam Worker System.

BibTeX

@article{sciagent-zgca,
  title={SciAgent: A Unified Multi-Agent System for Generalistic Scientific Reasoning},
  author={Xuchen Li and Ruitao Wu and Xuanbo Liu and Xukai Wang and Jinbo Hu and Zhixin Bai and Bohan Zeng and Hao Liang and Leheng Chen and Mingrui Chen and Haitian Zhong and Xuanlin Yang and Xu-Yao Zhang and Liu Liu and Jia Li and Kaiqi Huang and Jiahao Xu and Haitao Mi and Wentao Zhang and Bin Dong},
  journal={arXiv preprint arXiv:2511.08151},
  year={2025}
}

Acknowledgement

This code is heavily inspired by smolagents, IMO25, and Physics-Supernova. Thank you for your outstanding work!

License

This repository is licensed under the Apache License, Version 2.0. You may use, modify, and distribute this code in compliance with the terms and conditions of that license.
