Skip to content

TeleHuman/AssemLM

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

6 Commits
 
 
 
 

Repository files navigation

🏗️ AssemLM: Spatial Reasoning Multimodal Large Language Models for Robotic Assembly

Zhi Jing1,2, Jinbin Qiao2,3, Ouyang Lu2,4, Jicong Ao2, Shuang Qiu5, Yu-Gang Jiang1,*, Chenjia Bai2,*

1Fudan University, 2Institute of Artificial Intelligence (TeleAI), China Telecom,

3Tianjin University, 4Northwestern Polytechnical University, 5City University of Hong Kong

* Equal advising | Equally leading organizations

Paper arXiv Model Project Page Code

🚀 News

  • [2026-04-16] 🗺️ Announce the open-source roadmap, including plans for releasing inference weights, code, datasets, and future improved versions.

  • [2026-04-10] 📄 Upload the paper to arXiv: paper

  • [2026-03-15] 🎉 Release the first version of the project page.

  • [2026-03-05] 🏗️ Create the project page and code repository.

🗺️ Open-Source Roadmap

  • [Coming within 1 week] 🔓 Release AssemLM-v1 inference weights, inference code, and a sample dataset.
  • [Planned] 📦 Release the majority of the AssemBench dataset.
  • [Planned] 📚 Release additional datasets and benchmark resources.
  • [Planned] 🧠 Release the training code.
  • [Planned] ⚙️ Release the data processing pipeline.
  • [Planned] 🚀 Release updated and improved model weights.

About

Official Implementation of "AssemLM: Spatial Reasoning Multimodal Large Language Models for Robotic Assembly"

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors