Skip to content

TeleHuman/AssemLM

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

6 Commits
Β 
Β 
Β 
Β 

Repository files navigation

πŸ—οΈ AssemLM: Spatial Reasoning Multimodal Large Language Models for Robotic Assembly

Zhi Jing1,2, Jinbin Qiao2,3, Ouyang Lu2,4, Jicong Ao2, Shuang Qiu5, Yu-Gang Jiang1,*, Chenjia Bai2,*

1Fudan University†, 2Institute of Artificial Intelligence (TeleAI), China Telecom†,

3Tianjin University, 4Northwestern Polytechnical University, 5City University of Hong Kong

* Equal advising | † Equally leading organizations

Paper arXiv Model Project Page Code

πŸš€ News

  • [2026-04-16] πŸ—ΊοΈ Announce the open-source roadmap, including plans for releasing inference weights, code, datasets, and future improved versions.

  • [2026-04-10] πŸ“„ Upload the paper to arXiv: paper

  • [2026-03-15] πŸŽ‰ Release the first version of the project page.

  • [2026-03-05] πŸ—οΈ Create the project page and code repository.

πŸ—ΊοΈ Open-Source Roadmap

  • [Coming within 1 week] πŸ”“ Release AssemLM-v1 inference weights, inference code, and a sample dataset.
  • [Planned] πŸ“¦ Release the majority of the AssemBench dataset.
  • [Planned] πŸ“š Release additional datasets and benchmark resources.
  • [Planned] 🧠 Release the training code.
  • [Planned] βš™οΈ Release the data processing pipeline.
  • [Planned] πŸš€ Release updated and improved model weights.

About

Official Implementation of "AssemLM: Spatial Reasoning Multimodal Large Language Models for Robotic Assembly"

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors