-
Notifications
You must be signed in to change notification settings - Fork 71
Labels
Description
SGLang-Jax multimodal models support
If there is any problem, you can raise it here. #488
Goal
- Design and implement inference sequences for multiple modal models.
- Implement the Wan model and ensure its correctness.
- Implement the Mimo-Audio model and ensure its correctness.
- Implement the Qwen2.5-VL model and ensure its correctness.
Plans
Research Doc
we can use google doc to record
-
SGLang @SII-limingliu
-- tokenizer/detokenizer @lianga1
-- scheduler/kv cache @SII-limingliu
-- model runner @pathfinder-pf -
VLLM @zkkython @SiqiLi-Fighting
-
xDiT @JamesBrianD
Design Doc
Host Compoment @SII-limingliu @pathfinder-pf
Device Compoment @zkkython @SiqiLi-Fighting
Work Assignment
- Implement the Wan baseline model in bonsai and ensure its correctness. @labyrinth-ssr @Iamleos
- Implement the Mimo-Audio baseline model in bonsai and ensure its correctness. @Mozoltov821 @SiqiLi-Fighting
TODO
Review
TODO
Test
TODO
Benchmark & Profile
TODO
Community Members
SII-Team : @SII-limingliu @yangdian96 @liao1995 @lianga1
SGLang-Jax Team : @zkkython @pathfinder-pf @SiqiLi-Fighting @JamesBrianD
Discord
Reactions are currently unavailable