Development Roadmap (2025 Q4)

Here is the development roadmap for 2025 Q4. Contributions and feedback are welcome([Open a new discussion](https://github.com/sgl-project/sglang-jax/discussions/new?category=ideas))


### Focus
- More model support
- Improve Language models performance
- Reinforcement learning training framework integration.

### Model coverage
**Language models**
- [Grok2](https://huggingface.co/xai-org/grok-2)
- [Ling 2.0](https://huggingface.co/inclusionAI/Ling-mini-2.0)
- [Qwen3-235B-A22B](https://huggingface.co/Qwen/Qwen3-235B-A22B)
- [GPT-OSS](http://huggingface.co/openai/gpt-oss-20b) #407 
- DeepSeek Series https://github.com/sgl-project/sglang-jax/issues/420

**[Multi-modal models](https://github.com/sgl-project/sglang-jax/issues/476)**
- [MiMo-Audio](https://github.com/XiaomiMiMo/MiMo-Audio)
- [Wan2.1](https://huggingface.co/Wan-AI/Wan2.1-VACE-14B)
- [Qwen2.5-VL](https://huggingface.co/Qwen/Qwen2.5-VL-72B-Instruct) #225 

### Speculative decoding
- **Reference-based speculative decoding support**
- https://github.com/sgl-project/sglang-jax/pull/378
- https://github.com/sgl-project/sglang-jax/pull/544
- https://github.com/sgl-project/sglang-jax/pull/616

### RL framework integration
- **sglang-jax serves as the inference engine backend for [tunix](https://github.com/google/tunix)**
   - Accuracy check: Test math-500 and aime24 for deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B 
      - #286 
   - Support return_logprobs: return logprobs as float32
      - #310  
   - Support pathways mode: single host & multi host
      - #326
   - Interruptible Sampling #447 


### Performance improvements
- Continue to optimize the implemented model and use the GPU version as the baseline

### LoRA Support
- **Support multi LoRA servering**
   Issue: https://github.com/sgl-project/sglang-jax/issues/311

### Deterministic inference

- https://github.com/sgl-project/sglang-jax/issues/325

### Data Parallelism
- https://github.com/sgl-project/sglang-jax/issues/3

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Development Roadmap (2025 Q4) #190

Focus

Model coverage

Speculative decoding

RL framework integration

Performance improvements

LoRA Support

Deterministic inference

Data Parallelism

Sub-issues

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Development Roadmap (2025 Q4) #190

Description

Focus

Model coverage

Speculative decoding

RL framework integration

Performance improvements

LoRA Support

Deterministic inference

Data Parallelism

Sub-issues

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions