[Roadmap] AMD Instinct Development (2025Q4)

### Checklist

- [x] 1. If the issue you raised is not a feature but a question, please raise a discussion at https://github.com/sgl-project/sglang/discussions/new/choose instead. Otherwise, it will be closed.
- [x] 2. Please use English, otherwise it will be closed.

### Motivation

# Focus
- Distributed inference performance and scaling
- Model performance through kernel optimizations
- Extended model coverage: thinking and diffusion models

# Distributed Inference
- Large-scale EP with MoRI
- P/D (prefill/decode disaggregation) performance
- OME Integration
- Gateway support


# Performance Optimizations
- Communication Kernels: fusion, quantization
- Attention Kernels: performance, FP8
- FP8 KV cache with AITER kernels
- DeepSeek-V3.2 performance
- gpt-oss performance


# Model coverage
- [Kimi-K2-Thinking](https://huggingface.co/moonshotai/Kimi-K2-Thinking)
- [Ring-1T](https://huggingface.co/inclusionAI/Ring-1T)
- [Wan2.1](https://huggingface.co/Wan-AI/Wan2.1-VACE-14B)
- [Qwen-Image](https://github.com/QwenLM/Qwen-Image)

# Reinforcement learning training framework
- slime integration improvements

# Quantization
- Broader MXFP4/FP8 enablement
- Quark integration as a utility




### Related resources

_No response_

[Roadmap] AMD Instinct Development (2025Q4) #12890