Skip to content

[Roadmap] AMD Instinct Development (2025Q4) #12890

@HaiShaw

Description

@HaiShaw

Checklist

Motivation

Focus

  • Distributed Inference Performance and Scaling
  • Models performance with kernel optimizations
  • Extended Models coverage: Thinking, Diffusion

Distributed Inference

  • Large-scale EP with MoRI
  • PD/D performance
  • OME Integration
  • Gateway support

Performance Optimizations

  • Communication Kernels: fusion, quantization
  • Attention Kernels: performance, fp8
  • FP8 KV cache with AITER kernels
  • DeepSeek 3.2 performance
  • gpt-oss performance

Model coverage

Reinforcement learning training framework

  • slime integration improvements

Quantization

  • More MxFP4/FP8 enablement
  • Quark Integration as utility

Related resources

No response

Metadata

Metadata

Assignees

No one assigned

    Labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions