[Feature] NVIDIA DGX Spark (GB10, sm_121a) Support Tracking

This issue tracks the progress of adding support for the **NVIDIA DGX Spark** system (GB10, `sm_121a`).

The benchmark results published in the [LMSYS blog post](https://lmsys.org/blog/2025-10-13-nvidia-dgx-spark/), along with the Docker image tag for Spark (`lmsysorg/sglang:spark`), were produced using a **custom SGLang snapshot** from my personal development branch: https://github.com/sgl-project/sglang/compare/main...yvbbrjdr:sglang:spark.

The branch currently includes several **temporary workarounds** that need to be properly addressed before it can be merged into `main`. These include:

1. **Outdated base:** The branch is approximately two weeks old, and rebasing onto `main` may not succeed cleanly or work properly.
2. **PyTorch compatibility:** The official PyTorch release does not yet support **CUDA 13.0**, so a **nightly build** was used in a custom Dockerfile.
3. **Triton issue:** Running GPT-OSS models triggers triton-lang/triton#8335, which remains unresolved.
4. **FP8 kernel dispatch:** FP8 CUTLASS kernels currently fail to dispatch on GB10 (`sm_121a`). As a temporary workaround, they are disabled on this branch, causing PyTorch to fall back to legacy FP8 inference kernels with reduced performance.
5. **Dependency status:** All external dependencies (except for `sgl-kernel`, which must be rebuilt for `sm_121a`) have been disabled due to unknown compatibility. It’s unclear which git tags or commits of these dependencies are compatible with the GB10 architecture.

Additionally, @johnnynunez has opened related PRs for CUDA 13 and FA4 support, which may help resolve some of the above issues: #11299, #11606 (Thank you!)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Feature] NVIDIA DGX Spark (GB10, sm_121a) Support Tracking #11658

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

[Feature] NVIDIA DGX Spark (GB10, sm_121a) Support Tracking #11658

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions