[Deepseek V3.2] Fix Deepseek MTP in V1 mode by b8zhong · Pull Request #15429 · sgl-project/sglang

b8zhong · 2025-12-19T00:53:15Z

Motivation

Modifications

Restore the earlier behaviour of Deepseek V3 + MTP. (Accidental change)

Fix #15428

gemini-code-assist · 2025-12-19T00:53:19Z

Warning

You have reached your daily quota limit. Please wait up to 24 hours and I will start processing your requests again!

b8zhong · 2025-12-19T00:54:02Z

/tag-and-rerun-ci

zhyncs · 2025-12-19T01:09:47Z

python/sglang/srt/layers/attention/nsa_backend.py

                # were accepted.
                page_table = torch.repeat_interleave(
-                    page_table, repeats=extend_seq_lens_cpu, dim=0
+                    page_table, repeats=forward_batch.extend_seq_lens, dim=0


may we add a unit test for this, so that we will not break next time

We have a per-commit unit test for MTP and the prior PR passed
https://github.com/sgl-project/sglang/actions/runs/20293560819/job/58356702062?pr=15307
Seems this bug only happens during deepgemm precompile

good to know

* 'main' of https://github.com/sgl-project/sglang: (136 commits) fix: unreachable error check in retraction (sgl-project#15433) [sgl-kernel] chore: update deepgemm version (sgl-project#13402) [diffusion] multi-platform: support diffusion on amd and fix encoder loading on MI325 (sgl-project#13760) [amd] Add deterministic all-reduce kernel for AMD (ROCm) (sgl-project#15340) [diffusion] refactor: refactor _build_req_from_sampling to use shallow_asdict (sgl-project#13782) Add customized sampler registration (sgl-project#15423) Update readme (sgl-project#15425) Fix Mindspore model import warning (sgl-project#15287) [Feature] Xiaomi `MiMo-V2-Flash` day0 support (sgl-project#15207) [diffusion] profiling: add bench_serving.py and VBench (sgl-project#15410) [DLLM] Fix dLLM regression (sgl-project#15371) [Deepseek V3.2] Fix Deepseek MTP in V1 mode (sgl-project#15429) chore: update CI_PERMISSIONS (sgl-project#15431) [DLLM] Add CI for diffusion LLMs (sgl-project#14723) Support using different attention backend for draft decoding. (sgl-project#14843) feat(dsv32): better error handling for DeepSeek-v3.2 encoder (sgl-project#14353) tiny fix lint on main (sgl-project#15424) multimodal: precompute hash for MultimodalDataItem (sgl-project#14354) [AMD] Clear pre-built AITER kernels and warmup to prevent segfaults and test timeouts (sgl-project#15318) [Performance] optimize NSA backend metadata computation for multi-step speculative decoding (sgl-project#14781) ...

more

fd22677

b8zhong requested review from BBuf, Edwardf0t1, Fridge003, HaiShaw, Ying1123, ch-wan, ispobock and merrymercy as code owners December 19, 2025 00:53

b8zhong changed the title ~~[Deepseek V3.2] Fix~~ [Deepseek V3.2] Fix Deepseek MTP in V1 mode Dec 19, 2025

github-actions bot added the run-ci label Dec 19, 2025

momaek mentioned this pull request Dec 19, 2025

fix(nsa_backend): use Tensor instead of list for repeat_interleave peats #15430

Closed

zhyncs reviewed Dec 19, 2025

View reviewed changes

Fridge003 approved these changes Dec 19, 2025

View reviewed changes

Fridge003 merged commit e88e75a into sgl-project:main Dec 19, 2025
139 of 151 checks passed

b8zhong deleted the brayden/fix-deepseek-nsa-mtp branch December 19, 2025 03:58

Prozac614 pushed a commit to Prozac614/sglang that referenced this pull request Dec 23, 2025

[Deepseek V3.2] Fix Deepseek MTP in V1 mode (sgl-project#15429)

b385828

jiaming1130 pushed a commit to zhuyijie88/sglang that referenced this pull request Dec 25, 2025

[Deepseek V3.2] Fix Deepseek MTP in V1 mode (sgl-project#15429)

ffd6889

YChange01 pushed a commit to YChange01/sglang that referenced this pull request Jan 13, 2026

[Deepseek V3.2] Fix Deepseek MTP in V1 mode (sgl-project#15429)

1cea604

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Deepseek V3.2] Fix Deepseek MTP in V1 mode#15429

[Deepseek V3.2] Fix Deepseek MTP in V1 mode#15429
Fridge003 merged 1 commit intosgl-project:mainfrom
bzhng-development:brayden/fix-deepseek-nsa-mtp

b8zhong commented Dec 19, 2025

Uh oh!

gemini-code-assist bot commented Dec 19, 2025

Uh oh!

b8zhong commented Dec 19, 2025

Uh oh!

zhyncs Dec 19, 2025

Uh oh!

Fridge003 Dec 19, 2025 •

edited

Loading

Uh oh!

zhyncs Dec 19, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

b8zhong commented Dec 19, 2025

Motivation

Modifications

Uh oh!

gemini-code-assist bot commented Dec 19, 2025

Uh oh!

b8zhong commented Dec 19, 2025

Uh oh!

zhyncs Dec 19, 2025

Choose a reason for hiding this comment

Uh oh!

Fridge003 Dec 19, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

zhyncs Dec 19, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Fridge003 Dec 19, 2025 •

edited

Loading