sync attention, deepseek doc #14335
Pull request overview
This PR synchronizes and updates documentation for attention backends and DeepSeek model support. The changes focus on improving clarity, adding new deployment guides, and updating technical specifications for various hardware architectures.
Key changes:
- Updated attention backend documentation with refined FA4 specifications and removed outdated warnings
- Enhanced DeepSeek V3/R1 documentation with expanded hardware configurations, new deployment guides, and improved formatting using structured callout blocks
- Updated expert parallelism backend descriptions to use "Blackwell" instead of "SM100+" for better clarity
Reviewed changes
Copilot reviewed 4 out of 4 changed files in this pull request and generated 2 comments.
| File | Description |
|---|---|
| docs/index.rst | Added new documentation entries for multi-modal encoder DP and classify models; reordered references section |
| docs/basic_usage/deepseek_v3.md | Expanded hardware configurations, added deployment guides/blog links, improved documentation structure with callout blocks, and clarified MTP usage |
| docs/advanced_features/expert_parallelism.md | Updated backend descriptions to use "Blackwell" architecture name instead of "SM100+" |
| docs/advanced_features/attention_backend.md | Updated FA4 page size specifications, removed outdated FP8 KV cache warning, and streamlined speculative decoding constraints |
| **FA3 (FlashAttention 3)** | n/a | ❌ | ✅ | ✅ | ⚠️ (page_size=1 only) |
| **Triton** | n/a | ❌ | ❌ | ✅ | ⚠️ (page_size=1 only) |
| **FA4** | 128 | ❌ | ❌ | ❌ | ❌ |
| **FA4** | 1 | ❌ | ❌ | ❌ | ❌ |
There's an inconsistency in FA4's page size specification between the MHA and MLA tables. The MHA table (line 20) shows FA4 with page size "128", but the MLA table (line 41) shows FA4 with page size "1". Please verify which is correct and ensure consistency across both tables.
| **FA4** | 1 | ❌ | ❌ | ❌ | ❌ |
| **FA4** | 128 | ❌ | ❌ | ❌ | ❌ |
(It's actually like this; a launch example is sketched below.)
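Not part of the PR itself, but for readers of this thread, here is a minimal sketch of what the page size column means in practice. It assumes sglang's offline `Engine` accepts the same arguments as the `--attention-backend` and `--page-size` server flags, and the model path is only a placeholder:

```python
# Hypothetical sketch, not from this PR: assumes sglang's offline Engine
# accepts the same keyword arguments as the server CLI flags
# (--attention-backend, --page-size).
import sglang as sgl

# Per the table above, FA3 supports speculative decoding only with page_size=1,
# so a spec-decode setup on FA3 would pin page_size to 1 explicitly.
engine = sgl.Engine(
    model_path="meta-llama/Llama-3.1-8B-Instruct",  # placeholder model, not from the PR
    attention_backend="fa3",
    page_size=1,
)

print(engine.generate("The capital of France is", {"max_new_tokens": 8}))
engine.shutdown()
```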
| **Quantized weights ([W4A8](https://huggingface.co/novita/Deepseek-R1-0528-W4AFP8))** | 8 x H20/100, 4 x H200 |
| **Quantized weights ([AWQ](https://huggingface.co/QuixiAI/DeepSeek-R1-0528-AWQ))** | 8 x H100/800/20 |
| | 8 x A100/A800 |
| **Quantized weights ([MXFP4](https://huggingface.co/amd/DeepSeek-R1-MXFP4-Preview))** | 8, 4 x MI355X/350X |
I have personally tried the W4A8 and MXFP4 configurations, so they work fine; a launch sketch follows below.
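For reference (not something added by the PR), a minimal sketch of what running the W4A8 checkpoint from the table looks like. It assumes the `Engine` keyword arguments `tp_size` and `trust_remote_code` mirror the usual launch flags:

```python
# Hypothetical sketch, not part of this PR: serving the W4A8 checkpoint listed
# in the table above on 8 GPUs. The checkpoint path comes from the table; the
# keyword arguments are assumed to mirror the usual server CLI flags.
import sglang as sgl

engine = sgl.Engine(
    model_path="novita/Deepseek-R1-0528-W4AFP8",  # W4A8 checkpoint from the table
    tp_size=8,                                    # e.g. 8 x H20/H100 per the table
    trust_remote_code=True,
)

print(engine.generate("Explain multi-token prediction in one sentence.", {"max_new_tokens": 64}))
engine.shutdown()
```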
Co-authored-by: Brayden Zhong <b8zhong@users.noreply.github.com>
@Fridge003