[sgl-kernel] fix b200 kernel ci by FlamingoPg · Pull Request #13907 · sgl-project/sglang

FlamingoPg · 2025-11-25T08:35:59Z

Motivation

Fix failed B200 CI, partly adapted from #13731

Modifications

Accuracy Tests

Benchmarking and Profiling

Checklist

Format your code according to the Format code with pre-commit.
Add unit tests according to the Run and add unit tests.
Update documentation according to Write documentations.
Provide accuracy and speed benchmark results according to Test the accuracy and Benchmark the speed.
Follow the SGLang code style guidance.
Work with maintainers to merge your PR. See the PR Merge Process

Co-authored-by: HydraQYH <qyh820@outlook.com>

gemini-code-assist · 2025-11-25T08:36:14Z

Summary of Changes

Hello @FlamingoPg, I'm Gemini Code Assist¹! I'm currently reviewing this pull request and will post my feedback shortly. In the meantime, here's a summary to help you and other reviewers quickly get up to speed!

This pull request addresses and resolves continuous integration (CI) failures on B200 systems by refining the device capability checks within kernel tests. The changes ensure that specific FP8-related kernel operations and FlashMLA tests are only executed on hardware that explicitly supports the required SM90 architecture, thereby preventing failures on incompatible systems and improving the reliability of the CI pipeline.

Highlights

Corrected Device Support for FP8 MoE Tests: The test_es_fp8_blockwise_moe.py test now correctly specifies that es_fp8_blockwise_scaled_grouped_mm is exclusively supported on SM90 architectures, removing SM100 from the skip condition.
Introduced SM90 Support Utility: A new helper function, is_sm90_supported, has been added to test_flashmla.py to programmatically check for SM90 device capability and a CUDA version of 12.3 or higher.
Applied SM90 Requirement to FlashMLA Tests: FP8-related FlashMLA tests, specifically test_flashmla_prefill and test_flash_mla_decode, are now conditionally skipped if the running environment does not support SM90, ensuring these tests only execute on compatible hardware.

Using Gemini Code Assist

The full guide for Gemini Code Assist can be found on our documentation page, here are some quick tips.

Invoking Gemini

You can request assistance from Gemini at any point by creating a comment using either /gemini <command> or @gemini-code-assist <command>. Below is a summary of the supported commands on the current page.

Feature	Command	Description
Code Review	`/gemini review`	Performs a code review for the current pull request in its current state.
Pull Request Summary	`/gemini summary`	Provides a summary of the current pull request in its current state.
Comment	@gemini-code-assist	Responds in comments when explicitly tagged, both in pull request comments and review comments.
Help	`/gemini help`	Displays a list of available commands.

Customization

To customize Gemini Code Assist for GitHub experience, repository maintainers can create a configuration file and/or provide a custom code review style guide (such as PEP-8 for Python) by creating and adding files to a .gemini/ folder in the base of the repository. Detailed instructions can be found here.

Limitations & Feedback

Gemini Code Assist may make mistakes. Please leave feedback on any instances where its feedback is incorrect or counter productive. You can react with 👍 and 👎 on @gemini-code-assist comments. If you're interested in giving your feedback about your experience with Gemini Code Assist for Github and other Google products, sign up here.

You can also get AI-powered code generation, chat, as well as code reviews directly in the IDE at no cost with the Gemini Code Assist IDE Extension.

Review the Privacy Notices, Generative AI Prohibited Use Policy, Terms of Service, and learn how to configure Gemini Code Assist in GitHub here. Gemini can make mistakes, so double check it and use code with caution. ↩

gemini-code-assist

Code Review

This pull request aims to fix CI failures on B200 (SM100) hardware. The changes involve disabling some tests for FP8 kernels on SM100 architecture by restricting them to SM90, likely as a temporary measure to get the CI pipeline passing. Additionally, a helper function to check for SM90 support has been added to test_flashmla.py, which is a duplicate of an existing function. My feedback focuses on addressing this code duplication to improve maintainability.

gemini-code-assist · 2025-11-25T08:37:12Z

sgl-kernel/tests/test_flashmla.py

+def is_sm90_supported(device=None) -> bool:
+    return (torch.cuda.get_device_capability(device)[0] == 9) and (
+        torch.version.cuda >= "12.3"
+    )


This is_sm90_supported function is a duplicate of the one in sgl-kernel/tests/test_es_fp8_blockwise_moe.py. To follow the DRY (Don't Repeat Yourself) principle and improve maintainability, this function should be defined in a single, shared location, such as a test utilities file (e.g., sgl-kernel/tests/utils.py or sgl-kernel/tests/conftest.py), and imported into both test modules.

.github/workflows/pr-test.yml

Fridge003 · 2025-11-25T19:09:54Z

@FlamingoPg Please fix this
https://github.com/sgl-project/sglang/actions/runs/19680850427/workflow

Co-authored-by: HydraQYH <qyh820@outlook.com>

fix b200 kernel ci

11b7a23

Co-authored-by: HydraQYH <qyh820@outlook.com>

FlamingoPg requested review from BBuf, HaiShaw, ispobock, merrymercy, yizhang2077 and zhyncs as code owners November 25, 2025 08:36

FlamingoPg self-assigned this Nov 25, 2025

github-actions bot added the sgl-kernel label Nov 25, 2025

FlamingoPg added run-ci and removed sgl-kernel labels Nov 25, 2025

gemini-code-assist bot reviewed Nov 25, 2025

View reviewed changes

Merge branch 'main' into b200-ci

386f966

github-actions bot added the sgl-kernel label Nov 25, 2025

reopen sgl-kernel b200 test

7a00ba6

FlamingoPg requested review from Fridge003 and Kangyan-Zhou as code owners November 25, 2025 08:43

Fridge003 reviewed Nov 25, 2025

View reviewed changes

.github/workflows/pr-test.yml Show resolved Hide resolved

FlamingoPg and others added 2 commits November 25, 2025 17:40

Uncomment sgl-kernel-b200-test steps in workflow

5b61577

Merge branch 'main' into b200-ci

8783dbe

FlamingoPg added the format Auto Format Code label Nov 25, 2025

Merge branch 'main' into b200-ci

00f6140

FlamingoPg and others added 3 commits November 26, 2025 16:58

fix workflow

fd8e3ea

Merge branch 'main' into b200-ci

92443ea

Merge branch 'main' into b200-ci

3bbe5a9

Fridge003 approved these changes Nov 30, 2025

View reviewed changes

Fridge003 merged commit 412160f into sgl-project:main Nov 30, 2025
61 of 86 checks passed

harvenstar pushed a commit to harvenstar/sglang that referenced this pull request Dec 4, 2025

[sgl-kernel] fix b200 kernel ci (sgl-project#13907)

2542bff

Co-authored-by: HydraQYH <qyh820@outlook.com>

tonyluj pushed a commit to openanolis/sglang that referenced this pull request Dec 5, 2025

[sgl-kernel] fix b200 kernel ci (sgl-project#13907)

8f3285f

Co-authored-by: HydraQYH <qyh820@outlook.com>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Comments

[sgl-kernel] fix b200 kernel ci#13907

[sgl-kernel] fix b200 kernel ci#13907
Fridge003 merged 9 commits intosgl-project:mainfrom
FlamingoPg:b200-ci

FlamingoPg commented Nov 25, 2025

Uh oh!

gemini-code-assist bot commented Nov 25, 2025

Uh oh!

gemini-code-assist bot left a comment

Uh oh!

gemini-code-assist bot Nov 25, 2025

Uh oh!

Uh oh!

Fridge003 commented Nov 25, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Comments

Conversation

FlamingoPg commented Nov 25, 2025

Motivation

Modifications

Accuracy Tests

Benchmarking and Profiling

Checklist

Uh oh!

gemini-code-assist bot commented Nov 25, 2025

Summary of Changes

Highlights

Footnotes

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

gemini-code-assist bot Nov 25, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Fridge003 commented Nov 25, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants