Add /rerun-stage slash command to rerun specific PR test stages by alisonshao · Pull Request #14262 · sgl-project/sglang

alisonshao · 2025-12-02T02:32:21Z

Adds a new slash command /rerun-stage <stage-name> that allows developers to run individual test stages immediately, skipping dependencies. This is perfect for quick iterations when fixing specific test failures.

Usage

/rerun-stage unit-test-backend-4-gpu

This will:

✅ Run only the 4-gpu test stage
✅ Skip waiting for 1-gpu and 2-gpu tests
✅ Run on your PR branch immediately
✅ Perfect for quick iteration cycles

How It Works

Uses workflow_dispatch to trigger a new workflow run with a target_stage parameter. The specified stage's job condition checks if it's the target and runs immediately, bypassing normal dependencies.
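The gating described above can be modeled as a small predicate. This is only a sketch in Python: the real condition is an `if:` expression on each job in pr-test.yml, and the function name `should_run_stage` and its parameters are hypothetical, chosen for illustration.

```python
from typing import Optional


def should_run_stage(
    job_name: str, target_stage: Optional[str], dependencies_passed: bool
) -> bool:
    """Model of a per-job condition (assumed shape, not the actual YAML)."""
    if target_stage:
        # Targeted run: only the named stage executes, and it does not
        # wait on the result of its upstream stages.
        return job_name == target_stage
    # Normal PR run: the stage is gated on its dependencies as usual.
    return dependencies_passed
```

With a target set, the 4-gpu stage runs even though the 1-gpu and 2-gpu stages have not passed, and every other stage is skipped.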

Currently Supported Stages

unit-test-backend-4-gpu

More stages can be easily added by updating their job conditions in pr-test.yml.

Benefits

Before: Fix 4-gpu bug → push → wait 30min for 1-gpu and 2-gpu → finally test 4-gpu
After: Fix 4-gpu bug → push → /rerun-stage unit-test-backend-4-gpu → test immediately! 🚀

Once you've validated the fix works, run the full CI to ensure everything passes.

Implementation

Added target_stage input to pr-test.yml workflow_dispatch
Updated unit-test-backend-4-gpu condition to run when it's the target
Modified slash command handler to trigger workflow_dispatch instead of rerunning jobs
Same permissions as /rerun-failed-ci

Adds a new slash command '/rerun-stage <stage-name>' that allows developers to rerun individual stages/jobs in the PR Test workflow. This is useful when fixing test failures, as it avoids having to rerun the entire test suite.

Usage:
/rerun-stage unit-test-backend-1-gpu
/rerun-stage accuracy-test-1-gpu
/rerun-stage quantization-test

Features:
- Only reruns the specified stage if it failed/skipped
- Provides helpful error messages if stage name is wrong
- Lists common stage names when stage not found
- Same permissions as /rerun-failed-ci

Changes:
- Updated slash-command-handler.yml to recognize /rerun-stage
- Added handle_rerun_stage() function in slash_command_handler.py
- Added 'can_rerun_stage' permission to all users with rerun access
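The parsing and error-message behavior listed in this commit message could look roughly like the sketch below. It is not the code from slash_command_handler.py: `parse_rerun_stage` and `KNOWN_STAGES` are hypothetical names, and the stage list is only the subset used as examples in this PR.

```python
from typing import List, Optional, Tuple

# Hypothetical subset of stage names, taken from the usage examples above.
KNOWN_STAGES: List[str] = [
    "unit-test-backend-1-gpu",
    "accuracy-test-1-gpu",
    "quantization-test",
]


def parse_rerun_stage(comment: str) -> Tuple[Optional[str], Optional[str]]:
    """Return (stage, error); exactly one of the two is None."""
    parts = comment.strip().split()
    if not parts or parts[0] != "/rerun-stage":
        return None, "not a /rerun-stage command"
    if len(parts) != 2:
        return None, "usage: /rerun-stage <stage-name>"
    stage = parts[1]
    if stage not in KNOWN_STAGES:
        # List the known names so a typo is easy to correct.
        common = ", ".join(KNOWN_STAGES)
        return None, f"unknown stage '{stage}'; common stages: {common}"
    return stage, None
```

Returning an error string instead of raising keeps the caller free to post the message back as a PR comment.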

alisonshao · 2025-12-02T02:44:54Z

Note: The /rerun-stage command won't work on this PR yet because issue_comment workflows always run from the default branch (main) for security reasons.

Once this PR is merged, the command will be available for all PRs. You can test it on any PR after merge by commenting:

/rerun-stage unit-test-backend-4-gpu

This is GitHub's standard behavior to prevent malicious PRs from executing arbitrary workflow code.

Changed approach from rerunning failed jobs to triggering a new workflow run with target_stage parameter. This allows running a specific stage immediately without waiting for its dependencies to pass.

Changes:
- Added target_stage input to pr-test.yml workflow_dispatch
- Updated unit-test-backend-4-gpu condition to run when target_stage matches
- Modified handle_rerun_stage() to use workflow_dispatch instead of rerun
- Stage runs independently on PR branch, skipping 1-gpu and 2-gpu dependencies

Usage: /rerun-stage unit-test-backend-4-gpu
Result: Runs only 4-gpu test immediately, perfect for quick iterations

alisonshao · 2025-12-02T02:57:28Z

Updated implementation to use workflow_dispatch with target_stage parameter. The specified stage now runs immediately without waiting for dependencies.

Example: /rerun-stage unit-test-backend-4-gpu triggers only the 4-gpu test, skipping 1-gpu and 2-gpu.

Supported stages now:
- unit-test-backend-2-gpu
- unit-test-backend-4-gpu
- unit-test-backend-8-gpu-h200
- unit-test-backend-8-gpu-h20

All stages can now be triggered independently without waiting for dependencies.

alisonshao · 2025-12-02T03:21:27Z

Added support for multiple GPU stages. Now supports:

unit-test-backend-2-gpu
unit-test-backend-4-gpu
unit-test-backend-8-gpu-h200
unit-test-backend-8-gpu-h20

All stages run independently without waiting for dependencies when triggered via /rerun-stage.

alisonshao · 2025-12-02T03:28:25Z

Working example workflow run:

https://github.com/sgl-project/sglang/actions/runs/19845385131

This run was triggered with: gh workflow run "PR Test" --ref feat/add-rerun-stage-slash-command -f version=release -f target_stage=unit-test-backend-4-gpu

Shows the unit-test-backend-4-gpu stage running independently without waiting for 1-gpu and 2-gpu dependencies.
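For reference, the same dispatch can be expressed against GitHub's REST endpoint for creating a workflow dispatch event. The sketch below only builds the URL and JSON body and sends nothing; `build_dispatch_request` is a hypothetical helper, and addressing the workflow by the file name pr-test.yml (rather than by the display name "PR Test") is an assumption.

```python
import json
from typing import Any, Dict, Tuple


def build_dispatch_request(
    owner: str, repo: str, workflow_file: str, ref: str, target_stage: str
) -> Tuple[str, Dict[str, Any]]:
    """Build the URL and body for a workflow_dispatch POST (no network call)."""
    url = (
        f"https://api.github.com/repos/{owner}/{repo}"
        f"/actions/workflows/{workflow_file}/dispatches"
    )
    # Inputs mirror the -f flags of the gh command above.
    payload = {
        "ref": ref,
        "inputs": {"version": "release", "target_stage": target_stage},
    }
    return url, payload


if __name__ == "__main__":
    url, payload = build_dispatch_request(
        "sgl-project",
        "sglang",
        "pr-test.yml",
        "feat/add-rerun-stage-slash-command",
        "unit-test-backend-4-gpu",
    )
    print(url)
    print(json.dumps(payload, indent=2))
```

An authenticated POST of this body to the URL should be equivalent to the `gh workflow run` invocation, since `ref` selects the branch whose workflow file and code are used.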

alisonshao · 2025-12-02T03:29:29Z

/tag-and-rerun-ci

Kangyan-Zhou · 2025-12-02T03:36:52Z

.github/workflows/pr-test.yml

Can we apply the change to all the stages as well?

The change is now applied to all stages.

Updates the feature to allow triggering any test stage independently, not just backend GPU tests.

Added support for all stages:
- stage-a-test-1
- multimodal-gen-test-1-gpu, multimodal-gen-test-2-gpu
- quantization-test
- unit-test-backend-1-gpu, unit-test-backend-2-gpu, unit-test-backend-4-gpu
- unit-test-backend-8-gpu-h200, unit-test-backend-8-gpu-h20
- performance-test-1-gpu-part-1, performance-test-1-gpu-part-2, performance-test-1-gpu-part-3
- performance-test-2-gpu
- accuracy-test-1-gpu, accuracy-test-2-gpu
- unit-test-deepep-4-gpu, unit-test-deepep-8-gpu
- unit-test-backend-4-gpu-b200, unit-test-backend-4-gpu-gb200

alisonshao requested review from Fridge003, Kangyan-Zhou, ispobock and merrymercy as code owners December 2, 2025 02:32

Fix black formatting

f54c333

alisonshao added 2 commits December 1, 2025 19:12

Simplify condition to allow target_stage to skip dependency checks

4d83d29

Add support for 2-gpu and 8-gpu stages in /rerun-stage

ef38bab

github-actions bot added the run-ci label Dec 2, 2025

Kangyan-Zhou reviewed Dec 2, 2025

alisonshao and others added 4 commits December 1, 2025 19:44

Merge branch 'main' into feat/add-rerun-stage-slash-command

96be8ae

Merge branch 'main' into feat/add-rerun-stage-slash-command

7eb31be

Merge branch 'main' into feat/add-rerun-stage-slash-command

27541e2

Kangyan-Zhou approved these changes Dec 2, 2025

Kangyan-Zhou merged commit 084b06e into main Dec 2, 2025
63 of 75 checks passed

Kangyan-Zhou deleted the feat/add-rerun-stage-slash-command branch December 2, 2025 22:23

harvenstar pushed a commit to harvenstar/sglang that referenced this pull request Dec 4, 2025

Add /rerun-stage slash command to rerun specific PR test stages (sgl-…

182196c

…project#14262)

yingluosanqian pushed a commit to yingluosanqian/sglang that referenced this pull request Dec 4, 2025

Add /rerun-stage slash command to rerun specific PR test stages (sgl-…

ac25e90

…project#14262)

tonyluj pushed a commit to openanolis/sglang that referenced this pull request Dec 5, 2025

Add /rerun-stage slash command to rerun specific PR test stages (sgl-…

570ac39

…project#14262)

alisonshao mentioned this pull request Dec 5, 2025

[Docs] Add /rerun-stage command to contribution guide #14521

Merged

yuchengz816-bot pushed a commit to yuchengz816-bot/sglang that referenced this pull request Dec 8, 2025

Add /rerun-stage slash command to rerun specific PR test stages (sgl-…

591e43e

…project#14262)

Kangyan-Zhou merged 9 commits into main from feat/add-rerun-stage-slash-command
