[diffusion] benchmark: enable performance benchmark with warmup by fsygd · Pull Request #15773 · sgl-project/sglang

fsygd · 2025-12-24T15:35:36Z

Motivation

Benchmark results are more accurate when a warmup run precedes the timed run, especially when something with a one-time compilation cost, like DeepGemm, is involved (e.g. #15403).

without warmup on H200:

with warmup on H100:

Modifications

Accuracy Tests

Benchmarking and Profiling

Checklist

Format your code according to the Format code with pre-commit.
Add unit tests according to the Run and add unit tests.
Update documentation according to Write documentations.
Provide accuracy and speed benchmark results according to Test the accuracy and Benchmark the speed.
Follow the SGLang code style guidance.
Work with maintainers to merge your PR. See the PR Merge Process


python/sglang/multimodal_gen/docs/contributing.md

mickqian · 2025-12-25T16:20:16Z

python/sglang/srt/managers/schedule_batch.py

            return

-        bootstrap_info = f", bootstrap_room={self.bootstrap_room}" if self.bootstrap_room is not None else ""
+        bootstrap_info = (


why this😂

Pre-commit did this, not me. Should I roll it back?

mickqian · 2025-12-25T16:20:48Z

@BBuf would you take a look?

fsygd · 2025-12-27T02:06:01Z

Any blockers? @mickqian

BBuf · 2025-12-29T01:08:12Z

If warmup is enabled by default here, it might mask some performance issues that would otherwise be discoverable, such as the problem found previously here: #15511. For offline inference, you must accept the warmup cost. For serving, the first request also has to accept the warmup cost. For subsequent requests, if the resolution changes or similar things happen, deepgemm still needs to recompile, which also has a cost. These should not be removed, especially for profiling.
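The concern about recompilation on resolution changes can be sketched as follows. This is a toy model under stated assumptions, not DeepGemm's or SGLang's actual behavior: `ShapeKeyedKernel` and `profile_requests` are hypothetical names, and the cache is assumed to be keyed by `(height, width)`. The point is that a profiler can label cold requests rather than hide them.

```python
import time


class ShapeKeyedKernel:
    """Toy model of a kernel that is compiled once per input shape (hypothetical)."""

    def __init__(self, compile_cost=0.03):
        self.compile_cost = compile_cost
        self.cache = set()
        self.compiles = 0

    def __call__(self, height, width):
        key = (height, width)
        if key not in self.cache:
            time.sleep(self.compile_cost)  # simulated (re)compilation
            self.cache.add(key)
            self.compiles += 1
        return height * width


def profile_requests(kernel, resolutions):
    """Return (resolution, seconds, was_cold) per request so cold costs stay visible."""
    records = []
    for res in resolutions:
        compiles_before = kernel.compiles
        start = time.perf_counter()
        kernel(*res)
        elapsed = time.perf_counter() - start
        records.append((res, elapsed, kernel.compiles > compiles_before))
    return records
```

For the request sequence `[(512, 512), (512, 512), (1024, 1024)]`, the first and third requests are marked cold: warming up at one resolution does not cover a later request at a different one.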

fsygd · 2025-12-29T02:39:44Z

> If warmup is enabled by default here, it might mask some performance issues that would otherwise be discoverable, such as the problem found previously here: #15511. For offline inference, you must accept the warmup cost. For serving, the first request also has to accept the warmup cost. For subsequent requests, if the resolution changes or similar things happen, deepgemm still needs to recompile, which also has a cost. These should not be removed, especially for profiling.

I see, enabling warmup by default may hide issues we care about. May I add warmup as an option, since we sometimes need it, or should I just close the PR?

fsygd · 2025-12-30T11:38:04Z

Warmup is an option now, PTAL @mickqian @BBuf @RubiaCx @FlamingoPg
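One way an opt-in warmup switch could look is sketched below. The flag names `--warmup` and `--warmup-iters` are hypothetical and are not taken from this PR or from the SGLang CLI; the sketch only shows warmup defaulting to off so that cold-start costs remain measurable unless the user asks otherwise.

```python
import argparse


def build_parser():
    """Toy benchmark CLI; flag names are illustrative assumptions."""
    parser = argparse.ArgumentParser(description="Toy benchmark CLI (hypothetical)")
    parser.add_argument(
        "--warmup",
        action="store_true",
        help="run untimed warmup iterations before the timed run (off by default)",
    )
    parser.add_argument(
        "--warmup-iters",
        type=int,
        default=1,
        help="number of warmup iterations when --warmup is set",
    )
    return parser


def plan_runs(args):
    """Return the ordered run labels; warmup runs are excluded from timing."""
    warmups = ["warmup"] * (args.warmup_iters if args.warmup else 0)
    return warmups + ["timed"]
```

Parsing an empty argument list yields only the timed run, while `--warmup --warmup-iters 2` schedules two untimed runs ahead of it.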

fsygd requested review from mickqian and yhyang201 as code owners December 24, 2025 15:35

github-actions bot added the documentation and diffusion labels Dec 24, 2025

mickqian reviewed Dec 25, 2025


python/sglang/multimodal_gen/docs/contributing.md (outdated, resolved)

fsygd requested review from Ying1123, hnyls2002, merrymercy and xiezhq-hermann as code owners December 25, 2025 07:46

fsygd requested a review from mickqian December 25, 2025 07:58

mickqian reviewed Dec 25, 2025


fsygd requested a review from mickqian December 27, 2025 15:31

fsygd closed this Dec 29, 2025

fsygd reopened this Dec 30, 2025

fsygd force-pushed the run-benchmark-with-warmup branch from 51a933c to 876f881 Compare December 30, 2025 09:09

fsygd added 4 commits December 30, 2025 09:13

[diffusion] benchmark: enable performance benchmark with warmup

3c578de

Do warmup by generate cli automatically

c0f76b9

format

e274da2

enable warmup as a option

fa6535d

fsygd force-pushed the run-benchmark-with-warmup branch from 876f881 to fa6535d Compare December 30, 2025 11:22

format

4f15ccb

tom-jerr mentioned this pull request Dec 31, 2025

[diffusion] pipeline: lightweight warmup, denoising stage only, 1-step #14410

Closed

6 tasks

mickqian mentioned this pull request Dec 31, 2025

[diffusion] feat: support lightweight e2e warmup for benchmarking #16213

Merged

5 tasks

mickqian closed this in #16213 Jan 2, 2026

fsygd wants to merge 5 commits into sgl-project:main from fsygd:run-benchmark-with-warmup
