
Allow benchmarking each forward pass in e2e systems#4666

Open
fzyzcjy wants to merge 80 commits into sgl-project:main from fzyzcjy:feat/fine_grained_benchmark

Conversation

@fzyzcjy
Collaborator

@fzyzcjy fzyzcjy commented Mar 22, 2025

Motivation

On one hand, bench_one_batch is great, but it is somewhat buggy in complex scenarios (e.g. DeepSeek V3), partly because it bypasses all logic in the real schedulers. On the other hand, bench_serving is great, but it reports only a single e2e latency number, so we cannot see how that number breaks down. Therefore, I made this tiny PR to let bench_one_batch_server report extra numbers for each individual forward pass.

Example command

SGLANG_FINE_GRAINED_BENCHMARK_DIR=/tmp/sglang_fine_grained_benchmark python -m sglang.bench_one_batch_server --model-path deepseek-ai/DeepSeek-V2-Lite --trust-remote-code --tp 2 --dp 2 --enable-dp-attention --enable-deepep-moe --disable-cuda-graph --batch-size 4 16 64 256 --input-len 1024 --output-len 2 --port 5678

Example output

  forward_mode    throughput   latency  batch_size  num_tokens
0       EXTEND  18538.560266  0.110472           2        2048
1       DECODE     21.296126  0.093914           2           2
2       DECODE     28.449750  0.070299           2           2
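The numbers above come down to timing each forward pass and deriving throughput from token count and latency. Here is a minimal sketch of that idea; the function name `benchmark_forward` and the record fields are illustrative assumptions for this comment, not the PR's actual API (which dumps records under `SGLANG_FINE_GRAINED_BENCHMARK_DIR`):

```python
import time

def benchmark_forward(forward_fn, forward_mode, batch_size, num_tokens, records):
    """Hypothetical wrapper: time one forward pass and append one record.

    throughput is derived as num_tokens / latency (tokens per second),
    matching the columns in the example output table above.
    """
    start = time.perf_counter()
    output = forward_fn()  # run the actual forward pass
    latency = time.perf_counter() - start
    records.append({
        "forward_mode": forward_mode,   # e.g. "EXTEND" or "DECODE"
        "throughput": num_tokens / latency,
        "latency": latency,             # seconds for this single pass
        "batch_size": batch_size,
        "num_tokens": num_tokens,
    })
    return output
```

Collecting one such record per scheduler step, then loading them into a DataFrame, would reproduce a table like the example output.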

This PR is based on #4699, so please subtract that PR's code diff when reviewing this one.

Modifications

Checklist

@fzyzcjy
Collaborator Author

fzyzcjy commented Apr 1, 2025

Ping me when this PR is about to be merged - currently I have only resolved the conflicts in #4068, and will port the conflict resolution back here when pinged.
