[question] appropriate benchmarks

Hi, thank you for your awesome contributions to open source!

I'd like to benchmark speculative decoding vocabulary pruning (https://github.com/sgl-project/sglang/pull/3822, https://docs.sglang.ai/backend/speculative_decoding.html#EAGLE-2-Decoding-via-Frequency-Ranked-Speculative-Sampling). What would be the most appropriate benchmarks from https://docs.sglang.ai/references/benchmark_and_profiling.html to use?

In other words, what benchmarks would you like to see before merging a PR that includes an improvement over the existing method?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[question] appropriate benchmarks #8391

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

[question] appropriate benchmarks #8391

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions