Skip to content

[question] appropriate benchmarks #8391

@keyboardAnt

Description

@keyboardAnt

Hi, thank you for your awesome contributions to open source!

I'd like to benchmark speculative decoding vocabulary pruning (#3822, https://docs.sglang.ai/backend/speculative_decoding.html#EAGLE-2-Decoding-via-Frequency-Ranked-Speculative-Sampling). What would be the most appropriate benchmarks from https://docs.sglang.ai/references/benchmark_and_profiling.html to use?

In other words, what benchmarks would you like to see before merging a PR that includes an improvement over the existing method?

Metadata

Metadata

Assignees

No one assigned

    Labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions