Hi, thank you for your awesome contributions to open source!
I'd like to benchmark vocabulary pruning for speculative decoding (#3822, https://docs.sglang.ai/backend/speculative_decoding.html#EAGLE-2-Decoding-via-Frequency-Ranked-Speculative-Sampling). Which of the benchmarks listed at https://docs.sglang.ai/references/benchmark_and_profiling.html would be the most appropriate to use?
In other words, which benchmarks would you want to see before merging a PR that improves on the existing method?