-
Notifications
You must be signed in to change notification settings - Fork 4.6k
Closed
Description
I expect the time we spend on "verify" part should be close to a normal decode forward (less than 100ms, my setting is bs=16 and ctx=12k), but now it takes about 400ms. It slows down my output throughput severely. Seems like a kernel performance issue?
The commit I test:
commit 4a05bdf (gh/main)
Author: Lianmin Zheng lianminzheng@gmail.com
Date: Sun Mar 9 18:53:33 2025 -0700
Revert "Check eagle server args" (#4242)
Originally posted by @jokerwyt in #3582 (comment)
Reactions are currently unavailable
Metadata
Metadata
Assignees
Labels
No labels

