Skip to content

Fix bs=2 prefill bucketing weirdness #35

Merged
kzawora-intel merged 1 commit intomainfrom
private/kzawora/prefill_bucketing
Jul 17, 2025
Merged

Fix bs=2 prefill bucketing weirdness #35
kzawora-intel merged 1 commit intomainfrom
private/kzawora/prefill_bucketing

Conversation

@kzawora-intel
Copy link
Copy Markdown
Contributor

@kzawora-intel kzawora-intel commented Jul 17, 2025

ripped from: HabanaAI/vllm-fork#1606, fixes weird bucketing anomaly where bs=1 prefills would be padded to bs=2 and trigger a recompilation

Signed-off-by: Konrad Zawora <kzawora@habana.ai>
@kzawora-intel kzawora-intel force-pushed the private/kzawora/prefill_bucketing branch from e5e2414 to c4b4364 Compare July 17, 2025 08:24
@kzawora-intel kzawora-intel enabled auto-merge (squash) July 17, 2025 08:57
@kzawora-intel kzawora-intel disabled auto-merge July 17, 2025 08:58
@kzawora-intel kzawora-intel merged commit d1c0283 into main Jul 17, 2025
3 checks passed
@kzawora-intel kzawora-intel deleted the private/kzawora/prefill_bucketing branch July 28, 2025 10:15
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant