Skip to content

Fix missing padding_side=left for batch inference#1296

Open
huang7loong wants to merge 1 commit intoQwenLM:mainfrom
huang7loong:fix/batch_inference
Open

Fix missing padding_side=left for batch inference#1296
huang7loong wants to merge 1 commit intoQwenLM:mainfrom
huang7loong:fix/batch_inference

Conversation

@huang7loong
Copy link

This PR fixes an issue in the Batch Inference example in README.md, where the padding_side="left" parameter was missing.

For decoder-only LLMs (e.g., GPT-style models), it's important to set padding_side to "left" during batch inference. Without this setting, the model may behave unexpectedly or generate incorrect outputs, e.g., #759

This change ensures the example runs correctly and avoids confusion for users.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant