
Update Force Channel FP8 Check#1561

Merged
yiliu30 merged 2 commits into habana_main from update-force-fp8-check
Jul 11, 2025

Conversation


@yiliu30 yiliu30 commented Jul 10, 2025

Starting with 1.22, INC also supports dynamic quantization. To make things smoother for users, and after discussing with @xuechendi, we plan to update how we handle the force channel FP8 check:

  • If the user provides a QUANT_CONFIG, we assume the intention is to use INC for either dynamic or static quantization.
  • If no QUANT_CONFIG is provided but the user passes an FP8 model, the workflow defaults to the built-in dynamic quantization path without INC.

cc @thuang6 @czhu15 @yangulei

Signed-off-by: yiliu30 <yi4.liu@intel.com>
Comment thread vllm/envs.py
@xuechendi

/run-gaudi-tests

@yiliu30 yiliu30 requested a review from czhu15 July 11, 2025 01:31

@czhu15 czhu15 left a comment


LGTM

Comment thread vllm/envs.py
@yiliu30 yiliu30 merged commit 7205441 into habana_main Jul 11, 2025
53 checks passed
@yiliu30 yiliu30 deleted the update-force-fp8-check branch July 11, 2025 03:32
michalkuligowski pushed a commit that referenced this pull request Jul 15, 2025
Porting #1561

Signed-off-by: yiliu30 <yi4.liu@intel.com>

4 participants