Expose --qparams_algorithm CLI arg for quantization parameter selection by rhn19 · Pull Request #227 · huggingface/optimum-executorch

rhn19 · 2026-04-13T06:52:43Z

Fixes #226

Problem

quantize_model_() accepts a qparams_algorithm parameter, but there is no CLI argument to pass it through. Users cannot control which algorithm is used for computing quantization parameters during export.

Changes

Add --qparams_algorithm CLI argument (choices=["affine", "hqq_scale_only"])
Wire it through all task loaders (causal_lm, masked_lm, asr, multimodal_text_to_text) to quantize_model_()

Usage

optimum-cli export executorch \
   --model google/gemma-3-1b-it \
   --task text-generation --recipe xnnpack \
   --qlinear 8da4w --qparams_algorithm affine \
   --output_dir output/

No behavior change when --qparams_algorithm is not specified.

Expose --qparams_algorithm CLI arg for quantization parameter selection

d32d345

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Expose --qparams_algorithm CLI arg for quantization parameter selection#227

Expose --qparams_algorithm CLI arg for quantization parameter selection#227
rhn19 wants to merge 1 commit intohuggingface:mainfrom
rhn19:fix/qat-default-qparams-algorithm

rhn19 commented Apr 13, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

rhn19 commented Apr 13, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Problem

Changes

Usage

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

rhn19 commented Apr 13, 2026 •

edited

Loading