Skip to content

[ModelOpt] Introduce VLLM_MAX_TOKENS_PER_EXPERT_FP4_MOE env var to control blockscale tensor allocation#18160

Merged
vllm-bot merged 4 commits intovllm-project:mainfrom
pavanimajety:fix-env-max-toks-p-exp
May 23, 2025
Merged

[ModelOpt] Introduce VLLM_MAX_TOKENS_PER_EXPERT_FP4_MOE env var to control blockscale tensor allocation#18160
vllm-bot merged 4 commits intovllm-project:mainfrom
pavanimajety:fix-env-max-toks-p-exp

Commits

Commits on May 14, 2025

Commits on May 16, 2025