
Normalize router weights in MoE OP #72

Merged

kzawora-intel merged 1 commit into HabanaAI:habana_main from jkaniecki:Normalise_router_weights on Jun 26, 2024

Conversation

@jkaniecki

Adds router weight normalization to improve Mixtral accuracy.
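For context, this kind of change amounts to renormalizing the selected top-k router scores so they sum to 1 before the expert outputs are combined. A minimal PyTorch sketch of the idea follows; it is illustrative only, not the actual HPU MoE OP from this PR, and the function name and signature are hypothetical.

```python
import torch

def route_tokens(router_logits: torch.Tensor,
                 top_k: int = 2,
                 renormalize: bool = True):
    """Return per-token expert indices and (optionally renormalized) routing weights.

    Illustrative sketch only; not the vLLM HPU kernel.
    """
    # Softmax over all experts, then keep the top-k scores per token.
    scores = torch.softmax(router_logits, dim=-1, dtype=torch.float32)
    topk_weights, topk_ids = torch.topk(scores, top_k, dim=-1)
    if renormalize:
        # Router weight normalization: make the kept top-k weights sum to 1,
        # as Mixtral expects when mixing expert outputs.
        topk_weights = topk_weights / topk_weights.sum(dim=-1, keepdim=True)
    return topk_weights, topk_ids
```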


@szutenberg szutenberg left a comment


LGTM! Please also create a PR to the habana_next branch. Thanks!

@szutenberg szutenberg requested a review from kzawora-intel June 26, 2024 07:24
@kzawora-intel kzawora-intel merged commit 2728599 into HabanaAI:habana_main Jun 26, 2024
michalkuligowski added a commit that referenced this pull request Jan 15, 2025
remove expert_max hard code (#47)
vLLM-Ext: Full enabling of ALiBi (#34)
Add version inference via setuptools-scm (#58)
Revert "vLLM-Ext: Full enabling of ALiBi (#34)" (#59)
Remove punica_hpu.py from vllm_hpu_extension (#66)
Removed previous (not-pipelined) pa implementation (#72)
Add flag to enable running softmax in fp32 (#71)
Update calibration readme link (#73)
allow lm_head quantization in calibration process (#65)
Pad to bmin if value is less (#67)
Update pyproject.toml (#75)

---------

Co-authored-by: Michał Kuligowski <mkuligowski@habana.ai>
mfylcek added a commit that referenced this pull request Jan 21, 2025
remove expert_max hard code (#47)
vLLM-Ext: Full enabling of ALiBi (#34)
Add version inference via setuptools-scm (#58)
Revert "vLLM-Ext: Full enabling of ALiBi (#34)" (#59)
Remove punica_hpu.py from vllm_hpu_extension (#66)
Removed previous (not-pipelined) pa implementation (#72)
Add flag to enable running softmax in fp32 (#71)
Update calibration readme link (#73)
allow lm_head quantization in calibration process (#65)
Pad to bmin if value is less (#67)
Update pyproject.toml (#75)

---------

Co-authored-by: Michał Kuligowski <mkuligowski@habana.ai>
ranzhejiang pushed a commit to ranzhejiang/vllm-fork that referenced this pull request Apr 11, 2025
Enable DeepseekV2 Lite/Chat models (HabanaAI#516)