-
Notifications
You must be signed in to change notification settings - Fork 4.5k
[CPU] Optimize Qwen3-next model on CPU #12525
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Merged
Kangyan-Zhou
merged 50 commits into
sgl-project:main
from
jianan-gu:qwen-next-cpu-frontend
Jan 30, 2026
Merged
Changes from all commits
Commits
Show all changes
50 commits
Select commit
Hold shift + click to select a range
626ec0a
add CPU optimized frontend for qwen3-next
jianan-gu 7d7fa12
minor fix
jianan-gu b1472a1
memory pool changes for amx conv
jianan-gu 6be8b13
add TP padding for qwen3-next on CPU
jianan-gu 5564da4
Merge branch 'main' into qwen-next-cpu-frontend
jianan-gu 7ee14bb
Merge branch 'main' into qwen-next-cpu-frontend
jianan-gu 13571bd
fix lint
jianan-gu fef27aa
Merge remote-tracking branch 'origin/main' into qwen-next-cpu-frontend
jianan-gu 167a01d
Merge branch 'main' into qwen-next-cpu-frontend
jianan-gu bf1e05d
Merge remote-tracking branch 'origin/main' into qwen-next-cpu-frontend
jianan-gu 0d1559d
rebase with latest kernels
jianan-gu 4ed825a
Merge remote-tracking branch 'upstream/main' into qwen-next-cpu-frontend
jianan-gu eeebbb2
Update python/sglang/srt/layers/attention/intel_amx_backend.py
jianan-gu c032b1e
refine codes
jianan-gu ad1d6e2
Merge branch 'main' into qwen-next-cpu-frontend
jianan-gu 51fc77d
Merge branch 'main' into qwen-next-cpu-frontend
jianan-gu dd8aa35
Merge branch 'main' into qwen-next-cpu-frontend
FlamingoPg ffb443b
Merge branch 'main' into qwen-next-cpu-frontend
FlamingoPg d476fcd
Merge branch 'main' into qwen-next-cpu-frontend
jianan-gu 8a6dae6
minor fix after rebase
jianan-gu 742ea26
refine mamba apis
jianan-gu 188db21
Merge branch 'main' into qwen-next-cpu-frontend
jianan-gu 855d43e
Merge branch 'main' into qwen-next-cpu-frontend
FlamingoPg 82ee88c
Merge branch 'main' into qwen-next-cpu-frontend
FlamingoPg 75ef3be
Merge remote-tracking branch 'upstream/main' into qwen-next-cpu-frontend
jianan-gu 24d3543
Merge remote-tracking branch 'upstream/main' into qwen-next-cpu-frontend
jianan-gu 421dbaa
Merge remote-tracking branch 'upstream/main' into qwen-next-cpu-frontend
jianan-gu c40ae9d
adjust mamba cache after rebase
jianan-gu c777aa3
minor refinements
jianan-gu cbf8adb
final minor refinements
jianan-gu b987530
format
jianan-gu 210adff
Merge branch 'main' into qwen-next-cpu-frontend
jianan-gu 79b4a70
Merge remote-tracking branch 'upstream/main' into qwen-next-cpu-frontend
jianan-gu 5f89a7c
Merge remote-tracking branch 'origin/main' into qwen-next-cpu-frontend
jianan-gu 9520d90
rebase api
jianan-gu e008c0a
refine api
jianan-gu 9fb3ddf
Merge remote-tracking branch 'origin/main' into qwen-next-cpu-frontend
jianan-gu 5b80c61
Merge remote-tracking branch 'origin/main' into qwen-next-cpu-frontend
jianan-gu 9cc7b92
format after rebase
jianan-gu 2c5309a
minor refinements
jianan-gu 242f479
Merge remote-tracking branch 'origin/main' into qwen-next-cpu-frontend
jianan-gu 1a97f90
refinements per reviews
jianan-gu 2b19634
Merge branch 'main' into qwen-next-cpu-frontend
jianan-gu 37fa1fb
Merge remote-tracking branch 'origin/main' into qwen-next-cpu-frontend
jianan-gu fdfac34
Merge branch 'main' into qwen-next-cpu-frontend
jianan-gu 9837203
Merge remote-tracking branch 'origin/main' into qwen-next-cpu-frontend
jianan-gu 7be31cb
minor refine after rebase
jianan-gu 19f1eda
Merge branch 'main' into qwen-next-cpu-frontend
jianan-gu 2e5a1c1
Merge branch 'main' into qwen-next-cpu-frontend
jianan-gu fd1eb27
Merge branch 'main' into qwen-next-cpu-frontend
jianan-gu File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.
Oops, something went wrong.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Uh oh!
There was an error while loading. Please reload this page.