Skip to content

Fix quantization and nightly tests#4258

Merged
merrymercy merged 4 commits intomainfrom
pr-fix-compressed-fp8
Mar 10, 2025
Merged

Fix quantization and nightly tests#4258
merrymercy merged 4 commits intomainfrom
pr-fix-compressed-fp8

Conversation

@merrymercy
Copy link
Contributor

@merrymercy merrymercy commented Mar 10, 2025

Monkey patch functions like AWQMoEMethod from vllm to align the arguments

python -m sglang.launch_server --model-path cognitivecomputations/DeepSeek-V3-AWQ --tp-size 8

@merrymercy merrymercy merged commit 00d25a7 into main Mar 10, 2025
22 of 23 checks passed
@merrymercy merrymercy deleted the pr-fix-compressed-fp8 branch March 10, 2025 10:06
aoshen524 pushed a commit to aoshen524/sglang that referenced this pull request Mar 10, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant