-
Notifications
You must be signed in to change notification settings - Fork 3.7k
Description
Checklist
- 1. I have searched related issues but cannot get the expected help.
- 2. The bug has not been fixed in the latest version.
- 3. Please note that if the bug-related issue you submitted lacks corresponding environment info and a minimal reproducible demo, it will be challenging for us to reproduce and resolve the issue, reducing the likelihood of receiving feedback.
- 4. If the issue you raised is not a bug but a question, please raise a discussion at https://github.com/sgl-project/sglang/discussions/new/choose Otherwise, it will be closed.
- 5. Please use English, otherwise it will be closed.
Describe the bug
An exception occurs when initializing the scheduler process while loading the DeepSeek-V2 model. The error originates from the load_weights call in deepseek_v2.py, where maybe_executor_submit() is invoked with unexpected arguments.
Reproduction
python -m sglang.launch_server --model-path /modelpath/DeepSeek-V3.2/Exp-FP4 --served-model-name DeepSeek-V3.2-Exp-FP4 --tp 8 --dp 8 --enable-dp-attention --reasoning-parser deepseek-v3 --kv-cache-dtype fp8_e4m3 --modelopt-quant nvfp4 --trust-remote-code --disable-radix-cache --speculative-algorithm EAGLE --speculative-num-steps 3 --speculative-eagle-topk 1 --speculative-num-draft-tokens 4 --max-prefill-tokens 8192 --grammar-backend xgrammar --chat-template /sgl-workspace/sglang/examples/chat_template/tool_chat_template_deepseekv32.jinja
Environment
docker pull lmsysorg/sglang:latest , built in Nov 10
sha256:3bf326f18348e9f33b70462a92d764f04dc6a44a34d2413a661c07d5c67d845c