-
Notifications
You must be signed in to change notification settings - Fork 4.5k
Closed
Description
Hi sglang team! We want to use Mooncake nvlink transport for PD disaggregation deployment on the same node.
According to the PR #11321, we should set the following env:
export SGLANG_MOONCAKE_CUSTOM_MEM_POOL=True
export MC_FORCE_MNNVL=True
But in the PR #7264, we see that the SGLANG_MOONCAKE_CUSTOM_MEM_POOL is used for cross-node NVLink transport.
Motivation
This PR adds an env var option: SGLANG_MOONCAKE_CUSTOM_MEM_POOL, if it is set to true, mooncake will initialize the kvcache in a custom memory pool to enable cross-node NVLink transport.
This feature requires mooncake-transfer-engine >= 0.3.3.post2
That is to say, if we use PD disaggregation on the same node with mooncake nvlink transport, we just need to set:
export MC_FORCE_MNNVL=True
Right?
Thanks for your help!
Reactions are currently unavailable
Metadata
Metadata
Assignees
Labels
No labels