[diffusion] doc: add vae path to cli doc#14004 (sgl-project#14355)

baonudesifeizhai · BBuf · mickqian · tonyluj · commit 99f16805342e · 2025-12-05T12:24:14.000+08:00
Co-authored-by: BBuf &lt;1182563586@qq.com&gt;
Co-authored-by: Mick &lt;mickjagger19@icloud.com&gt;
diff --git a/python/sglang/multimodal_gen/docs/cli.md b/python/sglang/multimodal_gen/docs/cli.md
@@ -13,6 +13,7 @@ The SGLang-diffusion CLI provides a quick way to access the inference pipeline f
 ### Server Arguments
 
 - `--model-path {MODEL_PATH}`: Path to the model or model ID
+- `--vae-path {VAE_PATH}`: Path to a custom VAE model or HuggingFace model ID (e.g., `fal/FLUX.2-Tiny-AutoEncoder`). If not specified, the VAE will be loaded from the main model path.
 - `--num-gpus {NUM_GPUS}`: Number of GPUs to use
 - `--tp-size {TP_SIZE}`: Tensor parallelism size (only for the encoder; should not be larger than 1 if text encoder offload is enabled, as layer-wise offload plus prefetch is faster)
 - `--sp-size {SP_SIZE}`: Sequence parallelism size (typically should match the number of GPUs)