docs_new/docs/sglang-diffusion/ring_sp_performance.mdx
This page reports Ring-SP performance for Wan2.2-TI2V-5B-Diffusers using:
sp=2, ulysses=1, ring=2 (short: u1r2)sp=1, ulysses=1, ring=1 (short: u1r1)Wan2.2-TI2V-5B-Diffusers48G RTX40 series * 2u1r2)sglang serve \
--model-type diffusion \
--model-path /model/HuggingFace/Wan-AI/Wan2.2-TI2V-5B-Diffusers \
--num-gpus 2 --sp-degree 2 --ulysses-degree 1 --ring-degree 2 \
--port 8898
u1r1)sglang serve \
--model-type diffusion \
--model-path /model/HuggingFace/Wan-AI/Wan2.2-TI2V-5B-Diffusers \
--num-gpus 1 --sp-degree 1 --ulysses-degree 1 --ring-degree 1 \
--port 8898
These benchmarks are provided for reference under one specific setup and command configuration. Actual performance may vary with model settings, runtime environment, and request patterns.
90.63s to 63.74s (1.42x).Denoising (1.36x) and Decoding (1.75x).Peak GPU Memory -7.33GB, Peak Allocated -7.05GB).+7.9pp), so future tuning can focus on reducing communication/runtime overhead while preserving the latency gain.