docs_new/docs/sglang-diffusion/index.mdx
SGLang Diffusion is a high-performance inference framework for image and video generation. It provides native SGLang pipelines, diffusers backend support, an OpenAI-compatible server, and an optimized kernel stack built on both precompiled sgl-kernel operators and JIT kernels for key inference paths.
sgl-kernel, JIT kernels, scheduler improvements, and caching accelerationsglang generate, sglang serve, and an OpenAI-compatible APIuv pip install "sglang[diffusion]" --prerelease=allow
sglang generate --model-path Qwen/Qwen-Image \
--prompt "A beautiful sunset over the mountains" \
--save-output
sglang serve --model-path Qwen/Qwen-Image --port 30010