SGLang Cookbook

A community-maintained repository of practical guides and recipes for deploying and using SGLang in production environments. Our mission is simple: answer the question "How do I run model X with SGLang on hardware Y for task Z?" with clear, actionable solutions.

🎯 What You'll Find Here

This cookbook aggregates battle-tested SGLang recipes covering:

  • Models: Mainstream LLMs and Vision-Language Models (VLMs)
  • Use Cases: Inference serving, deployment strategies, multimodal applications
  • Hardware: GPU and CPU configurations, optimization for different accelerators
  • Best Practices: Configuration templates, performance tuning, troubleshooting guides

Each recipe provides step-by-step instructions to help you quickly implement SGLang solutions for your specific requirements.

Guides

Autoregressive Models

Qwen

  • Qwen3.5 <span style={{backgroundColor: '#fde8e2', color: '#C5602D', padding: '2px 8px', borderRadius: '4px', fontSize: '12px', fontWeight: 'normal', marginLeft: '8px'}}>NEW</span>
  • Qwen3
  • Qwen3-Next
  • Qwen3-VL
  • Qwen3-Coder
  • Qwen3-Coder-Next <span style={{backgroundColor: '#fde8e2', color: '#C5602D', padding: '2px 8px', borderRadius: '4px', fontSize: '12px', fontWeight: 'normal', marginLeft: '8px'}}>NEW</span>
  • Qwen2.5-VL

DeepSeek

Llama

GLM

  • GLM-Glyph
  • GLM-5 <span style={{backgroundColor: '#fde8e2', color: '#C5602D', padding: '2px 8px', borderRadius: '4px', fontSize: '12px', fontWeight: 'normal', marginLeft: '8px'}}>NEW</span>
  • GLM-OCR <span style={{backgroundColor: '#fde8e2', color: '#C5602D', padding: '2px 8px', borderRadius: '4px', fontSize: '12px', fontWeight: 'normal', marginLeft: '8px'}}>NEW</span>
  • GLM-4.5
  • GLM-4.5V
  • GLM-4.6
  • GLM-4.6V
  • GLM-4.7
  • GLM-4.7-Flash <span style={{backgroundColor: '#fde8e2', color: '#C5602D', padding: '2px 8px', borderRadius: '4px', fontSize: '12px', fontWeight: 'normal', marginLeft: '8px'}}>NEW</span>

OpenAI

Moonshotai

  • Kimi-K2.6 <span style={{backgroundColor: '#fde8e2', color: '#C5602D', padding: '2px 8px', borderRadius: '4px', fontSize: '12px', fontWeight: 'normal', marginLeft: '8px'}}>NEW</span>
  • Kimi-K2.5
  • Kimi-K2
  • Kimi-Linear

MiniMax

  • MiniMax-M2
  • MiniMax-M2.5 <span style={{backgroundColor: '#fde8e2', color: '#C5602D', padding: '2px 8px', borderRadius: '4px', fontSize: '12px', fontWeight: 'normal', marginLeft: '8px'}}>NEW</span>

NVIDIA

Ernie

InternVL

InternLM

Jina AI

Mistral

Xiaomi

FlashLabs

  • Chroma 1.0 <span style={{backgroundColor: '#fde8e2', color: '#C5602D', padding: '2px 8px', borderRadius: '4px', fontSize: '12px', fontWeight: 'normal', marginLeft: '8px'}}>NEW</span>

StepFun

  • Step-3.5-Flash <span style={{backgroundColor: '#fde8e2', color: '#C5602D', padding: '2px 8px', borderRadius: '4px', fontSize: '12px', fontWeight: 'normal', marginLeft: '8px'}}>NEW</span>
  • Step3-VL-10B <span style={{backgroundColor: '#fde8e2', color: '#C5602D', padding: '2px 8px', borderRadius: '4px', fontSize: '12px', fontWeight: 'normal', marginLeft: '8px'}}>NEW</span>

InclusionAI

  • Ling-2.5-1T <span style={{backgroundColor: '#fde8e2', color: '#C5602D', padding: '2px 8px', borderRadius: '4px', fontSize: '12px', fontWeight: 'normal', marginLeft: '8px'}}>NEW</span>
  • Ring-2.5-1T <span style={{backgroundColor: '#fde8e2', color: '#C5602D', padding: '2px 8px', borderRadius: '4px', fontSize: '12px', fontWeight: 'normal', marginLeft: '8px'}}>NEW</span>
  • LLaDA-2.1 <span style={{backgroundColor: '#fde8e2', color: '#C5602D', padding: '2px 8px', borderRadius: '4px', fontSize: '12px', fontWeight: 'normal', marginLeft: '8px'}}>NEW</span>

Diffusion Models

FLUX

Qwen-Image

Wan

Z-Image

Benchmarks

Reference

🚀 Quick Start

  1. Browse the recipe index above to find your model
  2. Follow the step-by-step instructions in each guide
  3. Adapt configurations to your specific hardware and requirements
  4. Join our community to share feedback and improvements
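As a concrete illustration of steps 1–3: most recipes boil down to launching an SGLang server for your model and sending it OpenAI-compatible requests. A minimal sketch (the model name and port here are placeholders; each recipe lists the recommended flags for its model and hardware):

```shell
# Launch an SGLang server (model path and extra flags vary per recipe)
python -m sglang.launch_server \
  --model-path Qwen/Qwen3-8B \
  --port 30000

# In another shell, query the OpenAI-compatible chat endpoint
curl http://localhost:30000/v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{"model": "Qwen/Qwen3-8B", "messages": [{"role": "user", "content": "Hello"}]}'
```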

🤝 Contributing

We believe the best documentation comes from practitioners. Whether you've optimized SGLang for a specific model, solved a tricky deployment challenge, or discovered performance improvements, we encourage you to contribute your recipes!

Ways to contribute:

  • Add a new recipe for a model not yet covered
  • Improve existing recipes with additional tips or configurations
  • Report issues or suggest enhancements
  • Share your production deployment experiences

To contribute:

<CodeGroup>
```bash Contribute a Recipe
# Fork the repo and clone locally
git clone https://github.com/YOUR_USERNAME/sglang-cookbook.git
cd sglang-cookbook

# Create a new branch
git checkout -b add-my-recipe

# Add your recipe following the template in DeepSeek-V3.2
# Submit a PR!
```
</CodeGroup>

🛠️ Local Development

Prerequisites

  • Node.js >= 20.0
  • npm or yarn

Setup and Run

Install dependencies and start the development server:

<CodeGroup>
```bash Local Development
# Install dependencies
npm install

# Start development server (hot reload enabled)
npm start
```
</CodeGroup>

The site will automatically open in your browser at http://localhost:3000.

📖 Resources

📄 License

This project is licensed under the Apache License 2.0 - see the LICENSE file for details.


Let's build this resource together! 🚀 Star the repo and contribute your recipes to help the SGLang community grow.