docs_new/cookbook/autoregressive/InternLM/Intern-S1.mdx
import { InternS1Deployment } from '/src/snippets/autoregressive/intern-s1-deployment.jsx';
Intern-S1 includes the large Intern-S1 MoE model and the smaller Intern-S1-mini dense model. The command generator below covers BF16 and FP8 serving on NVIDIA H100/H200/B200/B300 platforms.
Refer to the official SGLang installation guide, or install from source:
uv pip install 'git+https://github.com/sgl-project/sglang.git#subdirectory=python'
Pass `--attention-backend flashinfer` to select the FlashInfer attention backend. Add `--reasoning-parser interns1` and `--tool-call-parser interns1` when your workload needs structured reasoning or tool-call parsing.
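To show how these flags combine, here is a minimal Python sketch that assembles a launch command string. It is not the command generator used on this page. The helper name `build_launch_command`, the `internlm/Intern-S1-mini` model path, and the tensor-parallel size are assumptions for illustration. Only the `flashinfer` backend and `interns1` parser values come from the text above.

```python
import shlex


def build_launch_command(
    model_path,
    tp_size=1,
    attention_backend=None,
    reasoning=False,
    tool_calls=False,
):
    """Assemble an SGLang launch command (hypothetical helper, for illustration)."""
    args = [
        "python", "-m", "sglang.launch_server",
        "--model-path", model_path,
        "--tp", str(tp_size),
    ]
    # Optional flag: choose the attention backend explicitly.
    if attention_backend:
        args += ["--attention-backend", attention_backend]
    # Optional parsers: enable only when the workload needs them.
    if reasoning:
        args += ["--reasoning-parser", "interns1"]
    if tool_calls:
        args += ["--tool-call-parser", "interns1"]
    # shlex.join quotes any argument that is unsafe for a POSIX shell.
    return shlex.join(args)


if __name__ == "__main__":
    # Assumed model path and TP size; adjust to your checkpoint and hardware.
    print(
        build_launch_command(
            "internlm/Intern-S1-mini",
            tp_size=1,
            attention_backend="flashinfer",
            reasoning=True,
            tool_calls=True,
        )
    )
```

Keeping the parsers behind boolean switches mirrors the guidance above: they are opt-in and can be left off for plain text generation.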