docs_new/cookbook/diffusion/intro.mdx
Offline models generate each image or video request as a bounded denoising job. Most of them use bidirectional attention over the whole latent sequence.
<CardGroup cols={3}> <Card title="Cosmos" mode="card" href="/cookbook/diffusion/Cosmos/Cosmos3" img="/cards/logos/nvidia.png" /> <Card title="FLUX" mode="card" href="/cookbook/diffusion/FLUX/FLUX" img="/cards/logos/flux.png" /> <Card title="Ideogram 4" mode="card" href="/cookbook/diffusion/Ideogram/Ideogram4" img="/cards/logos/ideogram.png" /> <Card title="Wan" mode="card" href="/cookbook/diffusion/Wan/Wan2.2" img="/cards/logos/wan.png" /> <Card title="LTX" mode="card" href="/cookbook/diffusion/LTX/LTX2 & LTX2.3" img="/cards/logos/ltx.svg" /> <Card title="Qwen-Image" mode="card" href="/cookbook/diffusion/Qwen-Image/Qwen-Image" img="/cards/logos/qwen.png" /> <Card title="Z-Image" mode="card" href="/cookbook/diffusion/Z-Image/Z-Image-Turbo" img="/cards/logos/zimage.png" /> <Card title="ERNIE-Image" mode="card" href="/cookbook/diffusion/Ernie-Image/Ernie-Image" img="/cards/logos/ernie.png" /> <Card title="MOVA" mode="card" href="/cookbook/diffusion/MOVA/MOVA" img="/cards/logos/mova.png" /> </CardGroup>Realtime models keep a session alive and generate chunk by chunk with causal state, control signals, and cached video history.
<CardGroup cols={3}> <Card title="LingBot World (Realtime)" mode="card" href="/cookbook/diffusion/LingBot-World/LingBot-World" img="/cards/logos/inclusionai.png" /> <Card title="SANA-WM" mode="card" href="/cookbook/diffusion/SANA-WM/SANA-WM" img="/cards/logos/sana.png" /> </CardGroup>