docs/providers/deepinfra.md
DeepInfra provides a unified API that routes requests to the most popular open source and frontier models behind a single endpoint and API key. It is OpenAI-compatible, so most OpenAI SDKs work by switching the base URL.
openclaw onboard --deepinfra-api-key <key>
Or set the environment variable:
export DEEPINFRA_API_KEY="<your-deepinfra-api-key>" # pragma: allowlist secret
{
env: { DEEPINFRA_API_KEY: "<your-deepinfra-api-key>" }, // pragma: allowlist secret
agents: {
defaults: {
model: { primary: "deepinfra/deepseek-ai/DeepSeek-V4-Flash" },
},
},
}
The bundled plugin registers all DeepInfra surfaces that match current
OpenClaw provider contracts. Chat, image generation, and video generation
refresh their model catalogues live from /v1/openai/models?sort_by=openclaw&filter=with_meta
when DEEPINFRA_API_KEY is configured; the other surfaces use the curated
static defaults below.
| Surface | Default model | OpenClaw config/tool |
|---|---|---|
| Chat / model provider | first chat-tagged entry from live catalog (manifest fallback deepseek-ai/DeepSeek-V4-Flash) | agents.defaults.model |
| Image generation/editing | first image-gen-tagged entry from live catalog (static fallback black-forest-labs/FLUX-1-schnell) | image_generate, agents.defaults.imageGenerationModel |
| Media understanding | moonshotai/Kimi-K2.5 for images | inbound image understanding |
| Speech-to-text | openai/whisper-large-v3-turbo | inbound audio transcription |
| Text-to-speech | hexgrad/Kokoro-82M | messages.tts.provider: "deepinfra" |
| Video generation | first video-gen-tagged entry from live catalog (static fallback Pixverse/Pixverse-T2V) | video_generate, agents.defaults.videoGenerationModel |
| Memory embeddings | BAAI/bge-m3 | agents.defaults.memorySearch.provider: "deepinfra" |
DeepInfra also exposes reranking, classification, object-detection, and other native model types. OpenClaw does not currently have first-class provider contracts for those categories, so this plugin does not register them yet.
OpenClaw dynamically discovers available DeepInfra models at startup. Use
/models deepinfra to see the full list of models available.
Any model available on DeepInfra.com can be used with the deepinfra/ prefix:
deepinfra/deepseek-ai/DeepSeek-V4-Flash
deepinfra/deepseek-ai/DeepSeek-V3.2
deepinfra/MiniMaxAI/MiniMax-M2.5
deepinfra/moonshotai/Kimi-K2.5
deepinfra/nvidia/NVIDIA-Nemotron-3-Super-120B-A12B
deepinfra/zai-org/GLM-5.1
...and many more
deepinfra/<provider>/<model> (e.g., deepinfra/Qwen/Qwen3-Max).deepinfra/deepseek-ai/DeepSeek-V4-Flashhttps://api.deepinfra.com/v1/openaihttps://api.deepinfra.com/v1/inference/<model>.