Back to Openclaw

DeepInfra

docs/providers/deepinfra.md

2026.5.274.0 KB
Original Source

DeepInfra provides a unified API that routes requests to the most popular open source and frontier models behind a single endpoint and API key. It is OpenAI-compatible, so most OpenAI SDKs work by switching the base URL.

Getting an API key

  1. Go to https://deepinfra.com/
  2. Sign in or create an account
  3. Navigate to Dashboard / Keys and generate a new API key or use the auto created one

CLI setup

bash
openclaw onboard --deepinfra-api-key <key>

Or set the environment variable:

bash
export DEEPINFRA_API_KEY="<your-deepinfra-api-key>" # pragma: allowlist secret

Config snippet

json5
{
  env: { DEEPINFRA_API_KEY: "<your-deepinfra-api-key>" }, // pragma: allowlist secret
  agents: {
    defaults: {
      model: { primary: "deepinfra/deepseek-ai/DeepSeek-V4-Flash" },
    },
  },
}

Supported OpenClaw surfaces

The bundled plugin registers all DeepInfra surfaces that match current OpenClaw provider contracts. Chat, image generation, and video generation refresh their model catalogues live from /v1/openai/models?sort_by=openclaw&filter=with_meta when DEEPINFRA_API_KEY is configured; the other surfaces use the curated static defaults below.

SurfaceDefault modelOpenClaw config/tool
Chat / model providerfirst chat-tagged entry from live catalog (manifest fallback deepseek-ai/DeepSeek-V4-Flash)agents.defaults.model
Image generation/editingfirst image-gen-tagged entry from live catalog (static fallback black-forest-labs/FLUX-1-schnell)image_generate, agents.defaults.imageGenerationModel
Media understandingmoonshotai/Kimi-K2.5 for imagesinbound image understanding
Speech-to-textopenai/whisper-large-v3-turboinbound audio transcription
Text-to-speechhexgrad/Kokoro-82Mmessages.tts.provider: "deepinfra"
Video generationfirst video-gen-tagged entry from live catalog (static fallback Pixverse/Pixverse-T2V)video_generate, agents.defaults.videoGenerationModel
Memory embeddingsBAAI/bge-m3agents.defaults.memorySearch.provider: "deepinfra"

DeepInfra also exposes reranking, classification, object-detection, and other native model types. OpenClaw does not currently have first-class provider contracts for those categories, so this plugin does not register them yet.

Available models

OpenClaw dynamically discovers available DeepInfra models at startup. Use /models deepinfra to see the full list of models available.

Any model available on DeepInfra.com can be used with the deepinfra/ prefix:

deepinfra/deepseek-ai/DeepSeek-V4-Flash
deepinfra/deepseek-ai/DeepSeek-V3.2
deepinfra/MiniMaxAI/MiniMax-M2.5
deepinfra/moonshotai/Kimi-K2.5
deepinfra/nvidia/NVIDIA-Nemotron-3-Super-120B-A12B
deepinfra/zai-org/GLM-5.1
...and many more

Notes

  • Model refs are deepinfra/<provider>/<model> (e.g., deepinfra/Qwen/Qwen3-Max).
  • Default model: deepinfra/deepseek-ai/DeepSeek-V4-Flash
  • Base URL: https://api.deepinfra.com/v1/openai
  • Native video generation uses https://api.deepinfra.com/v1/inference/<model>.