docs/en/models/linkai.mdx
A single linkai_api_key gives you access to all capabilities of mainstream vendors such as OpenAI, Claude, Gemini, DeepSeek, MiniMax, Qwen, Kimi, and Doubao.
{
"use_linkai": true,
"linkai_api_key": "YOUR_API_KEY"
}
| Parameter | Description |
|---|---|
use_linkai | Set to true to enable |
linkai_api_key | Create one in the Console |
model | Can be any code from the model list |
See Model Service for more.
Once configured, the Agent's Vision tool automatically calls multimodal models via the gateway, with no extra setup required. To manually specify a Vision model:
{
"tools": {
"vision": {
"model": "gpt-5.4-mini"
}
}
}
Available models: gpt-4.1-mini, gpt-5.4-mini, qwen3.6-plus, doubao-seed-2-0-pro-260215, kimi-k2.6, claude-sonnet-4-6, gemini-3.1-flash-lite-preview, etc.
{
"skills": {
"image-generation": {
"model": "gpt-image-2"
}
}
}
| Model ID | Alias |
|---|---|
gpt-image-2 | OpenAI |
gemini-3.1-flash-image-preview | Nano Banana 2 |
gemini-3-pro-image-preview | Nano Banana Pro |
seedream-5.0-lite | ByteDance Doubao Seedream |
{
"voice_to_text": "linkai"
}
ASR uses Whisper by default; credentials are automatically reused from linkai_api_key.
The TTS gateway supports multiple underlying engines. The engine is selected by text_to_voice_model, and the available voices change with the engine.
{
"text_to_voice": "linkai",
"text_to_voice_model": "doubao",
"tts_voice_id": "BV001_streaming"
}
text_to_voice_model | Engine |
|---|---|
tts-1 | OpenAI · Multi-language (voices like alloy / nova / echo, etc.) |
doubao | ByteDance Doubao · Rich Chinese voices |
baidu | Baidu · Chinese broadcaster voices |
Voices differ by engine; we recommend selecting them visually in the Web Console under "Model Management → Text-to-Speech".
{
"embedding_provider": "linkai",
"embedding_model": "text-embedding-3-small"
}
The default model is text-embedding-3-small (OpenAI-compatible). After changing the embedding, run /memory rebuild-index to rebuild the index.