Back to Chatgpt On Wechat

LinkAI

docs/en/models/linkai.mdx

2.0.92.6 KB
Original Source

A single linkai_api_key gives you access to all capabilities of mainstream vendors such as OpenAI, Claude, Gemini, DeepSeek, MiniMax, Qwen, Kimi, and Doubao.

<Tip> All capabilities below can be configured in one place via the "Model Management" page in the Web Console, with no need to manually edit the configuration file. </Tip>

Text Chat

json
{
  "use_linkai": true,
  "linkai_api_key": "YOUR_API_KEY"
}
ParameterDescription
use_linkaiSet to true to enable
linkai_api_keyCreate one in the Console
modelCan be any code from the model list

See Model Service for more.

Image Understanding

Once configured, the Agent's Vision tool automatically calls multimodal models via the gateway, with no extra setup required. To manually specify a Vision model:

json
{
  "tools": {
    "vision": {
      "model": "gpt-5.4-mini"
    }
  }
}

Available models: gpt-4.1-mini, gpt-5.4-mini, qwen3.6-plus, doubao-seed-2-0-pro-260215, kimi-k2.6, claude-sonnet-4-6, gemini-3.1-flash-lite-preview, etc.

Image Generation

json
{
  "skills": {
    "image-generation": {
      "model": "gpt-image-2"
    }
  }
}
Model IDAlias
gpt-image-2OpenAI
gemini-3.1-flash-image-previewNano Banana 2
gemini-3-pro-image-previewNano Banana Pro
seedream-5.0-liteByteDance Doubao Seedream

Speech-to-Text (ASR)

json
{
  "voice_to_text": "linkai"
}

ASR uses Whisper by default; credentials are automatically reused from linkai_api_key.

Text-to-Speech (TTS)

The TTS gateway supports multiple underlying engines. The engine is selected by text_to_voice_model, and the available voices change with the engine.

json
{
  "text_to_voice": "linkai",
  "text_to_voice_model": "doubao",
  "tts_voice_id": "BV001_streaming"
}
text_to_voice_modelEngine
tts-1OpenAI · Multi-language (voices like alloy / nova / echo, etc.)
doubaoByteDance Doubao · Rich Chinese voices
baiduBaidu · Chinese broadcaster voices

Voices differ by engine; we recommend selecting them visually in the Web Console under "Model Management → Text-to-Speech".

Embedding

json
{
  "embedding_provider": "linkai",
  "embedding_model": "text-embedding-3-small"
}

The default model is text-embedding-3-small (OpenAI-compatible). After changing the embedding, run /memory rebuild-index to rebuild the index.