docs/en/models/index.mdx
CowAgent supports mainstream large language models from both Chinese and overseas vendors. Model interfaces are implemented under the project's models/ directory. In addition to text chat, some vendors also provide vision understanding, image generation, speech-to-text, text-to-speech, and embedding capabilities, which can be invoked on demand in the Agent flow.
<Note>LinkAI is also supported, letting you switch between multiple vendors with a single key while gaining knowledge bases, workflows, and plugins.</Note>
A snapshot of each vendor's capabilities. "Text" refers to the main chat model; the remaining columns indicate which Agent capabilities the vendor can handle.
| Vendor | Representative Models | Text | Image Understanding | Image Generation | Speech-to-Text | Text-to-Speech | Embedding |
|---|---|---|---|---|---|---|---|
| DeepSeek | deepseek-v4-flash / pro | ✅ | | | | | |
| MiniMax | MiniMax-M2.7 | ✅ | ✅ | ✅ | ✅ | | |
| Claude | claude-opus-4-7 | ✅ | ✅ | | | | |
| Gemini | gemini-3.5-flash | ✅ | ✅ | ✅ | | | |
| OpenAI | gpt-5.5, o-series | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ |
| Zhipu GLM | glm-5.1, glm-5v-turbo | ✅ | ✅ | ✅ | ✅ | | |
| Tongyi Qwen | qwen3.7-max | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ |
| Doubao | doubao-seed-2.0 series | ✅ | ✅ | ✅ | ✅ | | |
| Kimi | kimi-k2.6 | ✅ | ✅ | | | | |
| Baidu Qianfan | ernie-5.1 | ✅ | ✅ | | | | |
| LinkAI | 100+ models from multiple vendors | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ |
| Custom | Local models / third-party proxies | ✅ | | | | | |
Option 1 (recommended): Manage models and capabilities online in the Web Console. You do not need to edit the configuration file.
Option 2: Manually edit config.json and fill in the model name and API key according to the selected model. Every model also supports OpenAI-compatible access: set bot_type to openai and configure open_ai_api_base and open_ai_api_key.
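
As a rough illustration of the OpenAI-compatible route in Option 2, a config.json fragment might look like the sketch below. The keys bot_type, open_ai_api_base, and open_ai_api_key come from the text above. The model key name, the base URL, and the key value are placeholders assumed for this example, so check them against your vendor's documentation and the project's configuration reference before use.

```json
{
  "bot_type": "openai",
  "model": "deepseek-v4-flash",
  "open_ai_api_base": "https://your-provider.example/v1",
  "open_ai_api_key": "YOUR_API_KEY"
}
```

Here the model name is taken from the table above. Replace it with whichever model your OpenAI-compatible endpoint actually serves.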