Back to Chatgpt On Wechat

Models Overview

docs/en/models/index.mdx

2.0.92.7 KB
Original Source

CowAgent supports mainstream large language models from both Chinese and overseas vendors. Model interfaces are implemented under the project's models/ directory. In addition to text chat, some vendors also provide vision understanding, image generation, speech-to-text, text-to-speech, and embedding capabilities, which can be invoked on demand in the Agent flow.

<Note> The following models are recommended in Agent mode; choose based on quality and cost: deepseek-v4-flash, MiniMax-M2.7, claude-sonnet-4-6, gemini-3.5-flash, glm-5.1, qwen3.6-plus, kimi-k2.6, ernie-5.1.

LinkAI is also supported, letting you switch between multiple vendors with a single key while gaining knowledge bases, workflows, and plugins. </Note>

Capability Matrix

A snapshot of each vendor's capabilities. "Text" refers to the main chat model; the remaining columns indicate which Agent capabilities the vendor can handle.

VendorRepresentative ModelsTextImage UnderstandingImage GenerationSpeech-to-TextText-to-SpeechEmbedding
DeepSeekdeepseek-v4-flash / pro
MiniMaxMiniMax-M2.7
Claudeclaude-opus-4-7
Geminigemini-3.5-flash
OpenAIgpt-5.5, o-series
Zhipu GLMglm-5.1, glm-5v-turbo
Tongyi Qwenqwen3.7-max
Doubaodoubao-seed-2.0 series
Kimikimi-k2.6
Baidu Qianfanernie-5.1
LinkAI100+ models from multiple vendors
CustomLocal models / third-party proxies
<Tip> Every capability in the Web Console (Vision / Image / Speech-to-Text / Text-to-Speech / Embedding / Web Search) can be configured independently with its own vendor and model; they are not forced to be bound together. </Tip>

How to Configure

Option 1 (recommended): Manage models and capabilities online via the Web Console, with no need to edit the configuration file:

Option 2: Manually edit config.json and fill in the model name and API key according to the selected model. Every model also supports OpenAI-compatible access: set bot_type to openai and configure open_ai_api_base and open_ai_api_key.