Back to Chatgpt On Wechat

Models Overview

docs/models/index.mdx

2.1.22.3 KB
Original Source

CowAgent supports a wide range of mainstream large language models. Model interfaces live under the project's models/ directory. Beyond text chat, several providers also provide vision understanding, image generation, speech-to-text, text-to-speech, and embeddings — all of which can be invoked on demand in the Agent flow.

Capability Matrix

A snapshot of each provider's capabilities. "Text" refers to the main chat model; the remaining columns show which Agent capabilities the provider can power.

ProviderRepresentative ModelsTextVisionImage GenSTTTTSEmbedding
DeepSeekdeepseek-v4-flash / pro
MiniMaxMiniMax-M3
Claudeclaude-fable-5
Geminigemini-3.5-flash
OpenAIgpt-5.5, o-series
GLMglm-5.2, glm-5v-turbo
Qwenqwen3.7-plus
Doubaodoubao-seed-2.0 series
Kimikimi-k2.7-code
ERNIEernie-5.1
MiMomimo-v2.5-pro / v2.5
LinkAI100+ models from multiple providers
CustomLocal models / third-party proxies
<Tip> Every capability in the Web console (Vision / Image / STT / TTS / Embedding / Web Search) can be configured independently with its own provider and model — there is no forced binding between them. </Tip>

How to Configure

Option 1 (recommended): Manage models and capabilities online via the Web console, with no need to edit the configuration file:

Option 2: Edit config.json manually and fill in the model name and API key for the selected provider. Every model also supports OpenAI-compatible access — just set bot_type to openai and configure open_ai_api_base and open_ai_api_key.