Models Overview - Chatgpt On Wechat

CowAgent supports a wide range of mainstream large language models. Model interfaces live under the project's models/ directory. Beyond text chat, several vendors also provide vision understanding, image generation, speech-to-text (STT), text-to-speech (TTS), and embeddings, each of which the Agent can invoke on demand.

Capability Matrix

A snapshot of each vendor's capabilities. "Text" refers to the main chat model; the remaining columns show which Agent capabilities the vendor can power.

Vendor	Representative Models	Text	Vision	Image Gen	STT	TTS	Embedding
DeepSeek	deepseek-v4-flash / pro	✅
MiniMax	MiniMax-M2.7	✅	✅	✅		✅
Claude	claude-opus-4-8	✅	✅
Gemini	gemini-3.5-flash	✅	✅	✅
OpenAI	gpt-5.5, o-series	✅	✅	✅	✅	✅	✅
GLM	glm-5.1, glm-5v-turbo	✅	✅		✅		✅
Qwen	qwen3.7-max	✅	✅	✅	✅	✅	✅
Doubao	doubao-seed-2.0 series	✅	✅	✅			✅
Kimi	kimi-k2.6	✅	✅
ERNIE	ernie-5.1	✅	✅
MiMo	mimo-v2.5-pro / v2.5	✅	✅			✅
LinkAI	100+ models from multiple vendors	✅	✅	✅	✅	✅	✅
Custom	Local models / third-party proxies	✅

<Tip> Every capability in the Web console (Vision / Image / STT / TTS / Embedding / Web Search) can be configured independently with its own vendor and model — there is no forced binding between them. </Tip>
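
As a rough illustration of reading the matrix, the sketch below transcribes the table into a Python dictionary and looks up which vendors can power a given capability. The names `CAPABILITIES`, `ALL_CAPABILITIES`, and `vendors_for` are hypothetical helpers written for this example; they are not part of CowAgent.

```python
# Capability matrix transcribed from the table above (illustrative only;
# these names are hypothetical and not part of CowAgent).
ALL_CAPABILITIES = {"text", "vision", "image_gen", "stt", "tts", "embedding"}

CAPABILITIES = {
    "DeepSeek": {"text"},
    "MiniMax": {"text", "vision", "image_gen", "tts"},
    "Claude": {"text", "vision"},
    "Gemini": {"text", "vision", "image_gen"},
    "OpenAI": set(ALL_CAPABILITIES),
    "GLM": {"text", "vision", "stt", "embedding"},
    "Qwen": set(ALL_CAPABILITIES),
    "Doubao": {"text", "vision", "image_gen", "embedding"},
    "Kimi": {"text", "vision"},
    "ERNIE": {"text", "vision"},
    "MiMo": {"text", "vision", "tts"},
    "LinkAI": set(ALL_CAPABILITIES),
    "Custom": {"text"},
}


def vendors_for(capability):
    """Return the vendors (in table order) that can power `capability`."""
    if capability not in ALL_CAPABILITIES:
        raise ValueError(f"unknown capability: {capability}")
    return [vendor for vendor, caps in CAPABILITIES.items() if capability in caps]


if __name__ == "__main__":
    # Because capabilities are configured independently, the chat model and
    # the STT model may come from different vendors.
    print("STT vendors:", vendors_for("stt"))
    print("TTS vendors:", vendors_for("tts"))
```

A lookup like this can help when picking a separate vendor for each capability, for example a text-only vendor for chat alongside a different vendor for speech.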

How to Configure

Option 1 (recommended): Manage models and capabilities online in the Web console. No edits to the configuration file are needed.

Option 2: Edit config.json manually and fill in the model name and API key for the selected vendor. Every model also supports OpenAI-compatible access: set bot_type to openai, then configure open_ai_api_base and open_ai_api_key.
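
A minimal sketch of the OpenAI-compatible setup described in Option 2, written as a Python helper that builds the relevant config.json fields. The keys bot_type, open_ai_api_base, and open_ai_api_key come from the text above; the `model` key name, the helper function, the endpoint URL, and the API key value are assumptions or placeholders for illustration.

```python
import json


def build_openai_compatible_config(model, api_base, api_key):
    """Build the config.json fields for OpenAI-compatible access.

    Hypothetical helper for illustration; not part of CowAgent.
    """
    return {
        "bot_type": "openai",          # route requests through the OpenAI-compatible interface
        "model": model,                # assumed key name for the model to call
        "open_ai_api_base": api_base,  # base URL of the vendor or proxy endpoint
        "open_ai_api_key": api_key,    # API key issued by that vendor or proxy
    }


config = build_openai_compatible_config(
    model="deepseek-v4-flash",              # any model name from the matrix
    api_base="https://api.example.com/v1",  # placeholder endpoint
    api_key="YOUR_API_KEY",                 # placeholder; never commit real keys
)

if __name__ == "__main__":
    # Print the fragment so it can be merged into config.json by hand.
    print(json.dumps(config, indent=2))
```

Printing the fragment rather than writing the file directly avoids overwriting other settings that an existing config.json may already contain.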