<p align="center">
[English] | [<a href="docs/zh/README.md">中文</a>] | [<a href="docs/ja/README.md">日本語</a>]
</p>

CowAgent is an open-source super AI assistant that proactively plans tasks, controls your computer and external services, creates and runs Skills, and grows alongside you through a personal knowledge base and long-term memory. It is a reference implementation of Agent Harness engineering.
CowAgent is lightweight, easy to deploy, and built to extend. Plug in any major LLM provider and run it 24/7 on a personal computer or server, across the web and all major IM platforms.
<p align="center"> <a href="https://cowagent.ai/">Website</a> · <a href="https://docs.cowagent.ai/intro/index">Docs</a> · <a href="https://docs.cowagent.ai/guide/quick-start">Quick Start</a> · <a href="https://skills.cowagent.ai/">Skill Hub</a> · <a href="https://link-ai.tech/cowagent/create">Try Online</a> </p>

| Capability | Description |
|---|---|
| Planning | Decomposes complex tasks and executes them step by step, looping over tools until the goal is reached |
| Memory | Three-tier architecture (context → daily → core), automatic Deep Dream distillation, hybrid keyword + vector retrieval |
| Knowledge | Auto-curates structured knowledge into a Markdown wiki, builds an evolving knowledge graph with visual browsing |
| Skills | One-click install from Skill Hub, GitHub, ClawHub; or create custom skills via natural-language conversation |
| Tools | Built-in file I/O, terminal, browser, scheduler, memory retrieval, web search, and 10+ more tools, with native MCP integration |
| Channels | Integrates with Web, WeChat, Feishu, DingTalk, WeCom, QQ, Official Accounts, Telegram, and Slack |
| Multimodal | First-class support for text, images, voice, and files: recognition, generation, and delivery |
| Models | Claude, GPT, Gemini, DeepSeek, Qwen, GLM, Kimi, MiniMax, Doubao, and more; swap providers from the Web console with one click |
| Deploy | One-line installer, unified Web console, multiple deployment modes (local, Docker, server) |
CowAgent is a complete Agent Harness: messages flow in through Channels; the Agent Core plans and reasons over memory, knowledge, and the available tools and skills; Models generate the response, which is sent back through the originating channel. Every layer is decoupled and independently extensible.
Read more in Architecture.
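The flow described above can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not CowAgent's actual code: `Message`, `AgentCore`, `dispatch`, and `echo_model` are hypothetical names, and the "model" is a toy function standing in for a real LLM call.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List


@dataclass
class Message:
    channel: str  # name of the channel the message arrived on
    text: str


@dataclass
class AgentCore:
    # Hypothetical stand-in: a model is any callable from prompt text to reply text.
    model: Callable[[str], str]
    memory: List[str] = field(default_factory=list)

    def handle(self, msg: Message) -> str:
        # Build a prompt from the last few remembered messages plus the new one.
        context = "\n".join(self.memory[-3:])
        prompt = f"{context}\n{msg.text}".strip()
        reply = self.model(prompt)
        self.memory.append(msg.text)
        return reply


def dispatch(msg: Message, core: AgentCore,
             senders: Dict[str, Callable[[str], None]]) -> None:
    """Send the reply back through the channel the message came from."""
    senders[msg.channel](core.handle(msg))


def echo_model(prompt: str) -> str:
    # Toy model: repeat the last line of the prompt.
    return "echo: " + prompt.splitlines()[-1]


# Each channel gets its own outbox; "sending" just appends to it.
outbox: Dict[str, List[str]] = {"web": [], "telegram": []}
senders = {name: box.append for name, box in outbox.items()}

core = AgentCore(model=echo_model)
dispatch(Message("telegram", "hello"), core, senders)
dispatch(Message("web", "status?"), core, senders)
```

Because the core only sees a `Message` and a model callable, a new channel or a different model can be swapped in without touching the other layers, which is the decoupling the paragraph refers to.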
A one-line installer takes care of dependencies, configuration, and startup:
Linux / macOS:
bash <(curl -fsSL https://cdn.link-ai.tech/code/cow/run.sh)
Windows (PowerShell):
irm https://cdn.link-ai.tech/code/cow/run.ps1 | iex
Docker:
curl -O https://cdn.link-ai.tech/code/cow/docker-compose.yml
docker compose up -d
Once started, open http://localhost:9899 to access the Web console, your one-stop hub to chat with the Agent, configure models, connect channels, and install skills.
Deploying on a server? Set `web_host` to `0.0.0.0` in `config.json` to make the console reachable from outside, and set `web_password` to protect it. Don't forget to open port `9899` in your firewall or security group.
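If you prefer to script that change, the helper below is one possible sketch. It assumes `config.json` is a flat JSON object and that you pass the path yourself. `expose_console` is a hypothetical name, not part of the `cow` CLI.

```python
import json
from pathlib import Path


def expose_console(config_path: Path, password: str) -> dict:
    """Set web_host and web_password in config.json, leaving other keys untouched."""
    if config_path.exists():
        config = json.loads(config_path.read_text(encoding="utf-8"))
    else:
        config = {}
    config["web_host"] = "0.0.0.0"     # listen on all network interfaces
    config["web_password"] = password  # protect the console with a password
    config_path.write_text(
        json.dumps(config, indent=2, ensure_ascii=False), encoding="utf-8"
    )
    return config
```

A call such as `expose_console(Path("config.json"), "choose-a-strong-password")` rewrites the file in place; restart the service afterwards so the new settings are picked up.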
Detailed guides: Quick Start · Install from Source · Upgrade
After installation, manage the service with the cow CLI:
cow start | stop | restart # service control
cow status | logs # status and logs
cow update # pull latest code and restart
cow skill install <name> # install a skill
cow install-browser # install browser automation
CowAgent supports all mainstream LLM providers. Chat, vision, image generation, ASR/TTS, and embeddings can each be routed to a different vendor. Providers are configured directly in the Web console, with no manual file editing required.
| Provider | Featured Models | Chat | Vision | Image Gen | ASR | TTS | Embedding |
|---|---|---|---|---|---|---|---|
| Claude | claude-opus-4-8 | ✅ | ✅ | | | | |
| OpenAI | gpt-5.5, o-series | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ |
| Gemini | gemini-3.5-flash | ✅ | ✅ | ✅ | | | |
| DeepSeek | deepseek-v4-flash / pro | ✅ | | | | | |
| Qwen | qwen3.7-max | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ |
| GLM | glm-5.1, glm-5v-turbo | ✅ | ✅ | ✅ | ✅ | | |
| Doubao | doubao-seed-2.0 series | ✅ | ✅ | ✅ | ✅ | | |
| Kimi | kimi-k2.6 | ✅ | ✅ | | | | |
| MiniMax | MiniMax-M2.7 | ✅ | ✅ | ✅ | ✅ | | |
| ERNIE | ernie-5.1 | ✅ | ✅ | | | | |
| MiMo | mimo-v2.5 / pro | ✅ | ✅ | ✅ | | | |
| LinkAI | One key for 100+ models | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ |
| Custom | Local models / third-party proxy | ✅ | | | | | |

For details on each provider, see the Models overview.
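Per-capability routing can be pictured as a small lookup table. The sketch below is illustrative only: the capability keys and `build_routes` are hypothetical, and the provider pairing is an example rather than a recommended setup. In CowAgent itself this is configured from the Web console.

```python
from typing import Dict

# Hypothetical capability keys mirroring the table columns.
CAPABILITIES = ("chat", "vision", "image_gen", "asr", "tts", "embedding")


def build_routes(default: str, overrides: Dict[str, str]) -> Dict[str, str]:
    """Map every capability to a provider, falling back to a default provider."""
    unknown = set(overrides) - set(CAPABILITIES)
    if unknown:
        raise ValueError(f"unknown capabilities: {sorted(unknown)}")
    return {cap: overrides.get(cap, default) for cap in CAPABILITIES}


# Example pairing: one default vendor, with chat and embeddings routed elsewhere.
routes = build_routes("OpenAI", {"chat": "DeepSeek", "embedding": "Qwen"})
```

Keeping a single default plus explicit overrides means a capability is never left without a provider when only one or two vendors are swapped.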
A single Agent instance can serve multiple channels in parallel. Most channels can be onboarded right from the Web console.
| Channel | Text | Image | File | Voice | Group |
|---|---|---|---|---|---|
| Web Console (default) | ✅ | ✅ | ✅ | ✅ | |
| Telegram | ✅ | ✅ | ✅ | ✅ | ✅ |
| Slack | ✅ | ✅ | ✅ | ✅ | |
| Discord | ✅ | ✅ | ✅ | ✅ | |
| | ✅ | ✅ | ✅ | ✅ | |
| Feishu / Lark | ✅ | ✅ | ✅ | ✅ | ✅ |
| DingTalk | ✅ | ✅ | ✅ | ✅ | ✅ |
| WeCom Bot | ✅ | ✅ | ✅ | ✅ | ✅ |
| | ✅ | ✅ | ✅ | ✅ | |
| WeCom App | ✅ | ✅ | ✅ | ✅ | |
| WeChat Customer Service | ✅ | ✅ | ✅ | ✅ | |
| WeChat Official Account | ✅ | ✅ | ✅ | | |

See the Channels overview for setup details.
The Web console is the default channel and the unified entry point to configure models, channels, skills, memory, and more.
Long-term memory uses a three-tier architecture: conversation context (short-term) → daily memory (mid-term) → MEMORY.md (long-term). A nightly Deep Dream pass distills scattered memories into refined long-term entries and a narrative journal. See Long-term Memory · Deep Dream.
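To make the three tiers concrete, here is a toy model of how notes might move between them. It is a sketch under strong assumptions: `MemoryStore`, `remember`, and `distill` are hypothetical names, the long-term tier is a plain list standing in for MEMORY.md, and the "distillation" is simple de-duplication rather than the model-driven Deep Dream pass.

```python
from collections import defaultdict
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class MemoryStore:
    # Short-term: notes from the current conversation.
    context: List[str] = field(default_factory=list)
    # Mid-term: notes grouped by day.
    daily: Dict[str, List[str]] = field(default_factory=lambda: defaultdict(list))
    # Long-term: stands in for MEMORY.md.
    core: List[str] = field(default_factory=list)

    def remember(self, day: str, note: str) -> None:
        self.context.append(note)
        self.daily[day].append(note)

    def distill(self, day: str) -> None:
        """Toy nightly pass: promote the day's unique notes, then clear the context."""
        for note in dict.fromkeys(self.daily[day]):  # de-duplicate, keep order
            if note not in self.core:
                self.core.append(note)
        self.context.clear()


store = MemoryStore()
store.remember("2026-06-01", "prefers dark mode")
store.remember("2026-06-01", "prefers dark mode")
store.remember("2026-06-01", "project deadline is Friday")
store.distill("2026-06-01")
```

After `distill`, `store.core` holds each note once and the short-term context is empty, while the daily log still keeps the raw entries.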
Personal knowledge base complements the time-ordered memory by organizing structured knowledge by topic. The Agent automatically curates valuable information from conversations, maintains cross-references and indexes, and the Web console offers an interactive knowledge-graph view. See Personal Knowledge Base.
<table> <tr> <td width="50%"> <p align="center"><em>Long-term Memory · Three-tier architecture + Deep Dream</em></p>
</td>
<td width="50%">
<p align="center"><em>Knowledge Base · Auto-curated Markdown wiki</em></p>
</td>
</tr> </table>
Tools are atomic capabilities the Agent uses to interact with system resources. Skills are higher-level workflows defined by a manifest file that compose multiple tools to accomplish complex tasks.
Built-in tools cover file I/O (read / write / edit / ls), terminal (bash), file sending (send), memory retrieval (memory), environment variables (env_config), web fetching (web_fetch), scheduling (scheduler), web search (web_search), vision (vision), and browser automation (browser).
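The relationship between tools and skills can be illustrated with a toy registry. Everything here is hypothetical: the tool names `upper` and `exclaim` are placeholders rather than CowAgent's built-in tools, and a real skill manifest carries more than an ordered list of tool names.

```python
from typing import Callable, Dict, List

# Hypothetical tool registry: each tool is an atomic callable from text to text.
TOOLS: Dict[str, Callable[[str], str]] = {
    "upper": lambda text: text.upper(),
    "exclaim": lambda text: text + "!",
}


def run_skill(steps: List[str], payload: str) -> str:
    """Run a skill, modelled here as an ordered list of tool names."""
    for name in steps:
        if name not in TOOLS:
            raise KeyError(f"skill references unknown tool: {name}")
        payload = TOOLS[name](payload)
    return payload


shout = ["upper", "exclaim"]       # a toy "skill manifest"
result = run_skill(shout, "done")  # each tool's output feeds the next tool
```

The point of the split is that tools stay small and reusable, while a skill only describes which tools to chain and in what order.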
MCP support integrates the open ecosystem of Model Context Protocol servers. A single mcp.json is enough: it supports stdio and SSE transports, hot reload, and zero-code integration.
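As a rough idea of what such a file might contain, the snippet below builds a configuration in Python and prints it as JSON. The layout is an assumption borrowed from the `mcpServers` convention used by several MCP clients. The server names, the URL, and the `transport_of` helper are placeholders, so check the MCP integration docs for the keys CowAgent actually expects.

```python
import json

# Assumed layout (not confirmed for CowAgent): one entry per server under "mcpServers".
mcp_config = {
    "mcpServers": {
        "filesystem": {  # stdio transport: the client spawns a local process
            "command": "npx",
            "args": ["-y", "@modelcontextprotocol/server-filesystem", "/tmp"],
        },
        "remote-example": {  # SSE transport: the client connects to a URL (placeholder)
            "url": "http://localhost:8000/sse",
        },
    }
}


def transport_of(server: dict) -> str:
    """Guess the transport from which keys are present in a server entry."""
    if "command" in server:
        return "stdio"
    if "url" in server:
        return "sse"
    raise ValueError("server entry needs either 'command' or 'url'")


print(json.dumps(mcp_config, indent=2))
```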
Learn more: Tools overview · MCP integration.
skill-creator; turn any workflow or third-party API into a reusable skill
/skill list # list installed skills
/skill search <keyword> # search the marketplace
/skill install <name> # one-click install
Learn more: Skills overview · Creating Skills.
2026.06.01: v2.1.0 - Internationalization, new channels (Telegram, Discord, Slack, WeChat Customer Service), CLI interaction upgrades, streamlined one-line install, MCP Streamable HTTP support, new models (claude-opus-4-8, MiMo).
2026.05.22: v2.0.9 - Model management, MCP protocol support, persistent browser sessions, new models (gpt-5.5, gemini-3.5-flash, qwen3.7-max), deployment hardening.
2026.05.06: v2.0.8 - Feishu channel overhaul (voice, streaming, QR onboarding), DeepSeek V4 and Baidu Qianfan support, scheduler tool upgrades.
2026.04.22: v2.0.7 - Built-in image generation (GPT Image 2, Nano Banana), new models (Kimi K2.6, Claude Opus 4.7, GLM 5.1), memory and knowledge enhancements.
2026.04.14: v2.0.6 - Knowledge base, Deep Dream memory distillation, smart context compression, multi-session Web console.
2026.04.01: v2.0.5 - Cow CLI, Skill Hub open source, browser tool, WeCom Bot QR onboarding.
2026.02.03: v2.0.0 - Major upgrade to a super Agent assistant with multi-step task planning, long-term memory, and the Skills framework.
Full history: Release Notes
File an issue on GitHub, or scan the QR code below to join our WeChat community:
LinkAI is an all-in-one AI Agent platform for enterprises and developers, offering managed hosting and enterprise-grade support for CowAgent:
For enterprise inquiries: [email protected] or scan the QR code to reach our team on WeChat.
Contributions are welcome: add a new channel by following the Feishu channel reference, or contribute new skills to Skill Hub.
Star the project to follow updates, and feel free to open PRs and Issues.
This project was previously named chatgpt-on-wechat and is now officially CowAgent. The old GitHub URL redirects automatically; existing users may optionally run git remote set-url origin https://github.com/zhayujie/CowAgent.git to update the local remote.