Back to Weknora

README

README.md

0.6.017.7 KB
Original Source
<p align="center"> <picture> </picture> </p> <p align="center"> <picture> <a href="https://trendshift.io/repositories/15289" target="_blank">
</a>
</picture> </p> <p align="center"> <a href="https://weknora.weixin.qq.com" target="_blank">
</a>
<a href="https://chatbot.weixin.qq.com" target="_blank">
    
</a>
<a href="https://chromewebstore.google.com/detail/jpemjbopikggjlmikmclgbmkhhopjdgd" target="_blank">
    
</a>
<a href="https://clawhub.ai/lyingbug/weknora" target="_blank">
    
</a>
<a href="https://github.com/Tencent/WeKnora/blob/main/LICENSE">
    
</a>
<a href="./CHANGELOG.md">
    
</a>
</p> <p align="center"> | <b>English</b> | <a href="./README_CN.md"><b>็ฎ€ไฝ“ไธญๆ–‡</b></a> | <a href="./README_JA.md"><b>ๆ—ฅๆœฌ่ชž</b></a> | <a href="./README_KO.md"><b>ํ•œ๊ตญ์–ด</b></a> | </p> <p align="center"> <h4 align="center">

Overview โ€ข Architecture โ€ข Key Features โ€ข Getting Started โ€ข API Reference โ€ข Developer Guide

</h4> </p>

๐Ÿ’ก WeKnora โ€” Turn Documents into Living Knowledge with RAG, Agents and Auto-Wiki

๐Ÿ“Œ Overview

WeKnora is an open-source, LLM-powered knowledge framework built for enterprise-grade document understanding, semantic retrieval, and autonomous reasoning.

It is organized around three core capabilities: RAG-based Quick Q&A for everyday lookups, a ReAct Agent that autonomously orchestrates retrieval, MCP tools and web search to handle complex multi-step tasks, and a brand-new Wiki Mode in which agents distill raw documents into a self-maintaining, interlinked markdown knowledge base with an interactive knowledge graph. Combined with multi-source ingestion (Feishu / Notion / Yuque, and growing), 20+ LLM provider integrations, full Langfuse observability, enterprise-ready multi-tenant RBAC (4-tier role matrix + per-resource ownership + per-tenant audit log), and a fully self-hostable modular architecture, WeKnora turns scattered documents into a queryable, reasoning-capable, continuously evolving knowledge asset.

The framework supports auto-syncing knowledge from Feishu, Notion, and Yuque (more data sources coming soon), handles 10+ document formats including PDF, Word, images, and Excel, and can serve Q&A directly through IM channels like WeCom, Feishu, Slack, and Telegram. It is compatible with major LLM providers including OpenAI, DeepSeek, Qwen (Alibaba Cloud), Zhipu, Hunyuan, Gemini, MiniMax, NVIDIA, and Ollama. Its fully modular design allows swapping LLMs, vector databases, and storage backends, with support for local and private cloud deployment ensuring complete data sovereignty. WeKnora also integrates with Langfuse for comprehensive observability into agent reasoning, token usage, and pipeline tracing.

โœจ Latest Updates

  • v0.6.0 โ€” Tenant RBAC (4-tier role matrix Owner / Admin / Contributor / Viewer + per-KB ownership + per-tenant audit log), tenant member management & multi-workspace UX, self-service workspaces; weknora CLI v0.4 GA with mcp serve; KB retrieval fan-out across vector stores; AES-256-GCM credential encryption + docreader gRPC TLS + Token; Zhipu embedder + Huawei OBS; server-side user preferences; Go 1.26.0. See docs/RBAC่ฏดๆ˜Ž.md and CHANGELOG.md.
  • v0.5.2 โ€” Wiki ingest scales to 40k-document KBs (task queue + DLQ); MCP human-in-the-loop tool approval; Anthropic / Apache Doris / Tencent VectorDB / KS3 / SearXNG backends; adaptive 3-tier chunking with live preview; global โŒ˜K command palette; Yuque connector + WeChat Mini Program; weknora CLI preview.
  • v0.5.1 โ€” Knowledge-base batch management; tenant-wide IM channels overview; session search + user-scoped pinning; unified Model / Web Search / MCP settings cards; per-agent LLM timeout; desktop tenant switching.
  • v0.5.0 โ€” Wiki Mode GA โ€” agents auto-generate structured, interlinked Markdown wiki pages with a knowledge graph; wiki browser + visual graph in the UI.
  • v0.4.0 โ€” WeKnora Cloud (hosted LLM + parsing); Chrome Extension; ClawHub Skill; WeChat IM; attachment processing; Azure OpenAI / Alibaba OSS; Notion connector; Baidu + Ollama web search; VectorStore management.
  • v0.3.6 โ€” ASR (audio); Feishu data-source auto-sync; OIDC; IM quote-reply context + thread-based sessions; document summarization; Tavily search; parallel tool calling; agent @mention scope restriction.
  • v0.3.5 โ€” Telegram / DingTalk / Mattermost IM; IM slash commands + QA queue; suggested questions; VLM auto-describe MCP tool images; Novita AI; channel tracking.
  • v0.3.4 โ€” WeCom / Feishu / Slack IM; multimodal image support; NVIDIA model API; Weaviate; AWS S3; AES-256-GCM API-key encryption; built-in MCP service; hybrid-search optimization; final_answer tool.
  • v0.3.3 โ€” Parent-child chunking; KB pinning; fallback response; passage cleaning for rerank; storage auto-creation; Milvus.
  • v0.3.2 โ€” Knowledge Search entry; per-source parser & storage engine config; image rendering in local storage; document preview; Volcengine TOS; Mermaid rendering; batch session management; memory graph preview.
  • v0.3.0 โ€” Shared Space; Agent Skills + sandboxed execution; custom agents; Data Analyst agent; thinking mode; Bing / Google web search; API Key auth; Helm chart; Korean i18n; Qdrant.
  • v0.2.0 โ€” Agent Mode (ReACT); multi-type knowledge bases (FAQ + document); conversation strategy config; DuckDuckGo web search; MCP tool integration; new UI with agent mode switching; MQ async task management.

๐Ÿ“ฑ Interface Showcase

<table> <tr> <td colspan="2" align="center"><b>๐Ÿ’ฌ Intelligent Q&A Conversation</b> </td> </tr> <tr> <td width="50%" align="center"><b>๐Ÿ“– Wiki Browser</b> </td> <td width="50%" align="center"><b>๐Ÿ•ธ๏ธ Wiki Knowledge Graph</b> </td> </tr> <tr> <td width="50%" align="center"><b>๐Ÿค– Agent Mode ยท Tool Call Process</b> </td> <td width="50%" align="center"><b>โš™๏ธ Conversation Settings</b> </td> </tr> <tr> <td colspan="2" align="center"><b>๐Ÿ”ญ Observability ยท Langfuse Tracing</b> </td> </tr> </table>

๐Ÿ—๏ธ Architecture

Fully modular pipeline from document parsing, vectorization, and retrieval to LLM inference โ€” every component is swappable and extensible. Supports local / private cloud deployment with full data sovereignty and a zero-barrier Web UI for quick onboarding.

๐Ÿงฉ Feature Overview

Intelligent Conversation

CapabilityDetails
Intelligent ReasoningReACT progressive multi-step reasoning, autonomously orchestrating knowledge retrieval, MCP tools, and web search; custom agent support
Quick Q&ARAG-based Q&A over knowledge bases for fast and accurate answers
Wiki ModeAgent-driven auto-generation of structured, interlinked markdown Wiki pages from raw documents
Tool CallingBuilt-in tools, MCP tools, web search
Conversation StrategyOnline Prompt editing, retrieval threshold tuning, multi-turn context awareness
Suggested QuestionsAuto-generated question suggestions based on knowledge base content

Knowledge Management

CapabilityDetails
Knowledge Base TypesFAQ / Document / Wiki with folder import, URL import, tag management, and online entry
Data Source ImportAuto-sync from Feishu / Notion / Yuque (more data sources coming soon); incremental and full sync
Document FormatsPDF / Word / Txt / Markdown / HTML / Images / CSV / Excel / PPT / JSON
Retrieval StrategiesBM25 sparse / Dense retrieval / GraphRAG / parent-child chunking / multi-dimensional indexing
E2E TestingFull-pipeline visualization with recall hit rate, BLEU / ROUGE metric evaluation

Integrations & Extensions

CapabilityDetails
LLMsOpenAI / Azure OpenAI / Anthropic (Claude) / DeepSeek / Qwen (Alibaba Cloud) / Zhipu / Hunyuan / Doubao (Volcengine) / Gemini / MiniMax / NVIDIA / Novita AI / SiliconFlow / OpenRouter / Ollama
EmbeddingsOllama / BGE / GTE / Zhipu / OpenAI-compatible APIs
Vector DBsPostgreSQL (pgvector) / Elasticsearch / Milvus / Weaviate / Qdrant / Apache Doris / Tencent VectorDB
Object StorageLocal / MinIO / AWS S3 / Volcengine TOS / Alibaba Cloud OSS / Kingsoft Cloud KS3 / Huawei Cloud OBS
IM ChannelsWeCom / Feishu / Slack / Telegram / DingTalk / Mattermost / WeChat
Web SearchDuckDuckGo / Bing / Google / Tavily / Baidu / Ollama / SearXNG

Platform

CapabilityDetails
DeploymentLocal / Docker / Kubernetes (Helm) with private and offline support
UIWeb UI / RESTful API / CLI (weknora) / Chrome Extension / WeChat Mini Program
Access ControlTenant RBAC with 4-tier role matrix (Owner / Admin / Contributor / Viewer), per-KB resource ownership, per-tenant audit log, invite-only workspaces, self-service tenant creation, cross-tenant superuser
SecurityAES-256-GCM at-rest encryption for API keys and MCP / data-source credentials with graceful key rotation; gRPC TLS + Token between app and docreader; SSRF-safe HTTP client; sandbox isolation for agent skills
ObservabilityIntegrated Langfuse for ReAct loops, token tracking, tool calls, and pipeline tracing
Task ManagementMQ async tasks, automatic database migration on version upgrade
Model ManagementCentralized config, per-knowledge-base model selection, multi-tenant built-in model sharing, WeKnora Cloud hosted models and parsing

๐Ÿงฉ Chrome Extension

WeKnora Chrome Extension lets you capture web content directly into your WeKnora knowledge base. Select text, images, or entire pages in the browser and save them as knowledge entries with one click โ€” no copy-paste or file upload needed.

๐Ÿ“ฑ WeChat Mini Program

The WeKnora Mini Program provides a lightweight mobile client for configuring WeKnora API access, selecting knowledge bases, importing URLs, and asking knowledge chat from WeChat.

๐Ÿฆž ClawHub Skill

WeKnora ClawHub Skill is a WeKnora skill published on the ClawHub platform. Once installed, it enables document import (file / URL / Markdown), hybrid search (vector + keyword) across knowledge bases, and knowledge entry management โ€” all through the WeKnora REST API.

  • Document Import โ€” Upload files, import web pages, or write Markdown knowledge via the agent
  • Hybrid Search โ€” Search within or across knowledge bases with vector + keyword retrieval
  • Knowledge Management โ€” List, browse, edit, and delete knowledge entries programmatically

โŒจ๏ธ Command-Line Interface

weknora is the official CLI for driving the API from a terminal or AI agent. The command surface mirrors gh CLI's <noun> <verb> convention; output is human-readable by default and switches to a stable JSON envelope with --json.

bash
weknora auth login --host https://kb.example.com
weknora kb list
weknora link --kb my-knowledge-base    # bind the current directory
weknora doc upload notes.md
weknora chat "summarise the design doc"

See cli/README.md for install + 5-minute quickstart and cli/AGENTS.md for the operational contract that AI agents (Claude Code, Cursor, Aider, โ€ฆ) can rely on.

๐Ÿš€ Getting Started

๐Ÿ›  Prerequisites

๐Ÿ“ฆ Installation & Launch

bash
git clone https://github.com/Tencent/WeKnora.git
cd WeKnora
cp .env.example .env   # Edit .env as needed, see comments in the file
docker compose up -d   # Start core services

Once started, visit http://localhost to get started.

To use a local Ollama model, run ollama serve > /dev/null 2>&1 & first.

๐Ÿ”ง Optional Services (Docker Compose Profiles)

Add --profile flags to enable additional components. Multiple profiles can be combined:

ProfileDescriptionCommand
(default)Core servicesdocker compose up -d
fullAll featuresdocker compose --profile full up -d
neo4jKnowledge Graph (Neo4j)docker compose --profile neo4j up -d
minioObject Storage (MinIO)docker compose --profile minio up -d
langfuseTracing (Langfuse)docker compose --profile langfuse up -d

Combine profiles: docker compose --profile neo4j --profile minio up -d

Stop services: docker compose down

๐ŸŒ Service URLs

ServiceURL
Web UIhttp://localhost
Backend APIhttp://localhost:8080
Langfuse Tracinghttp://localhost:3000

MCP Server

Please refer to the MCP Configuration Guide for the necessary setup.

๐Ÿ”Œ Using WeChat Dialog Open Platform

WeKnora serves as the core technology framework for the WeChat Dialog Open Platform, providing a more convenient usage approach:

  • Zero-code Deployment: Simply upload knowledge to quickly deploy intelligent Q&A services within the WeChat ecosystem, achieving an "ask and answer" experience
  • Efficient Question Management: Support for categorized management of high-frequency questions, with rich data tools to ensure accurate, reliable, and easily maintainable answers
  • WeChat Ecosystem Integration: Through the WeChat Dialog Open Platform, WeKnora's intelligent Q&A capabilities can be seamlessly integrated into WeChat Official Accounts, Mini Programs, and other WeChat scenarios, enhancing user interaction experiences

๐Ÿ“˜ API Reference

Troubleshooting FAQ: Troubleshooting FAQ

Detailed API documentation is available at: API Docs

Product plans and upcoming features: Roadmap

๐Ÿงญ Developer Guide

If you need to frequently modify code, you don't need to rebuild Docker images every time! Use fast development mode:

bash
# Start infrastructure
make dev-start

# Start backend (new terminal)
make dev-app

# Start frontend (new terminal)
make dev-frontend

Development Advantages:

  • โœ… Frontend modifications auto hot-reload (no restart needed)
  • โœ… Backend modifications quick restart (5-10 seconds, supports Air hot-reload)
  • โœ… No need to rebuild Docker images
  • โœ… Support IDE breakpoint debugging

Detailed Documentation: Development Environment Quick Start

๐Ÿค Contributing

Welcome to submit Issues or Pull Requests.

Process: Fork โ†’ Create branch โ†’ Commit changes โ†’ Open PR

Standards: Format code with gofmt, follow Conventional Commits (feat: / fix: / docs: / test: / refactor:)

๐Ÿ”’ Security Notice

Important: Starting from v0.1.3, WeKnora includes login authentication functionality to enhance system security. For production deployments, we strongly recommend:

  • Deploy WeKnora services in internal/private network environments rather than public internet
  • Avoid exposing the service directly to public networks to prevent potential information leakage
  • Configure proper firewall rules and access controls for your deployment environment
  • Regularly update to the latest version for security patches and improvements

๐Ÿ‘ฅ Contributors

Thanks to these excellent contributors:

๐Ÿ“„ License

This project is licensed under the MIT License. You are free to use, modify, and distribute the code with proper attribution.

๐Ÿ“ˆ Project Statistics

<a href="https://www.star-history.com/#Tencent/WeKnora&type=date&legend=top-left"> <picture> <source media="(prefers-color-scheme: dark)" srcset="https://api.star-history.com/svg?repos=Tencent/WeKnora&type=date&theme=dark&legend=top-left" /> <source media="(prefers-color-scheme: light)" srcset="https://api.star-history.com/svg?repos=Tencent/WeKnora&type=date&legend=top-left" /> </picture> </a>