FAQ

Basic Concepts

What is OpenViking? What problems does it solve?

OpenViking is an open-source context database designed specifically for AI Agents. It solves core pain points when building AI Agents:

Fragmented Context: Memories, resources, and skills are scattered everywhere, difficult to manage uniformly
Poor Retrieval Effectiveness: Traditional RAG's flat storage lacks global view, making it hard to understand complete context
Unobservable Context: Implicit retrieval chains are like black boxes, difficult to debug when errors occur
Limited Memory Iteration: Lacks Agent-related task memory and self-evolution capabilities

OpenViking unifies all context management through a filesystem paradigm, enabling tiered delivery and self-iteration.

What's the fundamental difference between OpenViking and traditional vector databases?

Dimension	Traditional Vector DB	OpenViking
Storage Model	Flat vector storage	Hierarchical filesystem (AGFS)
Retrieval Method	Single vector similarity search	Directory recursive retrieval + Intent analysis + Rerank
Output Format	Raw chunks	Structured context (L0 Abstract/L1 Overview/L2 Details)
Memory Capability	Not supported	Built-in 6 memory categories with auto-extraction and iteration
Observability	Black box	Fully traceable retrieval trajectory
Context Types	Documents only	Resource + Memory + Skill three types

What is the L0/L1/L2 layered model? Why is it needed?

L0/L1/L2 is OpenViking's progressive content loading mechanism, solving the problem of "stuffing massive context into prompts all at once":

Layer	Name	Token Limit	Purpose
L0	Abstract	~100 tokens	Vector search recall, quick filtering, list display
L1	Overview	~2000 tokens	Rerank refinement, content navigation, decision reference
L2	Details	Unlimited	Complete original content, on-demand deep loading

This design allows Agents to browse abstracts for quick positioning, then load details on demand, significantly saving token consumption.

What is Viking URI? What's its purpose?

Viking URI is OpenViking's unified resource identifier, formatted as viking://{scope}/{path}. It enables precise location of any context:

viking://
├── resources/              # Knowledge base: documents, code, web pages, etc.
│   └── my_project/
├── user/                   # User context
│   └── memories/           # User memories (preferences, entities, events)
└── agent/                  # Agent context
    ├── skills/             # Callable skills
    └── memories/           # Agent memories (cases, patterns)

Installation & Configuration

What are the environment requirements?

Python Version: 3.10 or higher
Build Tools (if installing from source or on unsupported platforms): Rust/Cargo, GCC 9+ or Clang 11+
Required Dependencies: Embedding model (Volcengine Doubao recommended)
Optional Dependencies:
- VLM (Vision Language Model): For multimodal content processing and semantic extraction
- Rerank model: For improved retrieval precision

What are `binding-client` and `http-client`? Which one should I choose?

binding-client (Default): Runs AGFS logic directly within the Python process via CGO bindings. Advantages: extremely high performance, zero network latency; Disadvantages: requires a compiled AGFS shared library locally.
http-client: Communicates with a standalone agfs-server via HTTP. Advantages: decoupled deployment, no local Go compilation needed; Disadvantages: some network communication overhead.

If your environment supports Go compilation or you've installed a Wheel package containing pre-compiled libraries, the default binding-client is recommended.

What should I do if I encounter "AGFS binding library not found"?

This usually means the AGFS shared library is not pre-built in your environment. You can:

Re-compile and install: Run pip install -e . --force-reinstall in the project root (requires Go environment).
Switch to HTTP mode: Set storage.agfs.mode = "http-client" in your ov.conf and ensure an agfs-server is running.

How do I install/upgrade OpenViking?

bash

pip install openviking --upgrade --force-reinstall

How do I configure OpenViking?

Create an ~/.openviking/ov.conf configuration file in your project directory:

json

{
  "embedding": {
    "dense": {
      "provider": "volcengine",
      "api_key": "your-api-key",
      "model": "doubao-embedding-vision-251215",
      "dimension": 1024,
      "input": "multimodal"
    }
  },
  "vlm": {
    "provider": "volcengine",
    "api_key": "your-api-key",
    "model": "doubao-seed-2-0-pro-260215",
    "api_base": "https://ark.cn-beijing.volces.com/api/v3"
  },
  "rerank": {
    "provider": "volcengine",
    "api_key": "your-api-key",
    "model": "doubao-rerank-250615"
  },
  "storage": {
    "workspace": "./data",
    "agfs": { "backend": "local" },
    "vectordb": { "backend": "local" }
  }
}

Config files at the default path ~/.openviking/ov.conf are loaded automatically; you can also specify a different path via the OPENVIKING_CONFIG_FILE environment variable or --config flag. See Configuration Guide for details.

What Embedding providers are supported?

Provider	Description
`volcengine`	Volcengine Embedding API (Recommended)
`openai`	OpenAI Embedding API
`vikingdb`	VikingDB Embedding API
`jina`	Jina AI Embedding API
`ollama`	Ollama (local OpenAI-compatible server, no API key required)

Supports Dense, Sparse, and Hybrid embedding modes.

Usage Guide

How do I initialize the client?

python

import openviking as ov

# Async client - embedded mode (recommended)
client = ov.AsyncOpenViking(path="./my_data")
await client.initialize()

# Async client - HTTP client mode
client = ov.AsyncHTTPClient(url="http://localhost:1933", api_key="your-key")
await client.initialize()

The SDK constructor only accepts url, api_key, and path parameters. Other configuration (embedding, vlm, etc.) is managed through the ov.conf config file.

What file formats are supported?

Type	Supported Formats
Text	`.txt`, `.md`, `.json`, `.yaml`
Code	`.py`, `.js`, `.ts`, `.go`, `.java`, `.cpp`, etc.
Documents	`.pdf`, `.docx`
Images	`.png`, `.jpg`, `.jpeg`, `.gif`, `.webp`
Video	`.mp4`, `.mov`, `.avi`
Audio	`.mp3`, `.wav`, `.m4a`

How do I add resources?

python

# Add single file
await client.add_resource(
    "./document.pdf",
    reason="Project technical documentation",  # Describe resource purpose to improve retrieval quality
    target="viking://resources/docs/"  # Specify storage location
)

# Add web page
await client.add_resource(
    "https://example.com/api-docs",
    reason="API reference documentation"
)

# Wait for processing to complete
await client.wait_processed()

What's the difference between `find()` and `search()`? Which should I use?

Feature	`find()`	`search()`
Session Context	Not required	Required
Intent Analysis	Not used	Uses LLM to analyze and generate 0-5 queries
Latency	Low	Higher
Use Case	Simple semantic search	Complex tasks requiring context understanding

python

# find(): Simple direct semantic search
results = await client.find(
    "OAuth authentication flow",
    target_uri="viking://resources/"
)

# search(): Complex tasks requiring intent analysis
results = await client.search(
    "Help me implement user login functionality",
    session_info=session
)

Selection Guide:

Know exactly what you're looking for → Use find()
Complex tasks needing multiple context types → Use search()

How do I use session management?

Session management is a core capability of OpenViking, supporting conversation tracking and memory extraction:

python

# Create session
session = client.session()

# Add conversation messages
await session.add_message("user", [{"type": "text", "text": "Help me analyze performance issues in this code"}])
await session.add_message("assistant", [{"type": "text", "text": "Let me analyze..."}])

# Mark used context (for tracking)
await session.used(["viking://resources/code/main.py"])

# Commit session to trigger memory extraction
await session.commit()

What memory types does OpenViking support?

OpenViking has 6 built-in memory categories, automatically extracted during session commit:

Category	Belongs To	Description
profile	user	User basic info (name, role, etc.)
preferences	user	User preferences (code style, tool choices, etc.)
entities	user	Entity memories (people, projects, organizations, etc.)
events	user	Event records (decisions, milestones, etc.)
cases	agent	Cases learned by Agent
patterns	agent	Patterns learned by Agent

How do I use Unix-like filesystem APIs?

python

# List directory contents
items = await client.ls("viking://resources/")

# Read full content (L2)
content = await client.read("viking://resources/doc.md")

# Get abstract (L0)
abstract = await client.abstract("viking://resources")

# Get overview (L1)
overview = await client.overview("viking://resources")

Retrieval Optimization

How do I improve retrieval quality?

Use Rerank model: Configuring Rerank significantly improves ranking effectiveness
Provide meaningful reason: Describe purpose when adding resources to help system understand resource value
Organize directory structure properly: Use target parameter to group related resources together
Use session context: search() leverages session history for intent analysis
Choose appropriate Embedding mode: Use multimodal input for multimodal content

How is the retrieval result score calculated?

OpenViking uses a score propagation mechanism:

Final Score = 0.5 × Embedding Similarity + 0.5 × Parent Directory Score

This design gives content under high-scoring directories a boost, reflecting the importance of "contextual environment".

What is directory recursive retrieval?

Directory recursive retrieval is OpenViking's innovative retrieval strategy:

Intent Analysis: Analyze query to generate multiple retrieval conditions
Initial Positioning: Vector retrieval to locate high-scoring directories
Refined Exploration: Secondary retrieval within high-scoring directories
Recursive Drill-down: Layer-by-layer recursion until convergence
Result Aggregation: Return the most relevant context

This strategy finds semantically matching fragments while understanding the complete context of the information.

Troubleshooting

Resources not being indexed after adding

Possible causes and solutions:

Didn't wait for processing to complete

python

await client.add_resource("./doc.pdf")
await client.wait_processed()  # Must wait

Embedding model configuration error
- Check if api_key in ~/.openviking/ov.conf is correct
- Confirm model name and endpoint are configured correctly
Unsupported file format
- Check if file extension is in the supported list
- Confirm file content is valid and not corrupted

View processing logs

python

import logging
logging.basicConfig(level=logging.DEBUG)

Search not returning expected results

Troubleshooting steps:

Confirm resources have been processed

python

# Check if resources exist
items = await client.ls("viking://resources/")

Check target_uri filter condition
- Ensure search scope includes target resources
- Try expanding search scope
Try different query approaches
- Use more specific or broader keywords
- Compare effects of find() and search()

Check L0 abstract quality

python

abstract = await client.abstract("viking://resources/your-doc")
print(abstract)  # Confirm abstract accurately reflects content

Memory extraction not working

Troubleshooting steps:

Ensure commit() was called

python

await session.commit()  # Triggers memory extraction

Check VLM configuration
- Memory extraction requires VLM model
- Confirm vlm configuration is correct
Confirm conversation content is meaningful
- Casual chat may not produce memories
- Needs to contain extractable information (preferences, entities, events, etc.)

View extracted memories

python

memories = await client.find("", target_uri="viking://user/memories/")

Performance issues

Optimization suggestions:

Batch processing: Adding multiple resources at once is more efficient than one by one
Set appropriate batch_size: Adjust batch processing size in Embedding configuration
Use local storage: Use local backend during development to reduce network latency
Async operations: Fully utilize AsyncOpenViking / AsyncHTTPClient's async capabilities

Deployment

What's the difference between embedded mode and service mode?

Mode	Use Case	Characteristics
Embedded	Local development, single-process apps	Auto-starts AGFS subprocess, uses local vector index
Service Mode	Production, distributed deployment	Connects to remote services, supports multi-instance concurrency, independently scalable

python

# Embedded mode
client = ov.AsyncOpenViking(path="./data")

# HTTP client mode (connects to a remote server)
client = ov.AsyncHTTPClient(url="http://localhost:1933", api_key="your-key")

Is OpenViking open source?

Yes, OpenViking main project is open source under the AGPL-3.0 license, and examples/ and crates/ov_cli are licensed under the Apache 2.0 license.

Introduction - Understand OpenViking's design philosophy
Quick Start - 5-minute tutorial
Architecture Overview - Deep dive into system design
Retrieval Mechanism - Detailed retrieval process
Configuration Guide - Complete configuration reference

FAQ

FAQ

Basic Concepts

What is OpenViking? What problems does it solve?

What's the fundamental difference between OpenViking and traditional vector databases?

What is the L0/L1/L2 layered model? Why is it needed?

What is Viking URI? What's its purpose?

Installation & Configuration

What are the environment requirements?

What are binding-client and http-client? Which one should I choose?

What should I do if I encounter "AGFS binding library not found"?

How do I install/upgrade OpenViking?

How do I configure OpenViking?

What Embedding providers are supported?

Usage Guide

How do I initialize the client?

What file formats are supported?

How do I add resources?

What's the difference between find() and search()? Which should I use?

How do I use session management?

What memory types does OpenViking support?

How do I use Unix-like filesystem APIs?

Retrieval Optimization

How do I improve retrieval quality?

How is the retrieval result score calculated?

What is directory recursive retrieval?

Troubleshooting

Resources not being indexed after adding

Search not returning expected results

Memory extraction not working

Performance issues

Deployment

What's the difference between embedded mode and service mode?

Is OpenViking open source?

Related Documentation

What are `binding-client` and `http-client`? Which one should I choose?

What's the difference between `find()` and `search()`? Which should I use?