Home

apps/opik-documentation/documentation/fern/docs-v2/home.mdx

Opik is an open-source platform that helps you understand what your LLM application is doing, measure how well it's working, and systematically make it better. Whether you're building a chatbot, a RAG pipeline, or a multi-step agent, Opik gives you the tools to go from "it works on my laptop" to "it works reliably in production."

## End-to-End AI Engineering

<Frame> </Frame>

<Tip>
Opik is open source. The full source code is on [GitHub](https://github.com/comet-ml/opik), and the complete self-hosting guide is [here](/self-host/local_deployment).
</Tip>

## How to use Opik

<Tip>
**Using Claude Code, Cursor, or VS Code Copilot?** Install the [Opik MCP server](/mcp-server) and drive your entire workspace from chat: read traces, score outputs, save prompts, and run experiments without opening the UI.
</Tip>

<Steps>

### See what your application is doing

Opik records every LLM call, tool invocation, and agent step so you can inspect the full chain of events that led to any output. Add a few lines of code and you'll have a complete log of every request and response.

Start logging traces →
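To make the idea of recording every step concrete, here is a minimal standard-library sketch of call tracing. It is not the Opik SDK: `trace_step`, `SPANS`, and the two stand-in functions are hypothetical names chosen for illustration, and a real integration would send these records to Opik instead of an in-memory list.

```python
import functools
import time

# Hypothetical in-memory store; a real tracing backend would persist these records.
SPANS = []
_stack = []  # names of the steps currently executing, outermost first


def trace_step(func):
    """Record input, output, duration, and parent step for each call."""

    @functools.wraps(func)
    def wrapper(*args, **kwargs):
        span = {
            "name": func.__name__,
            "parent": _stack[-1] if _stack else None,
            "input": {"args": args, "kwargs": kwargs},
        }
        _stack.append(func.__name__)
        start = time.perf_counter()
        try:
            span["output"] = func(*args, **kwargs)
            return span["output"]
        except Exception as exc:
            span["error"] = repr(exc)
            raise
        finally:
            # Runs on success and on failure, so every call leaves a record.
            span["duration_s"] = time.perf_counter() - start
            _stack.pop()
            SPANS.append(span)

    return wrapper


@trace_step
def retrieve(question):
    # Stand-in for a retrieval or tool call.
    return ["Opik is an open-source platform."]


@trace_step
def answer(question):
    context = retrieve(question)
    # Stand-in for an LLM call.
    return f"Based on {len(context)} document(s): {context[0]}"


result = answer("What is Opik?")
```

After the call, `SPANS` holds one record per step, and each record's `parent` field links it back to the step that invoked it, which is the chain of events a trace view lets you inspect.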

### Build test suites from your traces

When you spot a trace that looks wrong, turn it into a test case. Use Ollie to do this automatically (just describe what went wrong), or add test cases through the UI or SDK. Then run your test suite with Ollie or from the SDK to verify your fixes.

Over time, your test suite grows from real production failures, not hypothetical examples.

Build your first test suite →
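The workflow above (turn a bad trace into a test case, then re-run the suite after a fix) can be sketched in plain Python. This is a conceptual illustration rather than Opik's test suite API: `Case`, `case_from_trace`, `run_suite`, and the sample trace are all hypothetical, and the substring check stands in for whatever assertion you would actually write.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Case:
    question: str
    expected_substring: str  # the answer must mention this text to pass


def case_from_trace(trace, expected_substring):
    """Turn a logged trace (here a plain dict) into a regression test case."""
    return Case(question=trace["input"], expected_substring=expected_substring)


def run_suite(app, cases):
    """Run every case through the app and return (case, passed) pairs."""
    results = []
    for case in cases:
        output = app(case.question)
        passed = case.expected_substring.lower() in output.lower()
        results.append((case, passed))
    return results


# A trace that looked wrong: the answer omitted the refund window.
bad_trace = {"input": "What is the refund policy?", "output": "Please contact support."}
suite = [case_from_trace(bad_trace, "30 days")]


def app_before_fix(question):
    # Stand-in for the application before the fix.
    return "Please contact support."


def app_after_fix(question):
    # Stand-in for the application after the fix.
    return "Refunds are available within 30 days of purchase."


before = [passed for _, passed in run_suite(app_before_fix, suite)]
after = [passed for _, passed in run_suite(app_after_fix, suite)]
```

The same suite fails against the old behavior and passes against the new one, so each production failure you capture keeps guarding against a regression.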

### Track quality in production

Set up online evaluation rules that automatically score incoming traces, and monitor feedback scores, latency, cost, and error rates from the project dashboard.

Set up production monitoring →
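As a rough picture of what an online evaluation rule does, the sketch below scores each incoming trace with a heuristic and rolls the results up into dashboard-style numbers. The rule, the trace fields (`latency_s`, `cost_usd`, `error`), and the function names are assumptions made for this example, not Opik's schema or API.

```python
from statistics import mean


def answer_length_rule(trace):
    """Hypothetical heuristic rule: 1.0 for a non-empty output of at most 500 characters."""
    output = trace.get("output") or ""
    return 1.0 if 0 < len(output) <= 500 else 0.0


def summarize(traces, rule):
    """Score each trace with the rule and aggregate the monitored metrics."""
    scores = [rule(t) for t in traces]
    return {
        "mean_score": mean(scores),
        "mean_latency_s": mean(t["latency_s"] for t in traces),
        "total_cost_usd": sum(t["cost_usd"] for t in traces),
        "error_rate": sum(1 for t in traces if t.get("error")) / len(traces),
    }


# Two illustrative traces: one healthy, one that timed out with an empty output.
traces = [
    {"output": "Paris is the capital of France.", "latency_s": 0.5, "cost_usd": 0.25},
    {"output": "", "latency_s": 1.5, "cost_usd": 0.25, "error": "timeout"},
]
summary = summarize(traces, answer_length_rule)
```

In practice the rule would more often be an LLM-as-a-judge metric than a length check, but the shape is the same: a function from trace to score, applied automatically to every new trace.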

### Automatically improve your prompts

Opik's optimization algorithms test variations of your prompts against your metrics and datasets to find what works best, without manual trial and error.

Run your first optimization →

</Steps>
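The prompt optimization step above comes down to scoring prompt variations against a metric and a dataset. The sketch below shows the simplest possible version, an exhaustive comparison of two hand-written candidates. It does not represent any of Opik's optimization algorithms: the dataset, `fake_model`, and `exact_match` are hypothetical stand-ins so the example runs without an LLM.

```python
from statistics import mean

# Hypothetical dataset: each item pairs an input with the expected output.
DATASET = [
    {"input": "Capital of France?", "expected": "Paris"},
    {"input": "Capital of Japan?", "expected": "Tokyo"},
]

KNOWN_ANSWERS = {item["input"]: item["expected"] for item in DATASET}


def fake_model(prompt, question):
    """Stand-in for an LLM call: it answers tersely only when the prompt asks for it."""
    answer = KNOWN_ANSWERS[question]
    return answer if "one word" in prompt else f"The answer is {answer}."


def exact_match(output, expected):
    """Metric: 1.0 for an exact match, 0.0 otherwise."""
    return 1.0 if output.strip() == expected else 0.0


def score_prompt(prompt):
    """Average the metric over every dataset item for one prompt variation."""
    return mean(
        exact_match(fake_model(prompt, item["input"]), item["expected"])
        for item in DATASET
    )


candidates = [
    "Answer the question.",
    "Answer the question in one word.",
]
scores = {prompt: score_prompt(prompt) for prompt in candidates}
best_prompt = max(scores, key=scores.get)
```

An optimizer automates the part this sketch does by hand: generating the candidates and deciding which ones to try next based on the scores.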

## Explore by feature

<CardGroup cols={2}>
  <Card title="Quickstart" href="/quickstart" icon="fa-solid fa-rocket" iconPosition="left">
    Get Opik running with your existing AI stack in minutes. Works with OpenAI, Anthropic, LangChain, and 50+ other providers and frameworks.
  </Card>
  <Card title="MCP Server" href="/mcp-server" icon="fa-solid fa-network-wired" iconPosition="left">
    Connect Claude Code, Cursor, or VS Code Copilot directly to your Opik workspace. Read traces, score outputs, and run experiments from chat, with no UI required.
  </Card>
  <Card title="Log traces" href="/tracing/advanced/log_traces" icon="fa-solid fa-eye" iconPosition="left">
    Record every LLM call, tool invocation, and agent step. Debug failures, track token costs, and understand what your application is doing.
  </Card>
  <Card title="Evaluate performance" href="/evaluation/overview" icon="fa-solid fa-chart-line" iconPosition="left">
    Score your application on hallucination, context recall, relevance, and more using automated LLM-as-a-judge and heuristic metrics.
  </Card>
  <Card title="Optimize prompts" href="/development/optimization-runs/overview" icon="fa-solid fa-brain" iconPosition="left">
    Automatically generate and test better prompts for every step in your agent using six optimization algorithms.
  </Card>
  <Card title="Manage prompts" href="/development/prompt-library/getting-started" icon="fa-solid fa-wand-magic-sparkles" iconPosition="left">
    Store and version your prompts, compare results in the [Prompt Playground](/development/prompt-playground), and experiment with different models.
  </Card>
  <Card title="Self-host Opik" href="/self-host/overview" icon="fa-solid fa-server" iconPosition="left">
    Deploy on your own infrastructure with Docker locally or Kubernetes at scale. Full control over your data.
  </Card>
</CardGroup>

## See it in action

<Frame> <iframe width="100%" height="500px" src="https://www.youtube-nocookie.com/embed/TO9ar6-OJj4?rel=0" frameborder="0" allow="accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture; web-share; fullscreen" referrerpolicy="strict-origin-when-cross-origin" allowfullscreen ></iframe> </Frame>

## Open-source access meets enterprise performance

All Opik versions (cloud, open source, and enterprise) include the full AI engineering feature set and run on the Comet platform, with proven performance at scale supporting many of the world's largest organizations.