apps/opik-documentation/documentation/fern/docs-v2/home.mdx
Opik is an open-source platform that helps you understand what your LLM application is doing, measure how well it's working, and systematically make it better. Whether you're building a chatbot, a RAG pipeline, or a multi-step agent, Opik gives you the tools to go from "it works on my laptop" to "it works reliably in production."
Opik records every LLM call, tool invocation, and agent step so you can inspect the full chain of events that led to any output. Add a few lines of code and you'll have a complete log of every request and response.
When you spot a trace that looks wrong, turn it into a test case. Use Ollie to do this automatically (just describe what went wrong), or add test cases through the UI or SDK. Then run your test suite with Ollie or from the SDK to verify your fixes.
Over time, your test suite grows from real production failures, not hypothetical examples.
Set up online evaluation rules that automatically score incoming traces, and monitor feedback scores, latency, cost, and error rates from the project dashboard.
Set up production monitoring →
Opik's optimization algorithms test variations of your prompts against your metrics and datasets to find what works best, without manual trial and error.
Run your first optimization → </Steps>
All Opik versions (cloud, open source, and enterprise) include the full AI engineering feature set and run on the Comet platform, with proven performance at scale supporting many of the world's largest organizations.