apps/opik-documentation/documentation/fern/docs/opik-university/3_evaluation/3.1-evaluation-concepts-overview.mdx
This video introduces the fundamentals of LLM evaluation and why it differs from traditional machine learning metrics. Unlike conventional ML evaluation that relies on accuracy and F1 scores, LLM evaluation requires assessing text qualities like relevance, accuracy, and helpfulness. You'll learn about Opik's systematic three-component evaluation framework and see how it enables quantitative performance measurement across hundreds of test cases.