This hands-on video demonstrates dataset creation using a practical RAG (Retrieval-Augmented Generation) example that compares OpenAI and Google Gemini models. You'll learn how evaluation datasets serve as the foundation for systematic LLM testing: each dataset is a collection of example inputs your application will encounter, paired with the expected outputs, similar to a validation set in traditional machine learning.
- `get_or_create_dataset()`
- `@track` decorators
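
As a rough illustration of the dataset idea described above, the following minimal sketch builds a few input/expected-output items and shows how they might be uploaded with `get_or_create_dataset()`. The dataset name `rag-eval-demo`, the item keys `input` and `expected_output`, the helper names, and the sample questions are all placeholders chosen for this example rather than values taken from the video. The upload helper assumes the Opik Python SDK is installed and configured, so it is defined here but not called.

```python
from typing import Dict, List, Tuple


def build_dataset_items(qa_pairs: List[Tuple[str, str]]) -> List[Dict[str, str]]:
    """Turn (question, expected answer) pairs into dataset item dicts.

    The key names are illustrative; dataset items are plain dictionaries.
    """
    return [{"input": q, "expected_output": a} for q, a in qa_pairs]


def upload_items(items: List[Dict[str, str]], dataset_name: str = "rag-eval-demo"):
    """Insert items into an Opik dataset, creating it if it does not exist.

    Assumes the Opik SDK is installed and configured. The import is done
    lazily so the rest of this sketch runs without the SDK or a server.
    """
    import opik

    client = opik.Opik()
    dataset = client.get_or_create_dataset(name=dataset_name)
    dataset.insert(items)
    return dataset


# Placeholder examples, not the data used in the video.
items = build_dataset_items(
    [
        ("What is the capital of France?", "Paris"),
        ("How many days are in a leap year?", "366"),
    ]
)
print(items)
```

Calling `upload_items(items)` would then send the items to the configured Opik workspace. Keeping item construction separate from the upload step makes it easy to reuse the same examples when comparing two models.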