Back to Agno

Reliability Eval Cookbooks

cookbook/09_evals/reliability/README.md

2.6.4499 B
Original Source

Reliability Eval Cookbooks

Reliability examples validate whether expected tool calls are made correctly.

Files

  • db_logging.py - Reliability evaluation with PostgreSQL logging.
  • reliability_async.py - Asynchronous reliability evaluation flow.
  • single_tool_calls/calculator.py - Single expected calculator tool call reliability.
  • multiple_tool_calls/calculator.py - Multi-tool calculator call reliability.
  • team/ai_news.py - Team delegation and web-search tool call reliability.