cookbook/data_labeling/_16_document_extraction/TEST_LOG.md
Tested 2026-05-17 against gpt-5.5 (OpenAIResponses), agno 2.6.6. Input is agno-public/recipes/ThaiRecipes.pdf.
Status: PASS
Description: Extract document-level metadata (title, cuisine, language, recipe_count) into a RecipeBook.
Result: Title "Thai SELECT Cookbook", cuisine "Thai cuisine", language "English", recipe_count 10.
Status: PASS
Description: Same task with ConfidentField wrapping each value.
Result: All four fields populated with sensible confidences.
Note: Originally failed with Invalid schema for response_format 'RecipeBook': $ref cannot have keywords {'description'} — OpenAI's strict structured-output mode rejects a description on a field whose type is itself a referenced model. Fix: removed the Field(..., description=...) annotation on recipe_count (kept the explanatory text as a code comment).
Status: PASS
Description: Extract a RecipeBook with a nested list of Recipe line items (name, course, prep time).
Result: Multiple recipes extracted with full nested structure.