Image Extraction

cookbook/data_labeling/_07_image_extraction/README.md

Image → typed Pydantic object. This follows the same pattern as text extraction, but with an image as input; the output can hold descriptive attributes, OCR'd fields, and structured metadata.
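As a rough illustration of what "typed Pydantic object" means here, the sketch below defines a schema and validates a JSON payload against it. The `SceneAttributes` model, its field names, and the sample payload are hypothetical and are not taken from the cookbook scripts. It assumes Pydantic v2.

```python
from typing import Optional

from pydantic import BaseModel, Field, ValidationError


class SceneAttributes(BaseModel):
    """Hypothetical schema; field names are illustrative only."""

    description: str = Field(description="One-sentence summary of the scene")
    dominant_colors: list[str] = Field(default_factory=list)
    visible_text: Optional[str] = Field(
        default=None, description="Text read from the image, if any"
    )
    object_count: int = Field(ge=0, description="Number of main objects")


# Stand-in for a model response; a real run would get this from the LLM.
raw = (
    '{"description": "A red bicycle leaning on a brick wall", '
    '"dominant_colors": ["red", "brown"], '
    '"visible_text": null, "object_count": 1}'
)
scene = SceneAttributes.model_validate_json(raw)

# Malformed output is rejected instead of silently passing through.
try:
    SceneAttributes.model_validate({"description": "x", "object_count": -1})
    rejected = False
except ValidationError:
    rejected = True
```

The point of the schema is that downstream code can rely on `scene.object_count` being a non-negative integer rather than re-parsing free text.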

Files

  • basic.py — image → typed scene attributes.
  • with_confidence.py — adds per-field confidence.
  • ocr_fields.py — extract text-heavy fields from an image (sign, receipt, product label).
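One possible way to model per-field confidence on OCR-style fields is to wrap each value in a small generic container. This is a sketch under stated assumptions, not necessarily how with_confidence.py or ocr_fields.py do it: `Scored`, `ReceiptFields`, `low_confidence_fields`, and the 0.7 threshold are all hypothetical. It assumes Pydantic v2.

```python
from typing import Generic, Optional, TypeVar

from pydantic import BaseModel, Field

T = TypeVar("T")


class Scored(BaseModel, Generic[T]):
    """Hypothetical wrapper pairing a value with a confidence in [0, 1]."""

    value: T
    confidence: float = Field(ge=0.0, le=1.0)


class ReceiptFields(BaseModel):
    """Hypothetical OCR-style schema for a receipt photo."""

    merchant: Scored[str]
    total: Scored[float]
    date: Scored[Optional[str]]


def low_confidence_fields(obj: BaseModel, threshold: float = 0.7) -> list[str]:
    """Return names of Scored fields whose confidence is below the threshold."""
    flagged = []
    for name in type(obj).model_fields:
        field = getattr(obj, name)
        if isinstance(field, Scored) and field.confidence < threshold:
            flagged.append(name)
    return flagged


# Stand-in for a model response.
sample = ReceiptFields.model_validate(
    {
        "merchant": {"value": "Corner Cafe", "confidence": 0.95},
        "total": {"value": 12.5, "confidence": 0.6},
        "date": {"value": None, "confidence": 0.2},
    }
)
needs_review = low_confidence_fields(sample)
```

Fields returned by `low_confidence_fields` could be routed to a human reviewer while the rest are accepted automatically. Note that model-reported confidence is self-assessed and not calibrated.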

When to use

  • Auto-cataloging product photos (color, style, type).
  • Pre-filling form fields from a photo (receipt, business card).
  • Generating searchable metadata for a media archive (see also _09_image_extraction_to_vectordb/).

If you only need a label, use _06_image_classification/. If you need pixel regions, use _08_image_bounding_boxes/.

Run

```bash
python cookbook/data_labeling/_07_image_extraction/basic.py
python cookbook/data_labeling/_07_image_extraction/with_confidence.py
python cookbook/data_labeling/_07_image_extraction/ocr_fields.py
```

Requires the OPENAI_API_KEY environment variable to be set. Replace the image URLs in the scripts with your own images as needed.
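Swapping in your own image might look like the sketch below. It assumes Agno's `Agent`, `OpenAIChat`, and `Image` classes, the `output_schema` parameter, and the `gpt-4o` model id; check these against your installed version and the cookbook scripts. `ProductPhoto`, `image_kwargs`, and `extract` are hypothetical names introduced for illustration.

```python
from pathlib import Path

from pydantic import BaseModel


class ProductPhoto(BaseModel):
    """Hypothetical schema for a product photo."""

    product_type: str
    color: str
    style: str


def image_kwargs(source: str) -> dict:
    """Pick the Image argument based on whether source is a URL or a local path."""
    if source.startswith(("http://", "https://")):
        return {"url": source}
    return {"filepath": Path(source)}


def extract(source: str) -> ProductPhoto:
    """Run one extraction; needs agno installed and OPENAI_API_KEY set."""
    # Imported lazily so the helper above stays usable without agno installed.
    from agno.agent import Agent
    from agno.media import Image
    from agno.models.openai import OpenAIChat

    agent = Agent(
        model=OpenAIChat(id="gpt-4o"),  # assumed model id
        output_schema=ProductPhoto,  # assumed parameter name
    )
    result = agent.run(
        "Describe the product in this image.",
        images=[Image(**image_kwargs(source))],
    )
    return result.content


if __name__ == "__main__":
    # Replace with your own URL or local file path.
    print(extract("https://example.com/your-image.jpg"))
```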