examples/eval-image-classification/README.md
You can run this example with:
npx promptfoo@latest init --example eval-image-classification
cd eval-image-classification
This example demonstrates how to use Promptfoo for image classification with the Fashion MNIST dataset. It uses GPT-4o and GPT-4o-mini with a structured JSON schema to analyze each image, returning a classification, a color analysis, and additional attributes.
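To make the idea of a structured JSON schema concrete, here is a minimal Python sketch of what such a schema and a basic check of a model reply could look like. The field names (`classification`, `color`, `attributes`), the constant names, and the `check_response` helper are assumptions for illustration only; the schema this example actually uses is defined in `promptfooconfig.yaml` and may differ. The ten class names are Fashion MNIST's standard labels.

```python
import json

# Fashion MNIST's ten standard class names.
FASHION_MNIST_LABELS = [
    "T-shirt/top", "Trouser", "Pullover", "Dress", "Coat",
    "Sandal", "Shirt", "Sneaker", "Bag", "Ankle boot",
]

# Hypothetical schema: the field names are assumptions, not the example's real config.
IMAGE_ANALYSIS_SCHEMA = {
    "type": "object",
    "properties": {
        "classification": {"type": "string", "enum": FASHION_MNIST_LABELS},
        "color": {"type": "string"},
        "attributes": {"type": "array", "items": {"type": "string"}},
    },
    "required": ["classification", "color", "attributes"],
    "additionalProperties": False,
}


def check_response(raw: str) -> list[str]:
    """Return a list of problems found in a model's JSON reply (empty if none)."""
    try:
        data = json.loads(raw)
    except json.JSONDecodeError as exc:
        return [f"not valid JSON: {exc}"]
    if not isinstance(data, dict):
        return ["top-level value is not an object"]

    problems = []
    # Every required field must be present.
    for field in IMAGE_ANALYSIS_SCHEMA["required"]:
        if field not in data:
            problems.append(f"missing field: {field}")
    # No fields beyond those the schema declares.
    for field in sorted(set(data) - set(IMAGE_ANALYSIS_SCHEMA["properties"])):
        problems.append(f"unexpected field: {field}")
    # The label must be one of the ten known classes.
    label = data.get("classification")
    if label is not None and label not in FASHION_MNIST_LABELS:
        problems.append(f"unknown label: {label}")
    return problems
```

Constraining `classification` to an enum of the ten labels keeps model output directly comparable with the dataset's ground-truth labels.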
Set up your OpenAI API key:
export OPENAI_API_KEY='your-api-key'
Run the evaluation:
npx promptfoo@latest eval
View the results:
npx promptfoo@latest view
Optionally, re-generate or update the dataset:
python dataset_gen.py
Note: You may need to install the dependencies first:
pip install -r requirements.txt
This script creates a CSV file containing 100 random images from the Fashion MNIST dataset along with their labels. A CSV with 10 sample images is already included, so you can skip this step if you prefer.
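The sampling step can be sketched roughly as follows. This is not the contents of `dataset_gen.py`; it is a minimal illustration that assumes the images are already loaded as raw bytes paired with integer label indices. The function name `write_sample_csv` and the column names `image_base64` and `label` are hypothetical.

```python
import base64
import csv
import random

# Fashion MNIST's ten standard class names, indexed by label id.
LABELS = [
    "T-shirt/top", "Trouser", "Pullover", "Dress", "Coat",
    "Sandal", "Shirt", "Sneaker", "Bag", "Ankle boot",
]


def write_sample_csv(records, path, n=100, seed=0):
    """Write a random sample of (image_bytes, label_index) pairs to a CSV file.

    Column names are assumptions for illustration; returns the number of rows written.
    """
    records = list(records)
    rng = random.Random(seed)  # fixed seed so the sample is reproducible
    sample = rng.sample(records, min(n, len(records)))

    with open(path, "w", newline="", encoding="utf-8") as f:
        writer = csv.writer(f)
        writer.writerow(["image_base64", "label"])
        for image_bytes, label_index in sample:
            encoded = base64.b64encode(image_bytes).decode("ascii")
            writer.writerow([encoded, LABELS[label_index]])
    return len(sample)
```

Base64 encoding lets binary image data sit in a plain-text CSV cell, and writing the label as a class name rather than an index makes the expected value readable in test results.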
Experiment with the configuration:
Edit promptfooconfig.yaml to add or adjust required fields.