azure/comparison (Azure Model Comparison)

This example demonstrates how to compare models from different providers on Azure AI Foundry, including OpenAI, Anthropic Claude, Meta Llama, and Mistral.

You can run this example with:

bash

npx promptfoo@latest init --example azure/comparison
cd azure/comparison

Setup

Deploy models from different providers in Azure AI Foundry
Set your environment variables:

bash

export AZURE_API_KEY=your-api-key
# Set apiHost in promptfooconfig.yaml for each provider's deployment

Models Compared

Provider	Model	Label
OpenAI	`gpt-5.1`	gpt-5.1
Anthropic	`claude-sonnet-4-6`	claude-sonnet
Meta	`Llama-4-Maverick-17B-128E-Instruct-FP8`	llama-4
Mistral	`Mistral-Large-2411`	mistral-large

Running the Example

bash

npx promptfoo@latest eval
npx promptfoo@latest view

Customization

Modify promptfooconfig.yaml to:

Add or remove models
Change test questions
Adjust evaluation criteria
Compare cost vs performance

Use Cases

Benchmark different models on your specific tasks
Evaluate cost-effectiveness across providers
Find the best model for your use case
A/B test model updates

azure/comparison (Azure Model Comparison)

azure/comparison (Azure Model Comparison)

Setup

Models Compared

Running the Example

Customization

Use Cases

Documentation