examples/anthropic/opus-4-7-coding/README.md
This example exercises Claude Opus 4.7 on hard coding tasks using the new xhigh effort level.
You can run this example with:
npx promptfoo@latest init --example anthropic/opus-4-7-coding
cd opus-4-7-coding
Claude Opus 4.7 is Anthropic's latest model for agentic coding. This example evaluates:
xhigh effort level: sits between high and max, giving finer control over the reasoning/latency tradeoff on hard problems. Recommended as a starting point for coding and agentic use cases.# Set your API key
export ANTHROPIC_API_KEY=your_api_key_here
# Run the evaluation
npx promptfoo@latest eval
# View results
npx promptfoo@latest view
Opus 4.7 is also reachable through:
bedrock:us.anthropic.claude-opus-4-7 (or bedrock:converse:us.anthropic.claude-opus-4-7)vertex:claude-opus-4-7 with config.region: globalanthropic:messages:claude-opus-4-7 at https://<resource>.services.ai.azure.com/anthropic via apiBaseUrlPromptfoo suppresses temperature on Opus 4.7 across all four providers (the model deprecated it at the model level) and warns once per provider instance if the user set it explicitly.