examples/integration-tau/README.md
You can run this example with:
npx promptfoo@latest init --example integration-tau
cd integration-tau
This example demonstrates testing conversational AI agents using OpenAI's Responses API with function calling. It simulates an airline booking system with 31 different customer personas to test how well agents handle realistic conversations.
The example uses mocked airline functions to simulate a booking system without requiring real APIs:
export OPENAI_API_KEY=your_api_key_herepromptfoo evalpromptfoo viewRealistic multi-turn conversations between different customer types and the booking agent:
User: I need a flight from New York to Seattle on May 20th
Agent: I'd be happy to help! May I have your user ID?
User: It's mia_li_3668
Agent: Thank you! I found these options: Direct flight $325, One-stop $295
User: I'll take the cheaper United flight
Agent: Perfect! Your flight is confirmed. Confirmation: CF8X2M1K
For more information about the Simulated User Provider and other promptfoo features, visit the documentation at promptfoo.dev.