examples/openai-chatkit/README.md
Evaluate ChatKit workflows built with OpenAI's Agent Builder using browser automation.
npx promptfoo@latest init --example openai-chatkit
cd openai-chatkit
npx playwright install chromium
Get your workflow ID from Agent Builder:
wf_68ffb83dbfc88190a38103c2bb9f421003f913035dbdb131)Update promptfooconfig.yaml with your workflow ID
OPENAI_API_KEY - Your OpenAI API keynpx promptfoo@latest eval --max-concurrency 4
View results:
npx promptfoo@latest view
For workflows that ask follow-up questions, enable stateful mode:
providers:
- id: openai:chatkit:wf_YOUR_WORKFLOW_ID
config:
stateful: true
tests:
- vars:
message: 'I want to plan a party'
- vars:
message: 'For 20 people with a $500 budget'
Run with --max-concurrency 1 for stateful mode.
Test multi-turn conversations automatically:
providers:
- id: openai:chatkit:wf_YOUR_WORKFLOW_ID
config:
stateful: true
defaultTest:
provider:
id: 'promptfoo:simulated-user'
config:
maxTurns: 5
tests:
- vars:
instructions: |
You are planning a birthday party.
Answer questions naturally and provide details when asked.