examples/openai-agents-advanced/README.md
This example exercises the OpenAI Agents SDK TypeScript features that matter once you move beyond a single-turn agent:
MemorySession historySandboxAgent execution with the SDK local sandbox clientOPENAI_API_KEYnpx promptfoo@latest init --example openai-agents-advanced
cd openai-agents-advanced
Or, from a cloned repository:
cd examples/openai-agents-advanced
npm install
npx promptfoo eval -c promptfooconfig.yaml --no-cache -j 1
The second test depends on the first test's remembered code word, so run this config with -j 1.
For stateful red-team strategies, use a session factory keyed by a per-test sessionId rather than one shared inline session. The OpenAI Agents provider docs show the transformVars plus session-factory pattern that keeps turns together without sharing history across unrelated tests; the multi-turn strategy docs explain when stateful: true is appropriate.
npx promptfoo eval -c promptfooconfig.sandbox.yaml --no-cache
The sandbox eval mounts a synthetic task.md, asks the agent to use the ticket-summary skill, and asserts on traced shell activity plus the final answer.
See also the Tracing docs for trajectory assertions and the OpenAI Agents provider docs for the full JavaScript SDK configuration surface.