Release process/changelog

The project follows a slightly modified version of semantic versioning using the form 0.Y.Z. The leading 0 indicates the SDK is still evolving rapidly. Increment the components as follows:

Minor (`Y`) versions

We will increase minor versions Y for breaking changes to any public interfaces that are not marked as beta. For example, going from 0.0.x to 0.1.x might include breaking changes.

If you don't want breaking changes, we recommend pinning to 0.0.x versions in your project.

Patch (`Z`) versions

We will increment Z for non-breaking changes:

Bug fixes
New features
Changes to private interfaces
Updates to beta features

Breaking change changelog

0.15.0

In this version, model refusals are now surfaced explicitly as ModelRefusalError instead of being treated as empty text output or, for structured outputs, causing the run loop to retry until MaxTurnsExceeded.

This affects code that previously expected a refusal-only model response to complete with final_output == "". To handle refusals without raising, provide a model_refusal run error handler:

python

result = Runner.run_sync(
    agent,
    input,
    error_handlers={"model_refusal": lambda data: data.error.refusal},
)

For structured-output agents, the handler can return a value matching the agent's output schema, and the SDK will validate it like other run error handler final outputs.

0.14.0

This minor release does not introduce a breaking change, but it adds a major new beta feature area: Sandbox Agents, plus the runtime, backend, and documentation support needed to use them across local, containerized, and hosted environments.

Highlights:

Added a new beta sandbox runtime surface centered on SandboxAgent, Manifest, and SandboxRunConfig, letting agents work inside persistent isolated workspaces with files, directories, Git repos, mounts, snapshots, and resume support.
Added sandbox execution backends for local and containerized development via UnixLocalSandboxClient and DockerSandboxClient, plus hosted provider integrations for Blaxel, Cloudflare, Daytona, E2B, Modal, Runloop, and Vercel through optional extras.
Added sandbox memory support so future runs can reuse lessons from prior runs, with progressive disclosure, multi-turn grouping, configurable isolation boundaries, and persisted-memory examples including S3-backed workflows.
Added a broader workspace and resume model, including local and synthetic workspace entries, remote storage mounts for S3/R2/GCS/Azure Blob Storage/S3 Files, portable snapshots, and resume flows via RunState, SandboxSessionState, or saved snapshots.
Added substantial sandbox examples and tutorials under examples/sandbox/, covering coding tasks with skills, handoffs, memory, provider-specific setups, and end-to-end workflows such as code review, dataroom QA, and website cloning.
Extended the core runtime and tracing stack with sandbox-aware session preparation, capability binding, state serialization, unified tracing, prompt cache key defaults, and safer sensitive MCP output redaction.

0.13.0

This minor release does not introduce a breaking change, but it includes a notable Realtime default update plus new MCP capabilities and runtime stability fixes.

Highlights:

The default websocket Realtime model is now gpt-realtime-1.5, so new Realtime agent setups use the newer model without extra configuration.
MCPServer now exposes list_resources(), list_resource_templates(), and read_resource(), and MCPServerStreamableHttp now exposes session_id so streamable HTTP sessions can be resumed across reconnects or stateless workers.
Chat Completions integrations can now opt into reasoning-content replay via should_replay_reasoning_content, improving provider-specific reasoning/tool-call continuity for adapters such as LiteLLM/DeepSeek.
Fixed several runtime and session edge cases, including concurrent first writes in SQLAlchemySession, compaction requests with orphaned assistant message IDs after reasoning stripping, remove_all_tools() leaving MCP/reasoning items behind, and a race in the function-tool batch executor.

0.12.0

This minor release does not introduce a breaking change. Check the release notes for major feature additions.

0.11.0

This minor release does not introduce a breaking change. Check the release notes for major feature additions.

0.10.0

This minor release does not introduce a breaking change, but it includes a significant new feature area for OpenAI Responses users: websocket transport support for the Responses API.

Highlights:

Added websocket transport support for OpenAI Responses models (opt-in; HTTP remains the default transport).
Added a responses_websocket_session() helper / ResponsesWebSocketSession for reusing a shared websocket-capable provider and RunConfig across multi-turn runs.
Added a new websocket streaming example (examples/basic/stream_ws.py) covering streaming, tools, approvals, and follow-up turns.

0.9.0

In this version, Python 3.9 is no longer supported, as this major version reached EOL three months ago. Please upgrade to a newer runtime version.

Additionally, the type hint for the value returned from the Agent#as_tool() method has been narrowed from Tool to FunctionTool. This change should not usually cause breaking issues, but if your code relies on the broader union type, you may need to make some adjustments on your side.

0.8.0

In this version, two runtime behavior changes may require migration work:

Function tools wrapping synchronous Python callables now execute on worker threads via asyncio.to_thread(...) instead of running on the event loop thread. If your tool logic depends on thread-local state or thread-affine resources, migrate to an async tool implementation or make thread affinity explicit in your tool code.
Local MCP tool failure handling is now configurable, and the default behavior can return model-visible error output instead of failing the whole run. If you rely on fail-fast semantics, set mcp_config={"failure_error_function": None}. Server-level failure_error_function values override the agent-level setting, so set failure_error_function=None on each local MCP server that has an explicit handler.

0.7.0

In this version, there were a few behavior changes that can affect existing applications:

Nested handoff history is now opt-in (disabled by default). If you depended on the v0.6.x default nested behavior, explicitly set RunConfig(nest_handoff_history=True).
The default reasoning.effort for gpt-5.1 / gpt-5.2 changed to "none" (from the previous default "low" configured by SDK defaults). If your prompts or quality/cost profile relied on "low", set it explicitly in model_settings.

0.6.0

In this version, the default handoff history is now packaged into a single assistant message instead of exposing the raw user/assistant turns, giving downstream agents a concise, predictable recap

The existing single-message handoff transcript now by default starts with "For context, here is the conversation so far between the user and the previous agent:" before the <CONVERSATION HISTORY> block, so downstream agents get a clearly labeled recap

0.5.0

This version doesn’t introduce any visible breaking changes, but it includes new features and a few significant updates under the hood:

Added support for RealtimeRunner to handle SIP protocol connections
Significantly revised the internal logic of Runner#run_sync for Python 3.14 compatibility

0.4.0

In this version, openai package v1.x versions are no longer supported. Please use openai v2.x along with this SDK.

0.3.0

In this version, the Realtime API support migrates to gpt-realtime model and its API interface (GA version).

0.2.0

In this version, a few places that used to take Agent as an arg, now take AgentBase as an arg instead. For example, the list_tools() call in MCP servers. This is a purely typing change, you will still receive Agent objects. To update, just fix type errors by replacing Agent with AgentBase.

0.1.0

In this version, [MCPServer.list_tools()][agents.mcp.server.MCPServer] has two new params: run_context and agent. You'll need to add these params to any classes that subclass MCPServer.

Release process/changelog

Release process/changelog

Minor (Y) versions

Patch (Z) versions

Breaking change changelog

0.15.0

0.14.0

0.13.0

0.12.0

0.11.0

0.10.0

0.9.0

0.8.0

0.7.0

0.6.0

0.5.0

0.4.0

0.3.0

0.2.0

0.1.0

Minor (`Y`) versions

Patch (`Z`) versions