docs/guides/browser.md
Agent Zero has a built-in Browser for real web pages.
Use it for research, forms, screenshots, UI review, downloads, extensions, and anything else that works best in a browser.
The Browser has two connected parts:
The Browser surface does not open automatically every time the agent browses. Open it when you want to watch, steer, or annotate the page.
The surface shows Browser tabs, back/forward/reload controls, an address bar, an annotation toggle, and Browser settings.
You can ask naturally:
Use the Browser tool to open https://example.com, read the page content, and take a screenshot. Keep the response short.
The agent can:
When a page is read, Agent Zero gets simple references such as [link 1],
[button 2], or [input text 3]. It can use those references to act on the
right part of the page.
list
state
set_active
navigate
back
forward
reload
hover
double_click
right_click
drag
scroll
evaluate
key_chord
mouse
wheel
keyboard
clipboard
set_viewport
multi
close
close_all
When Agent Zero takes a Browser screenshot, the image is saved and shown in the chat history.
Many Browser steps also keep a small history screenshot. That means an older chat can show the page as it looked when the agent worked on it, not just the latest page frame.
Annotate mode lets you mark a page element or region and send a targeted comment back into the chat. This is useful for UI review: you can point at the exact thing that needs to change instead of describing it from memory.
Open Browser settings from the Browser toolbar or from the Browser plugin settings.
The toolbar menu includes:
The full settings include:
The Docker browser is the default. It is a separate browser inside Agent Zero's Docker environment, and it is the browser shown in the live Browser surface.
Use Docker browser mode when you want a clean, separate browser that Agent Zero can show in the Canvas.
In normal Docker installs, the needed browser is already included. In local development, Agent Zero can install it the first time it is needed.
Bring Your Own Browser lets Agent Zero use Chrome, Edge, or Chromium on your own computer through A0 CLI.
Use it when the page, login, or browser profile should stay on your machine.
Requirements:
chrome://inspect/#remote-debugging, and enable Allow remote debugging for this browser instance.The first time Agent Zero tries to operate that browser, Chrome shows an Allow remote debugging? prompt. Click Allow if you trust the connected Agent Zero instance and A0 CLI session.
[!IMPORTANT] Remote debugging grants full control of that browser session, including access to saved data, cookies, site data, and navigation. Enable it only for browser instances you intend Agent Zero to control.
Browser settings decide what Agent Zero may do with page text and screenshots from your own browser:
[!NOTE] The live Browser surface shows the Docker browser. When Agent Zero uses your host browser, page results and screenshots appear in the chat, but the live Canvas is not a stream of your personal browser window.
For setup details, profiles, and troubleshooting, see the A0 CLI Connector guide.
Browser can load Chrome extensions into the Docker browser.
Only enable extensions you trust. They run inside the Docker browser, but they can still change what happens in that browser.
Start with Agent Zero's built-in Browser.
Use an MCP browser option only when you specifically need another browser tool or an external automation service.
Common alternatives include:
See MCP Setup for MCP setup.
PLAYWRIGHT_BROWSERS_PATH=tmp/playwright playwright install chromium./browser status in A0 CLI.