Agent Browser QA Principles

This document sets the direction of LangBot agent testing so that the project does not drift into a backend API smoke-test framework.

Primary Goal

langbot-skills should help an agent behave like a QA engineer using the product, not like a backend curl script.

The primary path is:

    developer intent -> lbs test plan -> agent controls browser -> UI result + console + logs -> report/assets

Browser/UI interaction is the source of truth for product QA cases.
A backend API or curl response is never enough to mark a UI case as passed.
API/curl/log checks are allowed as diagnostics after a UI path is attempted or when debugging environment readiness.
A case passes only when the user-visible UI result is correct.
The agent should inspect browser console/network output when available.
If screenshot or vision capability is available, the agent should check for blank pages, overlap, hidden actions, broken layout, and error toasts.
If no visual model is available, use DOM/accessibility snapshots and console output instead.
New stable UI paths should be added as cases/*.yaml.
New recurring failure modes should be added as troubleshooting/*.yaml.
Secrets, tokens, API keys, and localStorage token values must never be printed.
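
The no-secrets rule above can be enforced mechanically before any snapshot or log excerpt reaches a report. The following is a minimal sketch, not part of lbs: the `redact` helper, the `[REDACTED]` placeholder, and the list of sensitive key names are all assumptions chosen for illustration and would need to match the fields the project actually stores.

```python
import re

# Hypothetical pattern for key names treated as sensitive (an assumption,
# not a list taken from the project); extend it to match real field names.
SENSITIVE_KEY = re.compile(
    r"(token|secret|api[_-]?key|password|authorization)", re.IGNORECASE
)


def redact(value):
    """Return a copy of `value` with sensitive fields replaced by a placeholder.

    Walks nested dicts and lists, so it can be applied to a localStorage
    dump, a captured request header set, or a structured log entry.
    The input is never modified.
    """
    if isinstance(value, dict):
        return {
            key: "[REDACTED]" if SENSITIVE_KEY.search(str(key)) else redact(item)
            for key, item in value.items()
        }
    if isinstance(value, list):
        return [redact(item) for item in value]
    return value
```

Redacting by key name rather than by value keeps the helper simple, but it only catches secrets stored under recognizable names; free-text log lines would need a separate value-based filter.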

lbs manages assets and produces plans. It does not replace the agent's browser-control ability.

    bin/lbs test plan pipeline-debug-chat

This command outputs:

The active agent then executes the plan with Computer Use, Playwright MCP, or another available browser-control tool.

Diagnostics can include:

Diagnostics answer "where did it fail?" They do not replace "did the user-visible UI work?"
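
The split between verdict and diagnostics can be sketched as a small decision function. This is an illustration under assumptions, not the lbs report format: `CaseEvidence`, its field names, and the `not_run` status are hypothetical. The point it shows is that API or log results are carried along for debugging but never influence whether the case passes.

```python
from dataclasses import dataclass, field


@dataclass
class CaseEvidence:
    """Hypothetical record of what the agent observed for one case."""
    ui_attempted: bool        # the agent drove the browser through the case
    ui_result_correct: bool   # the user-visible result matched expectations
    api_ok: bool = False      # diagnostic only; never decides the verdict
    diagnostics: list = field(default_factory=list)  # notes on where it failed


def verdict(evidence: CaseEvidence) -> str:
    """Decide the case status from UI evidence alone."""
    if not evidence.ui_attempted:
        # A healthy API response cannot stand in for an unattempted UI path.
        return "not_run"
    return "passed" if evidence.ui_result_correct else "failed"
```

For example, `verdict(CaseEvidence(ui_attempted=True, ui_result_correct=False, api_ok=True))` yields `failed`, with `diagnostics` left available to explain where the failure occurred.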