services/computer-use-mcp/FEASIBILITY.md
This document records the validated state of the AIRI-specific macOS desktop orchestration v1 in services/computer-use-mcp.
The current direction is feasible and materially stronger than the earlier pure-vision-only path. The validated architecture is now:
computer-use-mcp keeps trace, audit, screenshot persistence, policy, and the MCP surfaceThat makes the feature an orchestration layer, not just a mouse-clicking demo.
AIRI continues to use the existing stdio MCP bridge through mcp.json.
No transport rewrite was required.
Validated surface in this checkout:
This is a better fit for AIRI than leading with pure screenshot-driven action selection.
Validated by tests:
This gives AIRI a deterministic execution path for many developer workflows without relying on Terminal.app scripting.
Validated by implementation shape:
approval_requiredcomputer_use pending actionsThat keeps approval as a user action, not a model action.
The previous linux-x11 backend remains available as a legacy experimental path.
It is not the primary v1 story anymore, but it was intentionally kept compiling so existing remote smoke tooling still works.
Main v1 story:
macos-localTerminalCursorGoogle ChromedenyAppsExplicit non-goals of this pass:
pnpm -F @proj-airi/computer-use-mcp typecheckpnpm -F @proj-airi/computer-use-mcp testpnpm -F @proj-airi/stage-ui typecheckpnpm -F @proj-airi/stage-tamagotchi typecheckThe feature is now credible as:
It should not be pitched as: