packages/computeruse/README.md
Give AI assistants (Claude, Cursor, VS Code, etc.) the ability to control your desktop and automate tasks across any application.
Claude Code (one-liner):
claude mcp add computeruse "npx -y computeruse-mcp-agent@latest"
Other clients (Cursor, VS Code, Windsurf, etc.):
Add to your MCP config file:
{
"mcpServers": {
"computeruse-mcp-agent": {
"command": "npx",
"args": ["-y", "computeruse-mcp-agent@latest"],
"env": {
"LOG_LEVEL": "info",
"RUST_BACKTRACE": "1"
}
}
}
}
See the MCP Agent README for detailed setup instructions.
95% success rate unlike most computer use overhyped products
We achieve this by pre-training workflows as deterministic code, and calling AI only when recovery is needed.
Our public beta workflow builder + managed hosting:
ComputerUse supports Windows, macOS, and Linux.
| Feature | Windows | macOS | Linux | Notes |
|---|---|---|---|---|
| Core Automation | ||||
| Element Locators | ✅ | ✅ | ✅ | Find elements by name, role, window, etc. |
UI Actions (click, type) | ✅ | ✅ | ✅ | Core interactions with UI elements. |
| Application Management | ✅ | ✅ | ✅ | Launch, list, and manage applications. |
| Window Management | ✅ | ✅ | ✅ | Get active window, list windows. |
| Advanced Features | ||||
| Browser Automation | ✅ | ✅ | ✅ | Chrome extension enables browser control. |
| Workflow Recording | ✅ | 🟡 | 🟡 | Record human workflows for deterministic automation. |
| Monitor Management | ✅ | ✅ | ✅ | Multi-display support. |
| Screen & Element Capture | ✅ | ✅ | ✅ | Take screenshots of displays or elements. |
| Libraries | ||||
Python (computeruse.py) | 🟡 | 🟡 | 🟡 | pip install computeruse |
TypeScript (@elizaos/computeruse) | ✅ | ✅ | ✅ | npm i @elizaos/computeruse |
Workflow (@mediar-ai/workflow) | ✅ | 🟡 | 🟡 | npm i @mediar-ai/workflow |
CLI (@mediar-ai/cli) | ✅ | ✅ | ✅ | npm i @mediar-ai/cli |
KV (@mediar-ai/kv) | ✅ | ✅ | ✅ | npm i @mediar-ai/kv |
MCP (computeruse-mcp-agent) | ✅ | ✅ | ✅ | npx -y computeruse-mcp-agent --add-to-app [app] |
Rust (computeruse-rs) | ✅ | ✅ | ✅ | cargo add computeruse-rs |
Legend:
Platform Notes:
wmctrl and xdotool.name:Seven)To create reliable selectors (e.g. name:Seven, role:Button, window:Calculator), you need to inspect the Windows Accessibility Tree:
osascript -e 'tell application "System Events" to entire contents of window 1 of application process "Safari"'sudo apt install accerciser or sudo dnf install accerciserThese tools show you the
Name,Role,ControlType, and other metadata used in ComputerUse selectors.
| Platform | CLI | MCP Agent | Automation | Installation Method |
|---|---|---|---|---|
| Windows | ✅ | ✅ | ✅ | npm/bunx |
| macOS | ✅ | ✅ | ✅ | npm/bunx (requires Accessibility permissions) |
| Linux | ✅ | ✅ | ✅ | npm/bunx (requires AT-SPI2, wmctrl/xdotool) |
Note:
wmctrl/xdotool for X11 window managementFor detailed troubleshooting, debugging, and MCP server logs, send us a message.
Contributions are welcome! Please feel free to submit issues and pull requests. many parts are experimental, and help is appreciated.