docs/quick-start.md
We're excited to announce the support for UI-TARS-1.5! ๐๐๐
The previous version of UI-TARS Desktop version 0.0.8 will be upgraded to a new Desktop App 0.1.0 with support for both Computer and Browser operator.
Please install Chrome (stable/beta/dev/canary), Edge (stable/beta/dev/canary), or Firefox (stable/beta/dev/nightly) for Browser Operator.
UI-TARS-desktop is currently only available for single monitor setup. Multi-monitor configuration may cause failure for some tasks.
You can download the latest release version of UI-TARS Desktop from our releases page.
Note: If you have Homebrew installed, you can install UI-TARS Desktop by running the following command:
bashbrew install --cask ui-tars
Drag UI TARS application into the Applications folder
Enable the permission of UI TARS in MacOS:
Still to run the application, you can see the following interface:
The Remote Operator service will be discontinued on August 20, 2025. If you wish to deploy your own Remote Computer and Browser Agent after the free trial, you can explore Volcano Engine's OS Agent Services.
Deployment Links (in Chinese): Computer Use Agent and Browser Use Agent
Click the button Deploy from Hugging Face on the top right corner of the page
Select the model UI-TARS-1.5-7B
Refer to README_deploy.md for detailed deployment instructions to obtain the Base URL, API Key, and Model Name.
Open the UI-TARS Desktop App Settings and configure:
Language: en
VLM Provider: Hugging Face for UI-TARS-1.5
VLM Base URL: https:xxx
VLM API KEY: your_api_key
VLM Model Name: xxx
[!NOTE]
For VLM Provider, make sure to select "Hugging Face for UI-TARS-1.5" to ensure proper VLM Action parsing.
For VLM Base URL & VLM Model Name, you can checkout your huggingface endpoint page to see detail information. Please make sure Base URL ends with '/v1/'
Click button starting a new chat
Input the command to start a round of GUI operation tasks!
Visit the VolcEngine Doubao-1.5-UI-TARS page
Click the button Try (็ซๅณไฝ้ช) on the top right corner of the page
Click the API inference (API ๆฅๅ
ฅ) link
Get your API Key from STEP 1 in the drawer panel.
In STEP 2, authenticate your user info and switch to the OpenAI SDK tab to obtain Base Url and Model name๏ผ
Open the UI-TARS Desktop App Settings and configure:
Language: cn
VLM Provider: VolcEngine Ark for Doubao-1.5-UI-TARS
VLM Base URL: https://ark.cn-beijing.volces.com/api/v3
VLM API KEY: YOUR_API_KEY
VLM Model Name: doubao-1.5-ui-tars-250328
[!NOTE] For VLM Provider, make sure to select "VolcEngine Ark for Doubao-1.5-UI-TARS" to ensure proper VLM Action parsing.
[!NOTE] Before using
Browser Operatormode, please ensure that Chrome, Edge, or Firefox is installed on your device.
At this point, you should have successfully launched the UI-TARS-Desktop App! To get the most out of UI-TARS and ensure stable usage, we recommend reviewing the following documentation: