docs/integrate-openwebui.md
This guide shows how to integrate Nexa-SDK with OpenWebUI for a web-based chat interface with locally running models.
OpenWebUI is a modern, feature-rich web interface for interacting with language models. By integrating it with Nexa-SDK, you get:
Before starting, ensure you have:
First, download the model you want to use:
nexa pull Qwen/Qwen3-VL-8B-Instruct-GGUF
Then start the Nexa Serve API server:
nexa serve
The server will start and listen on http://127.0.0.1:18181. You should see output like:
[INFO] Nexa Serve running on http://127.0.0.1:18181
[INFO] OpenAI-compatible API at http://127.0.0.1:18181/v1
Follow the OpenWebUI Installation Guide to set up OpenWebUI on your machine.
Start OpenWebUI, open your browser and navigate to your OpenWebUI instance, typically at:
http://localhost:8080
Sign up and login:
Navigate to Settings:
Configure Nexa-SDK Connection:
Fill in Connection Details:
http://127.0.0.1:18181/v1http://localhost:8080any-key (Nexa Serve doesn't require authentication)Add Model ID:
Qwen/Qwen3-VL-8B-Instruct-GGUF
Problem: OpenWebUI cannot reach Nexa Serve
Solutions:
nexa servehttp://127.0.0.1:18181/v1Problem: The model ID is not recognized
Solutions:
nexa pull Qwen/Qwen3-VL-8B-Instruct-GGUFnexa listnexa list output