cookbook/docker/RAG-VLM/README.md
This demo showcases video analysis using AutoNeural model through Nexa SDK. It extracts frames from uploaded videos at 8-second intervals and performs real-time AI inference using the AutoNeural vision-language model.
┌─────────────────┐
│ Gradio UI │ (Port 7860)
│ - Video upload │
│ - Frame extract│
│ - Real-time │
│ display │
└────────┬────────┘
│ HTTP API
▼
┌─────────────────┐
│ Nexa Serve │ (Port 18181)
│ AutoNeural │
│ /v1/chat/ │
│ completions │
└─────────────────┘
cd cookbook/docker/RAG-VLM
docker build -t autoneural-video-demo .
Mac or Linux
export NEXA_TOKEN={your_token_here}
Windows (Powershell)
$env:NEXA_TOKEN={your_token_here}
docker run -d \
--name autoneural-demo \
--privileged \
-p 18182:18181 \
-p 7860:7860 \
-e NEXA_TOKEN \
-v ~/.cache/nexa.ai:/root/.cache/nexa.ai \
autoneural-video-demo
Note:
~/.cache/nexa.ai with your local model cache directory, or omit the -v flag to use container's internal cache.nexa pull NexaAI/AutoNeuralOpen your browser and navigate to:
http://localhost:7860
NexaAI/AutoNeuralhttp://127.0.0.1:18181nexa command availablenexa pull NexaAI/AutoNeuralpip install -r requirements.txt
nexa serve --host 127.0.0.1:18181
python gradio_ui.py
http://localhost:7860gradio_ui.py via FRAME_INTERVAL_SECONDS)CLIP_LENGTH_SECONDS)NexaAI/AutoNeuralhttp://127.0.0.1:18181These can be changed in the UI's "Model Settings" accordion.
docker logs autoneural-demonexa pull NexaAI/AutoNeuralCopyright 2024-2025 Nexa AI, Inc.
Licensed under the Apache License, Version 2.0 (the "License"); you may not use this file except in compliance with the License. You may obtain a copy of the License at
http://www.apache.org/licenses/LICENSE-2.0
Unless required by applicable law or agreed to in writing, software distributed under the License is distributed on an "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See the License for the specific language governing permissions and limitations under the License.