cookbook/android/RAG-LLM/README.md
The Nexa AI Android SDK enables on-device AI inference for Android applications with NPU acceleration. Run Large Language Models (LLMs), Vision-Language Models (VLMs), Embeddings, Speech Recognition (ASR), Reranking, and Computer Vision models on Android devices with support for NPU, GPU, and CPU inference. Follow Android SDK Doc for more details. This folder contains the demo app for the Android SDK.
git clone https://github.com/NexaAI/nexa-sdk/
Open this folder bindings/android in Android Studio
Follow the instructions in Android SDK Doc, download model (For example, OmniNeural-4B) and put it in App's assets folder (For example, /data/data/com.nexa.demo/files/models/omni-neural-4b ).
Build and run the app