Back to Nexa Sdk

Nexa Android SDK Demo App

cookbook/android/RAG-LLM/README.md

0.2.731.3 KB
Original Source

Nexa Android SDK Demo App

Overview

The Nexa AI Android SDK enables on-device AI inference for Android applications with NPU acceleration. Run Large Language Models (LLMs), Vision-Language Models (VLMs), Embeddings, Speech Recognition (ASR), Reranking, and Computer Vision models on Android devices with support for NPU, GPU, and CPU inference. Follow Android SDK Doc for more details. This folder contains the demo app for the Android SDK.

Device Compatibility

Supported Hardware

  • NPU: Qualcomm Snapdragon 8 Gen 4 (optimized)
  • GPU: Qualcomm Adreno GPU
  • CPU: ARM64-v8a
  • RAM: 4GB+ recommended
  • Storage: 100MB - 4GB (varies by model)

Minimum Requirements

  • Android API Level 27+ (Android 8.1 Oreo)
  • Architecture: ARM64-v8a
  • Android SDK Version: 27+

Build and Run

  1. Clone the repository
bash
git clone https://github.com/NexaAI/nexa-sdk/
  1. Open this folder bindings/android in Android Studio

  2. Follow the instructions in Android SDK Doc, download model (For example, OmniNeural-4B) and put it in App's assets folder (For example, /data/data/com.nexa.demo/files/models/omni-neural-4b ).

  3. Build and run the app