docs/integrations/paddle_ocr.md
Kotaemon provides two PaddleOCR readers to enable document ingestion with full layout understanding, including multilingual text, tables, figures, formulas, and seals.
PaddleOCRVLReader: Wraps the PaddleOCR-VL 1.5 visual-language model for robust layout and VQA-based parsing.
PPStructureV3Reader: Uses the PPStructureV3 pipeline for structured layout analysis, including table and chart detection.
Both readers are located under kotaemon/loaders/paddleocr_loader.
# CPU
uv pip install paddlepaddle==3.3.0 -i https://www.paddlepaddle.org.cn/packages/stable/cpu/
# GPU (requires GPU driver version ≥550.54.14 on Linux or Windows)
uv pip install paddlepaddle-gpu==3.3.0 -i https://www.paddlepaddle.org.cn/packages/stable/cu130/
uv pip install -e "libs/kotaemon[paddleocr]"
Set the PADDLE_DEVICE environment variable in your .env file to control the execution device.
PADDLE_DEVICE=gpu # or: cpu, gpu:0
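To make the accepted values concrete, here is a minimal sketch of how a PADDLE_DEVICE string such as cpu, gpu, or gpu:0 could be validated before it is handed to a reader. The helper name parse_paddle_device is hypothetical and not part of Kotaemon or PaddleOCR. Falling back to cpu when the variable is unset, and treating a bare gpu as device 0, are assumptions made for illustration only.

```python
import os


def parse_paddle_device(value=None):
    """Split a PADDLE_DEVICE-style string into (kind, index).

    Hypothetical helper for illustration; not part of Kotaemon or PaddleOCR.
    """
    if value is None:
        # Assumption: fall back to CPU when the variable is not set.
        value = os.environ.get("PADDLE_DEVICE", "cpu")
    raw = value.strip().lower()
    kind, _, index = raw.partition(":")

    if kind not in ("cpu", "gpu"):
        raise ValueError(f"unsupported PADDLE_DEVICE value: {value!r}")
    if kind == "cpu":
        if index:
            raise ValueError("cpu does not take a device index")
        return ("cpu", None)
    if index == "":
        # Assumption: a bare "gpu" means the first GPU.
        return ("gpu", 0)
    if not index.isdigit():
        raise ValueError(f"invalid GPU index in {value!r}")
    return ("gpu", int(index))
```

Calling parse_paddle_device("gpu:0") yields the pair ("gpu", 0), while a misspelled value such as "cuda" raises a ValueError at startup rather than failing later during ingestion.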
PaddleOCR PPStructureV3 (table+figure extraction)
PaddleOCR-VL (VLM document parsing)