Back to Langflow

lfx-paddle

src/bundles/paddle/README.md

1.11.0.dev271.3 KB
Original Source

lfx-paddle

PaddleOCR as a standalone Langflow Extension Bundle.

Ships the PaddleOCR component, which performs either layout-aware document parsing into Markdown (PP-StructureV3, PaddleOCR-VL-1.6) or plain OCR text recognition (PP-OCRv5, PP-OCRv6). It talks to the PaddleOCR AI Studio async Job HTTP API (submit -> poll -> fetch) directly via httpx, so it does not require the paddleocr Python SDK (whose transitive pyyaml constraint conflicts with Langflow's dependency tree).

Install

bash
pip install lfx-paddle

The bundle is registered automatically via the langflow.extensions entry-point. After install, restart your Langflow server; the component will appear in the palette under the paddle group.

You will need an AI Studio access token (https://aistudio.baidu.com/account/accessToken) to run the component.

Develop

bash
cd src/bundles/paddle
pip install -e .
lfx extension validate src/lfx_paddle

Migration

Saved flows referencing the legacy class name or the old import paths under lfx.components.paddle.* are rewritten to the new namespaced ID ext:paddle:PaddleOCRComponent@official by the migration table in src/lfx/src/lfx/extension/migration/migration_table.json.