docs/index/index.en.md
PaddleOCR aims to create multilingual, awesome, leading, and practical OCR tools that help users train better models and apply them into practice.
PaddleOCR is being oversight by a PMC. Issues and PRs will be reviewed on a best-effort basis. For a complete overview of PaddlePaddle community, please visit community.
⚠️ Note: The Issues module is only for reporting program 🐞 bugs, for the rest of the questions, please move to the Discussions. Please note that if the Issue mentioned is not a bug, it will be moved to the Discussions module.
🔥🔥2025.3.7 release PaddleOCR v2.10, including:
12 new self-developed single models:
4 high-value multi-model combination solutions:
🔥 2024.10.18 release PaddleOCR v2.9, including:
PaddleX, an All-in-One development tool based on PaddleOCR's advanced technology, supports low-code full-process development capabilities in the OCR field:
🎨 Rich Model One-Click Call: Integrates 17 models related to text image intelligent analysis, general OCR, general layout parsing, table recognition, formula recognition, and seal recognition into 6 pipelines, which can be quickly experienced through a simple Python API one-click call. In addition, the same set of APIs also supports a total of 200+ models in image classification, object detection, image segmentation, and time series forecasting, forming 20+ single-function modules, making it convenient for developers to use model combinations.
🚀 High Efficiency and Low barrier of entry: Provides two methods based on unified commands and GUI to achieve simple and efficient use, combination, and customization of models. Supports multiple deployment methods such as high-performance inference, service-oriented deployment, and on-device deployment. Additionally, for various mainstream hardware such as NVIDIA GPU, Kunlunxin XPU, Ascend NPU, Cambricon MLU, and Haiguang DCU, models can be developed with seamless switching.
Supports PP-ChatOCRv3-doc, high-precision layout detection model based on RT-DETR and high-efficiency layout area detection model based on PicoDet, high-precision table structure recognition model, text image unwarping model UVDoc, formula recognition model LatexOCR, and document image orientation classification model based on PP-LCNet.
🔥2023.8.7 Release PaddleOCRrelease/2.7
Release PP-OCRv4, support mobile version and server version
ReleasePP-ChatOCR, a new scheme for extracting key information of general scenes using PP-OCR model and ERNIE LLM.
🔨2022.11 Add implementation of 4 cutting-edge algorithms:Text Detection DRRG, Text Recognition RFL, Image Super-Resolution Text Telescope,Handwritten Mathematical Expression Recognition CAN
2022.10 release optimized JS version PP-OCRv3 model with 4.3M model size, 8x faster inference time, and a ready-to-use web demo
🔥2022.8.24 Release PaddleOCR release/2.6
🔥2022.8 Release OCR scene application collection
2022.8 Add implementation of 8 cutting-edge algorithms
2022.5.9 Release PaddleOCR release/2.5
PaddleOCR support a variety of cutting-edge algorithms related to OCR, and developed industrial featured models/solution PP-OCR、PP-Structure and PP-ChatOCR on this basis, and get through the whole process of data production, model training, compression, inference and deployment.
It is recommended to start with the “quick experience” in the document tutorial
PaddleX provides a one-stop full-process high-efficiency development platform for flying paddle ecological model training, pressure, and push. Its mission is to help AI technology quickly land, and its vision is to make everyone an AI Developer!
If you want to request a new language support, a PR with 1 following files are needed:
{language}_dict.txt that contains a list of all characters. Please see the format example from other files in that folder.If your language has unique elements, please tell me in advance within any way, such as useful links, wikipedia and so on.
More details, please refer to Multilingual OCR Development Plan.
layout analysis + table recognition
SER (Semantic entity recognition)
RE (Relation Extraction)
This project is released under <a href="https://github.com/PaddlePaddle/PaddleOCR/blob/master/LICENSE">Apache 2.0 license</a>