Back to Paddleocr

OCR Model List(V3, updated on 2022.4.28)

docs/version2.x/ppocr/model_list.en.md

3.5.020.6 KB
Original Source

OCR Model List(V3, updated on 2022.4.28)

Note

  1. Compared with model v2, the 3rd version of the detection model has an improvement in accuracy, and the 2.1 version of the recognition model has optimizations in accuracy and speed with CPU.
  2. Compared with models 1.1, which are trained with static graph programming paradigm, models 2.0 or higher are the dynamic graph trained version and achieve close performance.
  3. All models in this tutorial are from the PaddleOCR series, for more introduction to algorithms and models based on the public dataset, you can refer to algorithm overview tutorial.

The downloadable models provided by PaddleOCR include the inference model, trained model, pre-trained model and nb model. The differences between the models are as follows:

model typemodel formatdescription
inference modelinference.pdmodel、inference.pdiparamsUsed for inference based on Paddle inference engine,detail
trained model, pre-trained model*.pdparams、*.pdopt、*.statesThe checkpoints model saved in the training process, which stores the parameters of the model, is mostly used for model evaluation and continuous training.
nb model*.nbModel optimized by Paddle-Lite, which is suitable for mobile-side deployment scenarios (Paddle-Lite is needed for nb model deployment).

The relationship of the above models is as follows.

1. Text Detection Model

1. Chinese Detection Model

model namedescriptionconfigmodel sizedownload
PP-OCRv4_mobile_det[New] Original lightweight model, supporting Chinese, English, multilingual text detectionPP-OCRv4_mobile_det.yml4.70Minference model / trained model
PP-OCRv4_server_det[New] Original high-precision model, supporting Chinese, English, multilingual text detectionPP-OCRv4_server_det.yml110Minference model / trained model
PP-OCRv3_mobile_detOriginal lightweight model, supporting Chinese, English, multilingual text detectionPP-OCRv3_mobile_det.yml1.1Minference model / trained model / nb模型
PP-OCRv3_server_detOriginal high-precision model, supporting Chinese, English, multilingual text detectionPP-OCRv3_server_det.yml3.80Minference model / trained model

1.2 English Detection Model

model namedescriptionconfigmodel sizedownload
en_PP-OCRv3_det_slim[New] Slim quantization with distillation lightweight detection model, supporting EnglishPP-OCRv3_det_cml.yml1.1Minference model / trained model / nb model
en_PP-OCRv3_det[New] Original lightweight detection model, supporting EnglishPP-OCRv3_det_cml.yml3.8Minference model / trained model
  • Note: English configuration file is the same as Chinese except for training data, here we only provide one configuration file.

1.3 Multilingual Detection Model

model namedescriptionconfigmodel sizedownload
ml_PP-OCRv3_det_slim[New] Slim quantization with distillation lightweight detection model, supporting EnglishPP-OCRv3_det_cml.yml1.1Minference model / trained model / nb model
ml_PP-OCRv3_det[New] Original lightweight detection model, supporting EnglishPP-OCRv3_det_cml.yml3.8Minference model / trained model
  • Note: English configuration file is the same as Chinese except for training data, here we only provide one configuration file.

2. Text Recognition Model

2.1 Chinese Recognition Model

model namedescriptionconfigmodel sizedownload
PP-OCRv3_mobile_rec_slim[New] Slim quantization with distillation lightweight model, supporting Chinese, English text recognitionPP-OCRv3_mobile_rec_distillation.yml4.9Minference model / trained model / nb model
PP-OCRv3_mobile_rec[New] Original lightweight model, supporting Chinese, English, multilingual text recognitionPP-OCRv3_mobile_rec_distillation.yml12.4Minference model / trained model
ch_PP-OCRv2_rec_slimSlim quantization with distillation lightweight model, supporting Chinese, English text recognitionch_PP-OCRv2_rec.yml9.0Minference model / trained model
ch_PP-OCRv2_recOriginal lightweight model, supporting Chinese, English, and multilingual text recognitionch_PP-OCRv2_rec_distillation.yml8.5Minference model / trained model
ch_ppocr_mobile_slim_v2.0_recSlim pruned and quantized lightweight model, supporting Chinese, English and number recognitionrec_chinese_lite_train_v2.0.yml6.0Minference model / trained model
ch_ppocr_mobile_v2.0_recOriginal lightweight model, supporting Chinese, English and number recognitionrec_chinese_lite_train_v2.0.yml5.2Minference model / trained model / pre-trained model
ch_ppocr_server_v2.0_recGeneral model, supporting Chinese, English and number recognitionrec_chinese_common_train_v2.0.yml94.8Minference model / trained model / pre-trained model

Note: The trained model is fine-tuned on the pre-trained model with real data and synthesized vertical text data, which achieved better performance in the real scene. The pre-trained model is directly trained on the full amount of real data and synthesized data, which is more suitable for fine-tuning your dataset.

2.2 English Recognition Model

model namedescriptionconfigmodel sizedownload
en_PP-OCRv3_mobile_rec_slim[New] Slim quantization with distillation lightweight model, supporting English, English text recognitionen_PP-OCRv3_mobile_rec.yml3.2Minference model / trained model / nb model
en_PP-OCRv3_mobile_rec[New] Original lightweight model, supporting English, English, multilingual text recognitionen_PP-OCRv3_mobile_rec.yml9.6Minference model / trained model
en_number_mobile_slim_v2.0_recSlim pruned and quantized lightweight model, supporting English and number recognitionrec_en_number_lite_train.yml2.7Minference model / trained model
en_number_mobile_v2.0_recOriginal lightweight model, supporting English and number recognitionrec_en_number_lite_train.yml2.6Minference model / trained model

Note: Dictionary file of all English recognition models is ppocr/utils/en_dict.txt.

2.3 Multilingual Recognition Model(Updating...)

model namedict filedescriptionconfigmodel sizedownload
korean_PP-OCRv3_mobile_recppocr/utils/dict/korean_dict.txtLightweight model for Korean recognitionkorean_PP-OCRv3_mobile_rec.yml11.0Minference model / trained model
japan_PP-OCRv3_mobile_recppocr/utils/dict/japan_dict.txtLightweight model for Japanese recognitionjapan_PP-OCRv3_mobile_rec.yml11.0Minference model / trained model
chinese_cht_PP-OCRv3_mobile_recppocr/utils/dict/chinese_cht_dict.txtLightweight model for chinese chtchinese_cht_PP-OCRv3_mobile_rec.yml12.0Minference model / trained model
te_PP-OCRv3_mobile_recppocr/utils/dict/te_dict.txtLightweight model for Telugu recognitionte_PP-OCRv3_mobile_rec.yml9.6Minference model / trained model
ka_PP-OCRv3_mobile_recppocr/utils/dict/ka_dict.txtLightweight model for Kannada recognitionka_PP-OCRv3_mobile_rec.yml9.9Minference model / trained model
ta_PP-OCRv3_mobile_recppocr/utils/dict/ta_dict.txtLightweight model for Tamil recognitionta_PP-OCRv3_mobile_rec.yml9.6Minference model / trained model
latin_PP-OCRv3_mobile_recppocr/utils/dict/latin_dict.txtLightweight model for latin recognitionlatin_PP-OCRv3_mobile_rec.yml9.7Minference model / trained model
arabic_PP-OCRv3_mobile_recppocr/utils/dict/arabic_dict.txtLightweight model for arabic recognitionarabic_PP-OCRv3_mobile_rec.yml9.6Minference model / trained model
cyrillic_PP-OCRv3_mobile_recppocr/utils/dict/cyrillic_dict.txtLightweight model for cyrillic recognitioncyrillic_PP-OCRv3_mobile_rec.yml9.6Minference model / trained model
devanagari_PP-OCRv3_mobile_recppocr/utils/dict/devanagari_dict.txtLightweight model for devanagari recognitiondevanagari_PP-OCRv3_mobile_rec.yml9.9Minference model / trained model

For a complete list of languages ​​and tutorials, please refer to Multi-language model

3. Text Angle Classification Model

model namedescriptionconfigmodel sizedownload
ch_ppocr_mobile_slim_v2.0_clsSlim quantized model for text angle classificationcls_mv3.yml2.1Minference model / trained model / nb model
ch_ppocr_mobile_v2.0_clsOriginal model for text angle classificationcls_mv3.yml1.38Minference model / trained model

4. Paddle-Lite Model

Paddle Lite is an updated version of Paddle-Mobile, an open-open source deep learning framework designed to make it easy to perform inference on mobile, embedded, and IoT devices. It can further optimize the inference model and generate the nb model used for edge devices. It's suggested to optimize the quantization model using Paddle-Lite because the INT8 format is used for the model storage and inference.

This chapter lists OCR nb models with PP-OCRv2 or earlier versions. You can access the latest nb models from the above tables.

VersionIntroductionModel sizeDetection modelText Direction modelRecognition modelPaddle-Lite branch
PP-OCRv2extra-lightweight chinese OCR optimized model11.0Mdownload linkdownload linkdownload linkv2.10
PP-OCRv2(slim)extra-lightweight chinese OCR optimized model4.6Mdownload linkdownload linkdownload linkv2.10
PP-OCRv2extra-lightweight chinese OCR optimized model11.0Mdownload linkdownload linkdownload linkv2.9
PP-OCRv2(slim)extra-lightweight chinese OCR optimized model4.9Mdownload linkdownload linkdownload linkv2.9
V2.0ppocr_v2.0 extra-lightweight chinese OCR optimized model7.8Mdownload linkdownload linkdownload linkv2.9
V2.0(slim)ppocr_v2.0 extra-lightweight chinese OCR optimized model3.3Mdownload linkdownload linkdownload linkv2.9