Back to Paddleocr

1. Introduction to PP-OCRv5 Multilingual Text Recognition

docs/version3.x/algorithm/PP-OCRv5/PP-OCRv5_multi_languages.en.md

3.5.016.7 KB
Original Source

1. Introduction to PP-OCRv5 Multilingual Text Recognition

PP-OCRv5 is the latest generation text recognition solution in the PP-OCR series, focusing on multi-scenario and multilingual text recognition tasks. In terms of supported text types, the default configuration of the recognition model can accurately identify five major types: Simplified Chinese, Pinyin, Traditional Chinese, English, and Japanese. Additionally, PP-OCRv5 offers multilingual text recognition capabilities covering 106 languages, including Korean, Spanish, French, Portuguese, German, Italian, Russian, Thai, Greek and more (for a full list of supported languages and abbreviations, see Section 4). Compared to the previous PP-OCRv3 version, PP-OCRv5 achieves over a 30% improvement in accuracy for multilingual text recognition.

<div align="center">

<b>French Recognition Result</b>

</div> <div align="center">

<b>German Recognition Result</b>

</div> <div align="center">

<b>Korean Recognition Result</b>

</div> <div align="center">

<b>Russian Recognition Result</b>

</div> <div align="center">

<b>Thai recognition result</b>

</div> <div align="center">

<b>Greek recognition result</b>

</div> <div align="center">

<b>Arabic recognition Result</b>

</div> <div align="center">

<b>Hindi recognition Result</b>

</div> <div align="center">

<b>Tamil recognition Result</b>

</div> <div align="center">

<b>Telugu recognition Result</b>

</div>

2. Quick Start

You can specify the language for text recognition by using the --lang parameter when running the general OCR pipeline in the command line:

bash
# Use the `--lang` parameter to specify the French recognition model
paddleocr ocr -i https://paddle-model-ecology.bj.bcebos.com/paddlex/imgs/demo_image/general_ocr_french01.png \
    --lang fr \
    --use_doc_orientation_classify False \
    --use_doc_unwarping False \
    --use_textline_orientation False \
    --save_path ./output \
    --device gpu:0 

For explanations of the other command-line parameters, please refer to the Command Line Usage section of the general OCR pipeline documentation. After running, the results will be displayed in the terminal:

bash
{'res': {'input_path': '/root/.paddlex/predict_input/general_ocr_french01.png', 'page_index': None, 'model_settings': {'use_doc_preprocessor': True, 'use_textline_orientation': False}, 'doc_preprocessor_res': {'input_path': None, 'page_index': None, 'model_settings': {'use_doc_orientation_classify': False, 'use_doc_unwarping': False}, 'angle': -1}, 'dt_polys': array([[[119,  23],
        ...,
        [118,  75]],

       ...,

       [[109, 506],
        ...,
        [108, 556]]], dtype=int16), 'text_det_params': {'limit_side_len': 64, 'limit_type': 'min', 'thresh': 0.3, 'max_side_limit': 4000, 'box_thresh': 0.6, 'unclip_ratio': 1.5}, 'text_type': 'general', 'textline_orientation_angles': array([-1, ..., -1]), 'text_rec_score_thresh': 0.0, 'rec_texts': ['mifere; la profpérité & les fuccès ac-', 'compagnent l’homme induftrieux.', 'Quel eft celui qui a acquis des ri-', 'cheffes, qui eft devenu puiffant, qui', 's’eft couvert de gloire, dont l’éloge', 'retentit par-tout, qui fiege au confeil', "du Roi? C'eft celui qui bannit la pa-", "reffe de fa maifon, & qui a dit à l'oifi-", 'veté : tu es mon ennemie.'], 'rec_scores': array([0.98409832, ..., 0.98091048]), 'rec_polys': array([[[119,  23],
        ...,
        [118,  75]],

       ...,

       [[109, 506],
        ...,
        [108, 556]]], dtype=int16), 'rec_boxes': array([[118, ...,  81],
       ...,
       [108, ..., 562]], dtype=int16)}}

If you specify save_path, the visualization results will be saved to the specified path. An example of the visualized result is shown below:

You can also use Python code to specify the recognition model for a particular language when initializing the general OCR pipeline via the lang parameter:

python
from paddleocr import PaddleOCR

ocr = PaddleOCR(
    lang="fr", # Specify French recognition model with the lang parameter
    use_doc_orientation_classify=False, # Disable document orientation classification model
    use_doc_unwarping=False, # Disable text image unwarping model
    use_textline_orientation=False, # Disable text line orientation classification model
)
result = ocr.predict("https://paddle-model-ecology.bj.bcebos.com/paddlex/imgs/demo_image/general_ocr_french01.png")
for res in result:
    res.print()
    res.save_to_img("output")
    res.save_to_json("output")

For more details on the PaddleOCR class parameters, please refer to the Python Scripting Integration section of the general OCR pipeline documentation.

3. Performance Comparison

ModelDownload LinkAccuracy on the corresponding dataset (%)Improvement over the previous generation model (%) |
korean_PP-OCRv5_mobile_rec<a href="https://paddle-model-ecology.bj.bcebos.com/paddlex/official_inference_model/paddle3.0.0/korean_PP-OCRv5_mobile_rec_infer.tar">Inference Model</a>/<a href="https://paddle-model-ecology.bj.bcebos.com/paddlex/official_pretrained_model/korean_PP-OCRv5_mobile_rec_pretrained.pdparams">Pretrained Model</a>88.065.0
latin_PP-OCRv5_mobile_rec<a href="https://paddle-model-ecology.bj.bcebos.com/paddlex/official_inference_model/paddle3.0.0/latin_PP-OCRv5_mobile_rec_infer.tar">Inference Model</a>/<a href="https://paddle-model-ecology.bj.bcebos.com/paddlex/official_pretrained_model/latin_PP-OCRv5_mobile_rec_pretrained.pdparams">Pretrained Model</a>84.746.8
eslav_PP-OCRv5_mobile_rec<a href="https://paddle-model-ecology.bj.bcebos.com/paddlex/official_inference_model/paddle3.0.0/eslav_PP-OCRv5_mobile_rec_infer.tar">Inference Model</a>/<a href="https://paddle-model-ecology.bj.bcebos.com/paddlex/official_pretrained_model/eslav_PP-OCRv5_mobile_rec_pretrained.pdparams">Pretrained Model</a>81.631.4
th_PP-OCRv5_mobile_rec<a href="https://paddle-model-ecology.bj.bcebos.com/paddlex/official_inference_model/paddle3.0.0/th_PP-OCRv5_mobile_rec_infer.tar">Inference Model</a>/<a href="https://paddle-model-ecology.bj.bcebos.com/paddlex/official_pretrained_model/th_PP-OCRv5_mobile_rec_pretrained.pdparams">Pretrained Model</a>82.68-
el_PP-OCRv5_mobile_rec<a href="https://paddle-model-ecology.bj.bcebos.com/paddlex/official_inference_model/paddle3.0.0/el_PP-OCRv5_mobile_rec_infer.tar">Inference Model</a>/<a href="https://paddle-model-ecology.bj.bcebos.com/paddlex/official_pretrained_model/el_PP-OCRv5_mobile_rec_pretrained.pdparams">Pretrained Model</a>89.28-
en_PP-OCRv5_mobile_rec<a href="https://paddle-model-ecology.bj.bcebos.com/paddlex/official_inference_model/paddle3.0.0/en_PP-OCRv5_mobile_rec_infer.tar">Inference Model</a>/<a href="https://paddle-model-ecology.bj.bcebos.com/paddlex/official_pretrained_model/en_PP-OCRv5_mobile_rec_pretrained.pdparams">Pretrained Model</a>85.2511.0
cyrillic_PP-OCRv5_mobile_rec<a href="https://paddle-model-ecology.bj.bcebos.com/paddlex/official_inference_model/paddle3.0.0/cyrillic_PP-OCRv5_mobile_rec_infer.tar">Inference Model</a>/<a href="https://paddle-model-ecology.bj.bcebos.com/paddlex/official_pretrained_model/cyrillic_PP-OCRv5_mobile_rec_pretrained.pdparams">Trained Model</a>80.2721.2
arabic_PP-OCRv5_mobile_rec<a href="https://paddle-model-ecology.bj.bcebos.com/paddlex/official_inference_model/paddle3.0.0/arabic_PP-OCRv5_mobile_rec_infer.tar">Inference Model</a>/<a href="https://paddle-model-ecology.bj.bcebos.com/paddlex/official_pretrained_model/arabic_PP-OCRv5_mobile_rec_pretrained.pdparams">Trained Model</a>81.2722.83
devanagari_PP-OCRv5_mobile_rec<a href="https://paddle-model-ecology.bj.bcebos.com/paddlex/official_inference_model/paddle3.0.0/devanagari_PP-OCRv5_mobile_rec_infer.tar">Inference Model</a>/<a href="https://paddle-model-ecology.bj.bcebos.com/paddlex/official_pretrained_model/devanagari_PP-OCRv5_mobile_rec_pretrained.pdparams">Trained Model</a>84.9668.26
te_PP-OCRv5_mobile_rec<a href="https://paddle-model-ecology.bj.bcebos.com/paddlex/official_inference_model/paddle3.0.0/te_PP-OCRv5_mobile_rec_infer.tar">Inference Model</a>/<a href="https://paddle-model-ecology.bj.bcebos.com/paddlex/official_pretrained_model/te_PP-OCRv5_mobile_rec_pretrained.pdparams">Trained Model</a>87.6543.47
ta_PP-OCRv5_mobile_rec<a href="https://paddle-model-ecology.bj.bcebos.com/paddlex/official_inference_model/paddle3.0.0/ta_PP-OCRv5_mobile_rec_infer.tar">Inference Model</a>/<a href="https://paddle-model-ecology.bj.bcebos.com/paddlex/official_pretrained_model/ta_PP-OCRv5_mobile_rec_pretrained.pdparams">Trained Model</a>94.239.23

Notes:

  • Korean Dataset: The latest PP-OCRv5 dataset containing 5,007 Korean text images.
  • Latin Script Language Dataset: The latest PP-OCRv5 dataset containing 3,111 images of Latin script languages.
  • East Slavic Language Dataset: The latest PP-OCRv5 dataset containing a total of 7,031 text images in Russian, Belarusian, and Ukrainian.
  • Thai dataset: The latest PP-OCRv5 constructed Thai dataset contains a total of 4,261 text images for recognition.
  • Greek dataset: The latest PP-OCRv5 constructed Greek dataset contains a total of 2,799 text images for recognition.
  • English dataset: The latest PP-OCRv5 constructed English dataset contains a total of 6,530 text images for recognition.
  • Cyrillic Dataset: The latest PP-OCRv5 Cyrillic recognition dataset contains a total of 7,600 text images.
  • Tamil Dataset: The latest PP-OCRv5 Tamil recognition dataset contains a total of 2,121 text images.
  • Telugu Dataset: The latest PP-OCRv5 Telugu recognition dataset contains a total of 2,478 text images.
  • Arabic Dataset: The latest PP-OCRv5 Arabic, Sanskrit, etc. recognition dataset contains a total of 2,676 text images.
  • Devanagari Dataset: The latest PP-OCRv5 Devanagari recognition dataset contains a total of 3,611 text images.

4. Supported Languages and Abbreviations

LanguageDescriptionAbbreviationLanguageDescriptionAbbreviation
ChineseChinese & EnglishchHungarianHungarianhu
EnglishEnglishenSerbian (latin)Serbian (latin)rs_latin
FrenchFrenchfrIndonesianIndonesianid
GermanGermandeOccitanOccitanoc
JapaneseJapanesejapanIcelandicIcelandicis
KoreanKoreankoreanLithuanianLithuanianlt
Traditional ChineseChinese Traditionalchinese_chtMaoriMaorimi
AfrikaansAfrikaansafMalayMalayms
ItalianItalianitDutchDutchnl
SpanishSpanishesNorwegianNorwegianno
BosnianBosnianbsPolishPolishpl
PortuguesePortugueseptSlovakSlovaksk
CzechCzechcsSlovenianSloveniansl
WelshWelshcyAlbanianAlbaniansq
DanishDanishdaSwedishSwedishsv
EstonianEstonianetSwahiliSwahilisw
IrishIrishgaTagalogTagalogtl
CroatianCroatianhrTurkishTurkishtr
UzbekUzbekuzLatinLatinla
RussianRussianruBelarusianBelarusianbe
UkrainianUkrainianukThaiThaith
GreekGreekelAzerbaijaniAzerbaijaniaz
KurdishKurdishkuLatvianLatvianlv
MalteseMaltesemtPaliPalipi
RomanianRomanianroVietnameseVietnamesevi
FinnishFinnishfiBasqueBasqueeu
GalicianGalicianglLuxembourgishLuxembourgishlb
RomanshRomanshrmCatalanCatalanca
QuechuaQuechuaquTeluguTelugute
Serbian (Cyrillic)Serbian (Cyrillic)rs_cyrillicBulgarianBulgarianbg
MongolianMongolianmnAbkhazAbkhazab
AdygheAdygheadyKabardianKabardiankbd
AvarAvaravDargwaDargwadar
IngushIngushinhChechenChechence
LakLaklkiLezgianLezgianlez
TabasaranTabasarantabKazakhKazakhkk
KyrgyzKyrgyzkyTajikTajiktg
MacedonianMacedonianmkTatarTatartt
ChuvashChuvashcvBashkirBashkirba
MariMarimhrMoldovanMoldovanmo
UdmurtUdmurtudmKomiKomikv
OssetianOssetianosBuriatBuriatbua
KalmykKalmykxalTuvinianTuviniantyv
SakhaSakhasahKarakalpakKarakalpakkaa
ArabicArabicarPersianPersianfa
UyghurUyghurugUrduUrduur
PashtoPashtopsKurdishKurdishku
SindhiSindhisdBalochiBalochibal
HindiHindihiMarathiMarathimr
NepaliNepalineBihariBiharibh
MaithiliMaithilimaiOld EnglishOld Englishang
BhojpuriBhojpuribhoMagahiMagahimah
SadriSadrisckNewarNewarnew
KonkaniKonkanigomSanskritSanskritsa
HaryanviHaryanvibgcTamilTamilta

5. Models and Their Supported Languages

ModelSupported Languages
korean_PP-OCRv5_mobile_recKorean, English
latin_PP-OCRv5_mobile_recFrench, German, Afrikaans, Italian, Spanish, Bosnian, Portuguese, Czech, Welsh, Danish, Estonian, Irish, Croatian, Uzbek, Hungarian, Serbian (Latin), Indonesian, Occitan, Icelandic, Lithuanian, Maori, Malay, Dutch, Norwegian, Polish, Slovak, Slovenian, Albanian, Swedish, Swahili, Tagalog, Turkish, Latin, Azerbaijani, Kurdish, Latvian, Maltese, Pali, Romanian, Vietnamese, Finnish, Basque, Galician, Luxembourgish, Romansh, Catalan, Quechua
eslav_PP-OCRv5_mobile_recRussian, Belarusian, Ukrainian, English
th_PP-OCRv5_mobile_recThai, English
el_PP-OCRv5_mobile_recGreek, English
en_PP-OCRv5_mobile_recEnglish
cyrillic_PP-OCRv5_mobile_recRussian, Belarusian, Ukrainian, Serbian (Cyrillic), Bulgarian, Mongolian, Abkhazian, Adyghe, Kabardian, Avar, Dargin, Ingush, Chechen, Lak, Lezgin, Tabasaran, Kazakh, Kyrgyz, Tajik, Macedonian, Tatar, Chuvash, Bashkir, Malian, Moldovan, Udmurt, Komi, Ossetian, Buryat, Kalmyk, Tuvan, Sakha, Karakalpak, English
arabic_PP-OCRv5_mobile_recArabic, Persian, Uyghur, Urdu, Pashto, Kurdish, Sindhi, Balochi, English
devanagari_PP-OCRv5_mobile_recHindi, Marathi, Nepali, Bihari, Maithili, Angika, Bhojpuri, Magahi, Santali, Newari, Konkani, Sanskrit, Haryanvi, English
ta_PP-OCRv5_mobile_recTamil, English
te_PP-OCRv5_mobile_recTelugu, English

note: en_PP-OCRv5_mobile_rec is an optimized version based on the PP-OCRv5 model, specifically fine-tuned for English scenarios. It demonstrates higher recognition accuracy and better adaptability when processing English text.