Back to Paddleocr

Algorithms

docs/version2.x/algorithm/overview.en.md

3.5.013.3 KB
Original Source

Algorithms

This tutorial lists the OCR algorithms supported by PaddleOCR, as well as the models and metrics of each algorithm on English public datasets. It is mainly used for algorithm introduction and algorithm performance comparison. For more models on other datasets including Chinese, please refer to PP-OCRv3 models list.

Developers are welcome to contribute more algorithms! Please refer to add new algorithm guideline.

1. Two-stage OCR Algorithms

1.1 Text Detection Algorithms

Supported text detection algorithms (Click the link to get the tutorial):

On the ICDAR2015 dataset, the text detection result is as follows:

ModelBackbonePrecisionRecallHmeanDownload link
EASTResNet50_vd88.71%81.36%84.88%trained model
EASTMobileNetV378.20%79.10%78.65%trained model
DBResNet50_vd86.41%78.72%82.38%trained model
DBMobileNetV377.29%73.08%75.12%trained model
SASTResNet50_vd91.39%83.77%87.42%trained model
PSEResNet50_vd85.81%79.53%82.55%trained model
PSEMobileNetV382.20%70.48%75.89%trained model
DB++ResNet5090.89%82.66%86.58%pretrained model/trained model

On Total-Text dataset, the text detection result is as follows:

ModelBackbonePrecisionRecallHmeanDownload link
SASTResNet50_vd89.63%78.44%83.66%trained model
CTResNet18_vd88.68%81.70%85.05%trained model

On CTW1500 dataset, the text detection result is as follows:

ModelBackbonePrecisionRecallHmeanDownload link
FCEResNet50_dcn88.39%82.18%85.27%trained model
DRRGResNet50_vd89.92%80.91%85.18%trained model

Note: Additional data, like icdar2013, icdar2017, COCO-Text, ArT, was added to the model training of SAST. Download English public dataset in organized format used by PaddleOCR from:

1.2 Text Recognition Algorithms

Supported text recognition algorithms (Click the link to get the tutorial):

Refer to DTRB, the training and evaluation result of these above text recognition (using MJSynth and SynthText for training, evaluate on IIIT, SVT, IC03, IC13, IC15, SVTP, CUTE) is as follow:

ModelBackboneAvg AccuracyModule combinationDownload link
RosettaResnet34_vd79.11%rec_r34_vd_none_none_ctctrained model
RosettaMobileNetV375.80%rec_mv3_none_none_ctctrained model
CRNNResnet34_vd81.04%rec_r34_vd_none_bilstm_ctctrained model
CRNNMobileNetV377.95%rec_mv3_none_bilstm_ctctrained model
StarNetResnet34_vd82.85%rec_r34_vd_tps_bilstm_ctctrained model
StarNetMobileNetV379.28%rec_mv3_tps_bilstm_ctctrained model
RAREResnet34_vd83.98%rec_r34_vd_tps_bilstm_atttrained model
RAREMobileNetV381.76%rec_mv3_tps_bilstm_atttrained model
SRNResnet50_vd_fpn86.31%rec_r50fpn_vd_none_srntrained model
NRTRNRTR_MTB84.21%rec_mtb_nrtrtrained model
SARResnet3187.20%rec_r31_sartrained model
SEEDAster_Resnet85.35%rec_resnet_stn_bilstm_atttrained model
SVTRSVTR-Tiny89.25%rec_svtr_tiny_none_ctc_entrained model
ViTSTRViTSTR79.82%rec_vitstr_none_cetrained model
ABINetResnet4590.75%rec_r45_abinettrained model
VisionLANResnet4590.30%rec_r45_visionlantrained model
SPINResNet3290.00%rec_r32_gaspin_bilstm_atttrained model
RobustScannerResNet3187.77%rec_r31_robustscannertrained model
RFLResNetRFL88.63%rec_resnet_rfl_atttrained model
ParseQVIT91.24%rec_vit_parseq_synthtrained model
CPPDSVTR-Base93.8%rec_svtrnet_cppd_base_entrained model
SATRNShallowCNN88.05%rec_satrntrained model

1.3 Text Super-Resolution Algorithms

Supported text super-resolution algorithms (Click the link to get the tutorial):

On the TextZoom public dataset, the effect of the algorithm is as follows:

ModelBackbonePSNR_AvgSSIM_AvgConfigDownload link
Text Gestalttsrn19.280.6560configs/sr/sr_tsrn_transformer_strock.ymltrained model
Text Telescopetbsrn21.560.7411configs/sr/sr_telescope.ymltrained model

1.4 Formula Recognition Algorithm

Supported formula recognition algorithms (Click the link to get the tutorial):

On the CROHME handwritten formula dataset, the effect of the algorithm is as follows:

ModelBackboneConfigExpRateDownload link
CANDenseNetrec_d28_can.yml51.72%trained model

2. End-to-end OCR Algorithms

Supported end-to-end algorithms (Click the link to get the tutorial):

3. Table Recognition Algorithms

Supported table recognition algorithms (Click the link to get the tutorial):

On the PubTabNet dataset, the algorithm result is as follows:

ModelBackboneConfigAccDownload link
TableMasterTableResNetExtraconfigs/table/table_master.yml77.47%trained model / inference model

4. Key Information Extraction Algorithms

Supported KIE algorithms (Click the link to get the tutorial):

On wildreceipt dataset, the algorithm result is as follows:

ModelBackboneConfigHmeanDownload link
SDMGRVGG6configs/kie/sdmgr/kie_unet_sdmgr.yml86.70%trained model

On XFUND_zh dataset, the algorithm result is as follows:

ModelBackboneTaskConfigHmeanDownload link
VI-LayoutXLMVI-LayoutXLM-baseSERser_vi_layoutxlm_xfund_zh_udml.yml93.19%trained model
LayoutXLMLayoutXLM-baseSERser_layoutxlm_xfund_zh.yml90.38%trained model
LayoutLMLayoutLM-baseSERser_layoutlm_xfund_zh.yml77.31%trained model
LayoutLMv2LayoutLMv2-baseSERser_layoutlmv2_xfund_zh.yml85.44%trained model
VI-LayoutXLMVI-LayoutXLM-baseREre_vi_layoutxlm_xfund_zh_udml.yml83.92%trained model
LayoutXLMLayoutXLM-baseREre_layoutxlm_xfund_zh.yml74.83%trained model
LayoutLMv2LayoutLMv2-baseREre_layoutlmv2_xfund_zh.yml67.77%trained model