What is: PP-OCR?

Source: PP-OCR: A Practical Ultra Lightweight OCR System
Year: 2020
Data Source: CC BY-SA - https://paperswithcode.com

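PP-OCR is an OCR system that consists of three parts: text detection, detected box rectification, and text recognition. The purpose of text detection is to locate the text areas in the image. In PP-OCR, Differentiable Binarization (DB), which is based on a simple segmentation network, is used as the text detector. Detected box rectification transforms each detected box into a horizontal rectangle and corrects its text direction before recognition. The text recognizer is CRNN, which integrates feature extraction and sequence modeling. It adopts the Connectionist Temporal Classification (CTC) loss to avoid the inconsistency between prediction and label: the network emits one prediction per frame, so its output is usually longer than the label and does not need to be aligned with it character by character.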
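To make the CTC idea more concrete, here is a minimal sketch of greedy CTC decoding, the inference-time counterpart of the CTC loss. It is not PP-OCR's actual code. It shows how a per-frame prediction sequence, which is longer than the label, collapses into the final text by merging repeats and dropping a blank symbol. The function name `ctc_greedy_decode`, the blank index `0`, and the toy `charset` are assumptions made for illustration only.

```python
from itertools import groupby

BLANK = 0  # assumption: class index 0 is the CTC blank symbol


def ctc_greedy_decode(frame_scores, charset, blank=BLANK):
    """Collapse per-frame class scores into a string (greedy CTC decoding).

    frame_scores: list of per-frame score lists, one score per class.
    charset: hypothetical mapping from class index to character.
    """
    # 1. Pick the best-scoring class for every frame.
    best_path = [
        max(range(len(scores)), key=scores.__getitem__)
        for scores in frame_scores
    ]
    # 2. Merge consecutive repeats of the same class.
    collapsed = [idx for idx, _ in groupby(best_path)]
    # 3. Drop blanks and map the remaining indices to characters.
    return "".join(charset[idx] for idx in collapsed if idx != blank)


# Toy input: 7 frames over 3 classes (blank, 'a', 'b').
charset = {1: "a", 2: "b"}
frame_scores = [
    [0.10, 0.80, 0.10],  # a
    [0.10, 0.70, 0.20],  # a (repeat, merged with the previous frame)
    [0.90, 0.05, 0.05],  # blank (separates the two 'a' characters)
    [0.20, 0.70, 0.10],  # a
    [0.10, 0.20, 0.70],  # b
    [0.10, 0.10, 0.80],  # b (repeat, merged)
    [0.80, 0.10, 0.10],  # blank
]
decoded = ctc_greedy_decode(frame_scores, charset)
print(decoded)  # prints "aab"
```

Seven frames decode to a three-character string, which is why the recognizer can be trained without knowing which frame corresponds to which character. The blank between the first two `a` frames groups is what lets a genuinely doubled letter survive the merge step.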