CVAug 14, 2021

MMOCR: A Comprehensive Toolbox for Text Detection, Recognition and Understanding

arXiv:2108.06543v189 citationsHas Code
Originality Synthesis-oriented
AI Analysis

This toolbox facilitates research and industrial applications in OCR by providing a more extensive set of algorithms and resources than existing open-source projects.

The authors introduced MMOCR, an open-source toolbox that provides a comprehensive pipeline for text detection, recognition, and understanding tasks, implementing 14 state-of-the-art algorithms and offering trained models and benchmarks.

We present MMOCR-an open-source toolbox which provides a comprehensive pipeline for text detection and recognition, as well as their downstream tasks such as named entity recognition and key information extraction. MMOCR implements 14 state-of-the-art algorithms, which is significantly more than all the existing open-source OCR projects we are aware of to date. To facilitate future research and industrial applications of text recognition-related problems, we also provide a large number of trained models and detailed benchmarks to give insights into the performance of text detection, recognition and understanding. MMOCR is publicly released at https://github.com/open-mmlab/mmocr.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes