CVMay 16, 2019

Stroke extraction for offline handwritten mathematical expression recognition

arXiv:1905.06749v227 citations
Originality Incremental advance
AI Analysis

This addresses the challenge of offline recognition for mathematical expressions, which is harder than online due to lack of temporal data, by enabling the use of mature online methods, though it is incremental as it builds on existing techniques.

The paper tackles the problem of offline handwritten mathematical expression recognition by proposing an oversegmentation algorithm to extract strokes from bitmap images, achieving correct recognition rates of 58.22% to 65.65% on CROHME datasets when combined with existing online recognizers.

Offline handwritten mathematical expression recognition is often considered much harder than its online counterpart due to the absence of temporal information. In order to take advantage of the more mature methods for online recognition and save resources, an oversegmentation approach is proposed to recover strokes from textual bitmap images automatically. The proposed algorithm first breaks down the skeleton of a binarized image into junctions and segments, then segments are merged to form strokes, finally stroke order is normalized by using recursive projection and topological sort. Good offline accuracy was obtained in combination with ordinary online recognizers, which are not specially designed for extracted strokes. Given a ready-made state-of-the-art online handwritten mathematical expression recognizer, the proposed procedure correctly recognized 58.22%, 65.65%, and 65.22% of the offline formulas rendered from the datasets of the Competitions on Recognition of Online Handwritten Mathematical Expressions(CROHME) in 2014, 2016, and 2019 respectively. Furthermore, given a trainable online recognition system, retraining it with extracted strokes resulted in an offline recognizer with the same level of accuracy. On the other hand, the speed of the entire pipeline was fast enough to facilitate on-device recognition on mobile phones with limited resources. To conclude, stroke extraction provides an attractive way to build optical character recognition software.

Code Implementations3 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes