CVSep 23, 2019

Field typing for improved recognition on heterogeneous handwritten forms

arXiv:1909.10120v15 citations
Originality Incremental advance
AI Analysis

This addresses the problem of accurate handwriting recognition on heterogeneous forms for applications like document processing, but it is incremental as it builds on existing LSTM methods.

The paper tackled the problem of poor generalization of state-of-the-art handwriting recognition methods to real-world structured documents like forms, and showed that leveraging content type and synthetic data improved transcription on a challenging dataset of European Accident Statements.

Offline handwriting recognition has undergone continuous progress over the past decades. However, existing methods are typically benchmarked on free-form text datasets that are biased towards good-quality images and handwriting styles, and homogeneous content. In this paper, we show that state-of-the-art algorithms, employing long short-term memory (LSTM) layers, do not readily generalize to real-world structured documents, such as forms, due to their highly heterogeneous and out-of-vocabulary content, and to the inherent ambiguities of this content. To address this, we propose to leverage the content type within an LSTM-based architecture. Furthermore, we introduce a procedure to generate synthetic data to train this architecture without requiring expensive manual annotations. We demonstrate the effectiveness of our approach at transcribing text on a challenging, real-world dataset of European Accident Statements.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes