CVLGMay 14, 2019

End to End Recognition System for Recognizing Offline Unconstrained Vietnamese Handwriting

arXiv:1905.05381v16 citations
Originality Incremental advance
AI Analysis

This addresses the problem of recognizing offline unconstrained Vietnamese handwriting, which is an incremental improvement over existing methods.

The authors tackled offline unconstrained Vietnamese handwriting recognition by proposing an attention-based encoder-decoder model, achieving a 12.30% word error rate without a language model, which is competitive with Google's system in a Vietnamese competition.

Inspired by recent successes in neural machine translation and image caption generation, we present an attention based encoder decoder model (AED) to recognize Vietnamese Handwritten Text. The model composes of two parts: a DenseNet for extracting invariant features, and a Long Short-Term Memory network (LSTM) with an attention model incorporated for generating output text (LSTM decoder), which are connected from the CNN part to the attention model. The input of the CNN part is a handwritten text image and the target of the LSTM decoder is the corresponding text of the input image. Our model is trained end-to-end to predict the text from a given input image since all the parts are differential components. In the experiment section, we evaluate our proposed AED model on the VNOnDB-Word and VNOnDB-Line datasets to verify its efficiency. The experiential results show that our model achieves 12.30% of word error rate without using any language model. This result is competitive with the handwriting recognition system provided by Google in the Vietnamese Online Handwritten Text Recognition competition.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes