CV · Dec 9, 2020

Recurrence-free unconstrained handwritten text recognition using gated fully convolutional network

arXiv:2012.04961v1 · 42 citations · Has Code
AI Analysis

This work addresses the problem of unconstrained handwritten text recognition for document analysis by offering a more efficient network architecture.

This paper proposes a Gated Fully Convolutional Network (GFCN) as a recurrence-free alternative for unconstrained handwritten text recognition. The model, trained with CTC loss, achieves competitive results on the RIMES and IAM datasets.
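A model trained with CTC loss emits one label per input frame, including a special blank label. A common way to read a transcription from such outputs is best-path (greedy) decoding: merge consecutive repeats, then drop blanks. The summary above does not say which decoding scheme the authors use, so the sketch below is only an illustration. The function name `ctc_greedy_decode`, the blank index `0`, and the toy label sequence are assumptions made for this example.

```python
from itertools import groupby


def ctc_greedy_decode(frame_labels, blank=0):
    """Best-path CTC decoding: merge consecutive repeats, then drop blanks."""
    # groupby yields one key per run of identical consecutive labels.
    collapsed = [label for label, _ in groupby(frame_labels)]
    return [label for label in collapsed if label != blank]


# Hypothetical per-frame argmax labels for a toy alphabet:
# 0 = blank, 1 = "a", 2 = "b".
frames = [0, 1, 1, 0, 1, 2, 2, 0]
decoded = ctc_greedy_decode(frames)
```

The blank between the two runs of `1` is what lets the decoder keep a doubled character: without it, the repeated labels would merge into one.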

Unconstrained handwritten text recognition is a major step in most document analysis tasks. It is generally handled by deep recurrent neural networks, and more specifically with Long Short-Term Memory (LSTM) cells. The main drawbacks of these components are the large number of parameters involved and their sequential execution during training and prediction. One alternative to LSTM cells is to compensate for the loss of long-term memory with heavy use of convolutional layers, whose operations can be executed in parallel and which involve fewer parameters. In this paper, we present a Gated Fully Convolutional Network architecture that is a recurrence-free alternative to the well-known CNN+LSTM architectures. Our model is trained with the CTC loss and shows competitive results on both the RIMES and IAM datasets. We release all code to enable reproduction of our experiments: https://github.com/FactoDeepLearning/LinePytorchOCR.
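The parameter argument in the abstract can be made concrete with a rough count. The sketch below compares a single unidirectional LSTM layer with a 1D convolution of the same width. The layer width (256), the kernel size (3), and the GLU-style gate (one convolution for features, a second for a sigmoid gate) are assumptions chosen for illustration. They are not taken from the paper, whose actual gating mechanism and layer sizes are not described here.

```python
def lstm_params(input_size, hidden_size):
    """Weights of one unidirectional LSTM layer (textbook form, one bias per gate)."""
    # Four gates, each with input weights, recurrent weights, and a bias vector.
    return 4 * (hidden_size * (input_size + hidden_size) + hidden_size)


def conv1d_params(in_channels, out_channels, kernel_size):
    """Weights of one 1D convolution with bias."""
    return kernel_size * in_channels * out_channels + out_channels


def gated_conv1d_params(in_channels, out_channels, kernel_size):
    """Assumed GLU-style gate: one conv for features, one for the sigmoid gate."""
    return 2 * conv1d_params(in_channels, out_channels, kernel_size)


# Hypothetical layer width and kernel size, for illustration only.
width, kernel = 256, 3
lstm = lstm_params(width, width)
conv = conv1d_params(width, width, kernel)
gated = gated_conv1d_params(width, width, kernel)
```

With these illustrative sizes, the LSTM layer has 525,312 parameters, the plain convolution 196,864, and the assumed gated convolution 393,728. This says nothing about the total size of the authors' model, which depends on depth and widths not given in this summary. Separately from the count, a convolution computes every output position independently, while an LSTM must process time steps in order.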

Code Implementations: 1 repo