CVAug 26, 2019

Adaptive Embedding Gate for Attention-Based Scene Text Recognition

arXiv:1908.09475v141 citations
AI Analysis

This work addresses a specific bottleneck in scene text recognition, an incremental improvement for applications requiring accurate text extraction from images.

The authors tackled the problem of inappropriate use of previous predictions in attention-based scene text recognition, which restricts performance and stability, by proposing an adaptive embedding gate (AEG) module that introduces high-order character language models; experimental results on benchmarks like IIIT5K and SVT show that AEG significantly boosts recognition performance and improves robustness.

Scene text recognition has attracted particular research interest because it is a very challenging problem and has various applications. The most cutting-edge methods are attentional encoder-decoder frameworks that learn the alignment between the input image and output sequences. In particular, the decoder recurrently outputs predictions, using the prediction of the previous step as a guidance for every time step. In this study, we point out that the inappropriate use of previous predictions in existing attention mechanisms restricts the recognition performance and brings instability. To handle this problem, we propose a novel module, namely adaptive embedding gate(AEG). The proposed AEG focuses on introducing high-order character language models to attention mechanism by controlling the information transmission between adjacent characters. AEG is a flexible module and can be easily integrated into the state-of-the-art attentional methods. We evaluate its effectiveness as well as robustness on a number of standard benchmarks, including the IIIT$5$K, SVT, SVT-P, CUTE$80$, and ICDAR datasets. Experimental results demonstrate that AEG can significantly boost recognition performance and bring better robustness.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes