CVJan 1, 2018

Script Identification in Natural Scene Image and Video Frame using Attention based Convolutional-LSTM Network

arXiv:1801.00470v4138 citations
Originality Incremental advance
AI Analysis

This addresses the problem of script identification for document and video analysis, particularly in challenging conditions like low image quality and complex backgrounds, but it is incremental as it builds on existing CNN-LSTM frameworks.

The paper tackles script identification in scene text images and video frames by proposing an attention-based CNN-LSTM network that dynamically weights local and global features, achieving superior results on four public datasets compared to conventional methods.

Script identification plays a significant role in analysing documents and videos. In this paper, we focus on the problem of script identification in scene text images and video scripts. Because of low image quality, complex background and similar layout of characters shared by some scripts like Greek, Latin, etc., text recognition in those cases become challenging. In this paper, we propose a novel method that involves extraction of local and global features using CNN-LSTM framework and weighting them dynamically for script identification. First, we convert the images into patches and feed them into a CNN-LSTM framework. Attention-based patch weights are calculated applying softmax layer after LSTM. Next, we do patch-wise multiplication of these weights with corresponding CNN to yield local features. Global features are also extracted from last cell state of LSTM. We employ a fusion technique which dynamically weights the local and global features for an individual patch. Experiments have been done in four public script identification datasets: SIW-13, CVSI2015, ICDAR-17 and MLe2e. The proposed framework achieves superior results in comparison to conventional methods.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes