CVApr 3, 2019

Character Region Awareness for Text Detection

arXiv:1904.01941v1961 citations
Originality Highly original
AI Analysis

This addresses the challenge of detecting curved or deformed text in scene images, which is incremental over previous word-level bounding box methods.

The paper tackles the problem of detecting arbitrarily shaped text in natural images by proposing a character-level detection method that explores individual characters and their affinities, achieving state-of-the-art performance on benchmarks like TotalText and CTW-1500.

Scene text detection methods based on neural networks have emerged recently and have shown promising results. Previous methods trained with rigid word-level bounding boxes exhibit limitations in representing the text region in an arbitrary shape. In this paper, we propose a new scene text detection method to effectively detect text area by exploring each character and affinity between characters. To overcome the lack of individual character level annotations, our proposed framework exploits both the given character-level annotations for synthetic images and the estimated character-level ground-truths for real images acquired by the learned interim model. In order to estimate affinity between characters, the network is trained with the newly proposed representation for affinity. Extensive experiments on six benchmarks, including the TotalText and CTW-1500 datasets which contain highly curved texts in natural images, demonstrate that our character-level text detection significantly outperforms the state-of-the-art detectors. According to the results, our proposed method guarantees high flexibility in detecting complicated scene text images, such as arbitrarily-oriented, curved, or deformed texts.

Code Implementations18 repos

Data from Papers with Code (CC-BY-SA-4.0)

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes