CVOct 28, 2017

Total-Text: A Comprehensive Dataset for Scene Text Detection and Recognition

arXiv:1710.10400v1534 citationsHas Code
Originality Synthesis-oriented
AI Analysis

This dataset fills a gap for the scene text community by providing curved text examples, but it is incremental as it builds on existing datasets.

The authors introduced Total-Text, a dataset addressing the lack of curved-oriented text in existing scene text datasets, and benchmarked DeconvNet on it to evaluate robustness against curved text.

Text in curve orientation, despite being one of the common text orientations in real world environment, has close to zero existence in well received scene text datasets such as ICDAR2013 and MSRA-TD500. The main motivation of Total-Text is to fill this gap and facilitate a new research direction for the scene text community. On top of the conventional horizontal and multi-oriented texts, it features curved-oriented text. Total-Text is highly diversified in orientations, more than half of its images have a combination of more than two orientations. Recently, a new breed of solutions that casted text detection as a segmentation problem has demonstrated their effectiveness against multi-oriented text. In order to evaluate its robustness against curved text, we fine-tuned DeconvNet and benchmark it on Total-Text. Total-Text with its annotation is available at https://github.com/cs-chan/Total-Text-Dataset

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes