CVAISep 3, 2020

Synthetic-to-Real Unsupervised Domain Adaptation for Scene Text Detection in the Wild

arXiv:2009.01766v125 citationsHas Code
Originality Incremental advance
AI Analysis

This addresses the challenge of reducing manual labeling effort for scene text detection in real-world applications, though it is incremental as it builds on existing domain adaptation techniques.

The paper tackles the problem of domain distribution mismatch between synthetic and real data for scene text detection by proposing a synthetic-to-real domain adaptation method, achieving up to 10% improvement on benchmarks like ICDAR2015 and ICDAR2013.

Deep learning-based scene text detection can achieve preferable performance, powered with sufficient labeled training data. However, manual labeling is time consuming and laborious. At the extreme, the corresponding annotated data are unavailable. Exploiting synthetic data is a very promising solution except for domain distribution mismatches between synthetic datasets and real datasets. To address the severe domain distribution mismatch, we propose a synthetic-to-real domain adaptation method for scene text detection, which transfers knowledge from synthetic data (source domain) to real data (target domain). In this paper, a text self-training (TST) method and adversarial text instance alignment (ATA) for domain adaptive scene text detection are introduced. ATA helps the network learn domain-invariant features by training a domain classifier in an adversarial manner. TST diminishes the adverse effects of false positives~(FPs) and false negatives~(FNs) from inaccurate pseudo-labels. Two components have positive effects on improving the performance of scene text detectors when adapting from synthetic-to-real scenes. We evaluate the proposed method by transferring from SynthText, VISD to ICDAR2015, ICDAR2013. The results demonstrate the effectiveness of the proposed method with up to 10% improvement, which has important exploration significance for domain adaptive scene text detection. Code is available at https://github.com/weijiawu/SyntoReal_STD

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes