CVDec 16, 2019

MTRNet++: One-stage Mask-based Scene Text Eraser

Osman Tursun, Simon Denman, Rui Zeng, Sabesan Sivapalan, Sridha Sridharan, Clinton Fookes

arXiv:1912.07183v211.056 citationsHas Code

Originality Highly original

AI Analysis

This provides a precise, controllable, and interpretable text removal method for user-specific and large-scale applications.

The paper tackles the problem of scene text removal by proposing MTRNet++, a one-stage mask-based text inpainting network that can remove text with or without an external mask, achieving state-of-the-art results on the Oxford and SCUT datasets.

A precise, controllable, interpretable and easily trainable text removal approach is necessary for both user-specific and large-scale text removal applications. To achieve this, we propose a one-stage mask-based text inpainting network, MTRNet++. It has a novel architecture that includes mask-refine, coarse-inpainting and fine-inpainting branches, and attention blocks. With this architecture, MTRNet++ can remove text either with or without an external mask. It achieves state-of-the-art results on both the Oxford and SCUT datasets without using external ground-truth masks. The results of ablation studies demonstrate that the proposed multi-branch architecture with attention blocks is effective and essential. It also demonstrates controllability and interpretability.

View on arXiv PDF Code

Similar