CVJul 23, 2021

RewriteNet: Reliable Scene Text Editing with Implicit Decomposition of Text Contents and Styles

Junyeop Lee, Yoonsik Kim, Seonghyeon Kim, Moonbin Yim, Seung Shin, Gayoung Lee, Sungrae Park

arXiv:2107.11041v28.011 citationsHas Code

Originality Incremental advance

AI Analysis

This addresses a challenging task in computer vision for applications like image editing and augmentation, but appears incremental as it builds on existing scene text editing methods.

The paper tackles the problem of scene text editing by converting text in images to desired text while preserving style, proposing RewriteNet which implicitly decomposes content and style features and achieves better generation performance than other methods.

Scene text editing (STE), which converts a text in a scene image into the desired text while preserving an original style, is a challenging task due to a complex intervention between text and style. In this paper, we propose a novel STE model, referred to as RewriteNet, that decomposes text images into content and style features and re-writes a text in the original image. Specifically, RewriteNet implicitly distinguishes the content from the style by introducing scene text recognition. Additionally, independent of the exact supervisions with synthetic examples, we propose a self-supervised training scheme for unlabeled real-world images, which bridges the domain gap between synthetic and real data. Our experiments present that RewriteNet achieves better generation performances than other comparisons. Further analysis proves the feature decomposition of RewriteNet and demonstrates the reliability and robustness through diverse experiments. Our implementation is publicly available at \url{https://github.com/clovaai/rewritenet}

View on arXiv PDF Code

Similar