CVJun 11, 2017

Text Extraction From Texture Images Using Masked Signal Decomposition

arXiv:1706.04041v3
AI Analysis

This addresses a more realistic scenario in text extraction for applications like OCR and autonomous driving, though it is incremental as it builds on existing segmentation methods.

The paper tackles the problem of extracting text from textured backgrounds with similar colors by modeling it as a signal decomposition task with overlaid components, and the proposed algorithm achieves significantly better results than recent works on challenging images.

Text extraction is an important problem in image processing with applications from optical character recognition to autonomous driving. Most of the traditional text segmentation algorithms consider separating text from a simple background (which usually has a different color from texts). In this work we consider separating texts from a textured background, that has similar color to texts. We look at this problem from a signal decomposition perspective, and consider a more realistic scenario where signal components are overlaid on top of each other (instead of adding together). When the signals are overlaid, to separate signal components, we need to find a binary mask which shows the support of each component. Because directly solving the binary mask is intractable, we relax this problem to the approximated continuous problem, and solve it by alternating optimization method. We show that the proposed algorithm achieves significantly better results than other recent works on several challenging images.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes