CVJan 27, 2019

Pixelated Semantic Colorization

arXiv:1901.10889v289 citations
Originality Incremental advance
AI Analysis

This work addresses the challenge of producing semantically accurate colorizations for computer vision applications, representing an incremental improvement over existing methods.

The paper tackles the problem of limited semantic understanding in image colorization by using pixelated object semantics to guide the process, resulting in more realistic and finer colorized images compared to state-of-the-art methods on datasets like PASCAL VOC2012 and COCO-stuff.

While many image colorization algorithms have recently shown the capability of producing plausible color versions from gray-scale photographs, they still suffer from limited semantic understanding. To address this shortcoming, we propose to exploit pixelated object semantics to guide image colorization. The rationale is that human beings perceive and distinguish colors based on the semantic categories of objects. Starting from an autoregressive model, we generate image color distributions, from which diverse colored results are sampled. We propose two ways to incorporate object semantics into the colorization model: through a pixelated semantic embedding and a pixelated semantic generator. Specifically, the proposed convolutional neural network includes two branches. One branch learns what the object is, while the other branch learns the object colors. The network jointly optimizes a color embedding loss, a semantic segmentation loss and a color generation loss, in an end-to-end fashion. Experiments on PASCAL VOC2012 and COCO-stuff reveal that our network, when trained with semantic segmentation labels, produces more realistic and finer results compared to the colorization state-of-the-art.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes