CVJun 18, 2020

Learning High-Resolution Domain-Specific Representations with a GAN Generator

Danil Galeev, Konstantin Sofiiuk, Danila Rukhovich, Mikhail Romanov, Olga Barinova, Anton Konushin

arXiv:2006.10451v19.115 citations

Originality Incremental advance

AI Analysis

This work addresses semi-supervised semantic segmentation for computer vision, offering an incremental improvement in domain-specific pretraining.

The authors tackled the problem of learning domain-specific representations for semantic segmentation by leveraging a GAN generator, showing that LayerMatch pretraining outperforms ImageNet pretraining and recent semi-supervised methods in accuracy.

In recent years generative models of visual data have made a great progress, and now they are able to produce images of high quality and diversity. In this work we study representations learnt by a GAN generator. First, we show that these representations can be easily projected onto semantic segmentation map using a lightweight decoder. We find that such semantic projection can be learnt from just a few annotated images. Based on this finding, we propose LayerMatch scheme for approximating the representation of a GAN generator that can be used for unsupervised domain-specific pretraining. We consider the semi-supervised learning scenario when a small amount of labeled data is available along with a large unlabeled dataset from the same domain. We find that the use of LayerMatch-pretrained backbone leads to superior accuracy compared to standard supervised pretraining on ImageNet. Moreover, this simple approach also outperforms recent semi-supervised semantic segmentation methods that use both labeled and unlabeled data during training. Source code for reproducing our experiments will be available at the time of publication.

View on arXiv PDF

Similar