CVJul 14, 2020

Tackling the Problem of Limited Data and Annotations in Semantic Segmentation

arXiv:2007.07357v1
Originality Incremental advance
AI Analysis

This addresses the problem of data scarcity in semantic segmentation for computer vision researchers, but it is incremental as it builds on existing techniques.

The study tackled semantic segmentation with limited data and weak annotations by transferring pre-trained models and applying CRF methods, showing that using a ResNet50 SWSL model improved results by 7.4% on a small dataset and by 4% on full data compared to ImageNet pre-training.

In this work, the case of semantic segmentation on a small image dataset (simulated by 1000 randomly selected images from PASCAL VOC 2012), where only weak supervision signals (scribbles from user interaction) are available is studied. Especially, to tackle the problem of limited data annotations in image segmentation, transferring different pre-trained models and CRF based methods are applied to enhance the segmentation performance. To this end, RotNet, DeeperCluster, and Semi&Weakly Supervised Learning (SWSL) pre-trained models are transferred and finetuned in a DeepLab-v2 baseline, and dense CRF is applied both as a post-processing and loss regularization technique. The results of my study show that, on this small dataset, using a pre-trained ResNet50 SWSL model gives results that are 7.4% better than applying an ImageNet pre-trained model; moreover, for the case of training on the full PASCAL VOC 2012 training data, this pre-training approach increases the mIoU results by almost 4%. On the other hand, dense CRF is shown to be very effective as well, enhancing the results both as a loss regularization technique in weakly supervised training and as a post-processing tool.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes