The effect of scene context on weakly supervised semantic segmentation
This work addresses a specific problem in weakly supervised semantic segmentation for computer vision researchers, but it is incremental as it builds on existing methods.
The paper tackles the challenge of discriminating objects from background in weakly supervised semantic segmentation by proposing a scene recommender that adds target-specific scene contexts to the dataset, which improves model accuracy for those objects as validated in experiments.
Image semantic segmentation is parsing image into several partitions in such a way that each region of which involves a semantic concept. In a weakly supervised manner, since only image-level labels are available, discriminating objects from the background is challenging, and in some cases, much more difficult. More specifically, some objects which are commonly seen in one specific scene (e.g. 'train' typically is seen on 'railroad track') are much more likely to be confused. In this paper, we propose a method to add the target-specific scenes in order to overcome the aforementioned problem. Actually, we propose a scene recommender which suggests to add some specific scene contexts to the target dataset in order to train the model more accurately. It is notable that this idea could be a complementary part of the baselines of many other methods. The experiments validate the effectiveness of the proposed method for the objects for which the scene context is added.