CVApr 24, 2018

Learning to See the Invisible: End-to-End Trainable Amodal Instance Segmentation

arXiv:1804.08864v1151 citations
Originality Incremental advance
AI Analysis

This work addresses the challenge of segmenting invisible object regions for computer vision applications, representing an incremental advancement in instance-aware segmentation.

The authors tackled the problem of semantic amodal segmentation by developing the first end-to-end trainable model that predicts amodal instance masks along with visible and invisible parts in a single forward pass, outperforming the current baseline on the COCO amodal dataset by a large margin and providing strong baseline performance on two new datasets.

Semantic amodal segmentation is a recently proposed extension to instance-aware segmentation that includes the prediction of the invisible region of each object instance. We present the first all-in-one end-to-end trainable model for semantic amodal segmentation that predicts the amodal instance masks as well as their visible and invisible part in a single forward pass. In a detailed analysis, we provide experiments to show which architecture choices are beneficial for an all-in-one amodal segmentation model. On the COCO amodal dataset, our model outperforms the current baseline for amodal segmentation by a large margin. To further evaluate our model, we provide two new datasets with ground truth for semantic amodal segmentation, D2S amodal and COCOA cls. For both datasets, our model provides a strong baseline performance. Using special data augmentation techniques, we show that amodal segmentation on D2S amodal is possible with reasonable performance, even without providing amodal training data.

Code Implementations3 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes