CVAIMay 25, 2022

UniInst: Unique Representation for End-to-End Instance Segmentation

arXiv:2205.12646v47 citationsh-index: 16Has Code
Originality Incremental advance
AI Analysis

This addresses the need for end-to-end training in instance segmentation for computer vision researchers, though it is incremental as it builds on existing FCN-based methods.

The paper tackles the problem of redundant representations in instance segmentation, which requires non-maximum suppression (NMS) and hinders end-to-end training, by proposing UniInst, a box-free and NMS-free framework that yields one unique representation per instance, achieving 39.0 mask AP with ResNet-50-FPN and 40.2 mask AP with ResNet-101-FPN on COCO test-dev 2017.

Existing instance segmentation methods have achieved impressive performance but still suffer from a common dilemma: redundant representations (e.g., multiple boxes, grids, and anchor points) are inferred for one instance, which leads to multiple duplicated predictions. Thus, mainstream methods usually rely on a hand-designed non-maximum suppression (NMS) post-processing step to select the optimal prediction result, which hinders end-to-end training. To address this issue, we propose a box-free and NMS-free end-to-end instance segmentation framework, termed UniInst, that yields only one unique representation for each instance. Specifically, we design an instance-aware one-to-one assignment scheme, namely Only Yield One Representation (OYOR), which dynamically assigns one unique representation to each instance according to the matching quality between predictions and ground truths. Then, a novel prediction re-ranking strategy is elegantly integrated into the framework to address the misalignment between the classification score and the mask quality, enabling the learned representation to be more discriminative. With these techniques, our UniInst, the first FCN-based box-free and NMS-free instance segmentation framework, achieves competitive performance, e.g., 39.0 mask AP using ResNet-50-FPN and 40.2 mask AP using ResNet-101-FPN, against mainstream methods on COCO test-dev 2017. Moreover, the proposed instance-aware method is robust to occlusion scenes, outperforming common baselines by remarkable mask AP on the heavily-occluded OCHuman benchmark. Code is available at https://github.com/b03505036/UniInst.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes