CVApr 4, 2019

YOLACT: Real-time Instance Segmentation

arXiv:1904.02689v22089 citations
AI Analysis

This addresses the need for efficient instance segmentation in applications like autonomous driving or robotics, offering a novel approach that is not incremental but introduces a new paradigm.

The paper tackles real-time instance segmentation by introducing a fully-convolutional model that achieves 29.8 mAP on MS COCO at 33.5 fps on a single Titan Xp, significantly faster than prior competitive methods, and is trained on only one GPU.

We present a simple, fully-convolutional model for real-time instance segmentation that achieves 29.8 mAP on MS COCO at 33.5 fps evaluated on a single Titan Xp, which is significantly faster than any previous competitive approach. Moreover, we obtain this result after training on only one GPU. We accomplish this by breaking instance segmentation into two parallel subtasks: (1) generating a set of prototype masks and (2) predicting per-instance mask coefficients. Then we produce instance masks by linearly combining the prototypes with the mask coefficients. We find that because this process doesn't depend on repooling, this approach produces very high-quality masks and exhibits temporal stability for free. Furthermore, we analyze the emergent behavior of our prototypes and show they learn to localize instances on their own in a translation variant manner, despite being fully-convolutional. Finally, we also propose Fast NMS, a drop-in 12 ms faster replacement for standard NMS that only has a marginal performance penalty.

Code Implementations48 repos

Data from Papers with Code (CC-BY-SA-4.0)

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes