CVApr 3, 2023

Disentangled Pre-training for Image Matting

Tencent
arXiv:2304.00784v22 citationsh-index: 70
Originality Incremental advance
AI Analysis

This addresses the scalability issue in image matting research by reducing reliance on expensive human annotations, though it is incremental as it builds on existing self-supervised pre-training ideas.

The paper tackles the problem of costly pixel-level annotations for image matting by proposing a self-supervised pre-training approach that leverages infinite data, resulting in outperforming state-of-the-art methods and other self-supervised initializations by a large margin.

Image matting requires high-quality pixel-level human annotations to support the training of a deep model in recent literature. Whereas such annotation is costly and hard to scale, significantly holding back the development of the research. In this work, we make the first attempt towards addressing this problem, by proposing a self-supervised pre-training approach that can leverage infinite numbers of data to boost the matting performance. The pre-training task is designed in a similar manner as image matting, where random trimap and alpha matte are generated to achieve an image disentanglement objective. The pre-trained model is then used as an initialisation of the downstream matting task for fine-tuning. Extensive experimental evaluations show that the proposed approach outperforms both the state-of-the-art matting methods and other alternative self-supervised initialisation approaches by a large margin. We also show the robustness of the proposed approach over different backbone architectures. Our project page is available at https://crystraldo.github.io/dpt_mat/.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes