CVMar 30, 2024

Learning Trimaps via Clicks for Image Matting

Georgia Tech
arXiv:2404.00335v34 citationsh-index: 55IEEE transactions on multimedia
Originality Highly original
AI Analysis

This addresses the time-consuming and user-unfriendly process of obtaining trimaps for image matting, making it more practical for real-world applications.

The paper tackles the problem of image matting's reliance on manually-drawn trimaps by introducing Click2Trimap, an interactive model that predicts high-quality trimaps and alpha mattes with minimal user clicks, achieving results in an average of 5 seconds per image.

Despite significant advancements in image matting, existing models heavily depend on manually-drawn trimaps for accurate results in natural image scenarios. However, the process of obtaining trimaps is time-consuming, lacking user-friendliness and device compatibility. This reliance greatly limits the practical application of all trimap-based matting methods. To address this issue, we introduce Click2Trimap, an interactive model capable of predicting high-quality trimaps and alpha mattes with minimal user click inputs. Through analyzing real users' behavioral logic and characteristics of trimaps, we successfully propose a powerful iterative three-class training strategy and a dedicated simulation function, making Click2Trimap exhibit versatility across various scenarios. Quantitative and qualitative assessments on synthetic and real-world matting datasets demonstrate Click2Trimap's superior performance compared to all existing trimap-free matting methods. Especially, in the user study, Click2Trimap achieves high-quality trimap and matting predictions in just an average of 5 seconds per image, demonstrating its substantial practical value in real-world applications.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes