CVAug 16, 2022

SGM-Net: Semantic Guided Matting Net

arXiv:2208.07496v12 citationsh-index: 10
Originality Incremental advance
AI Analysis

This addresses the practical difficulty of applying human matting in image synthesis and visual effects without green screens, but it is incremental as it builds on existing methods like MODNet.

The paper tackled the problem of human matting from natural images without additional inputs like trimaps, proposing SGM-Net which adds a semantic guidance module to MODNet to improve feature utilization, resulting in significant improvements on the P3M-10k dataset.

Human matting refers to extracting human parts from natural images with high quality, including human detail information such as hair, glasses, hat, etc. This technology plays an essential role in image synthesis and visual effects in the film industry. When the green screen is not available, the existing human matting methods need the help of additional inputs (such as trimap, background image, etc.), or the model with high computational cost and complex network structure, which brings great difficulties to the application of human matting in practice. To alleviate such problems, most existing methods (such as MODNet) use multi-branches to pave the way for matting through segmentation, but these methods do not make full use of the image features and only utilize the prediction results of the network as guidance information. Therefore, we propose a module to generate foreground probability map and add it to MODNet to obtain Semantic Guided Matting Net (SGM-Net). Under the condition of only one image, we can realize the human matting task. We verify our method on the P3M-10k dataset. Compared with the benchmark, our method has significantly improved in various evaluation indicators.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes