CVAIDec 31, 2023

Masked Modeling for Self-supervised Representation Learning on Vision and Beyond

arXiv:2401.00897v234 citationsh-index: 26Has Code
Originality Synthesis-oriented
AI Analysis

It provides a comprehensive overview for researchers, but is incremental as it synthesizes existing work without introducing new methods.

This survey reviews masked modeling as a self-supervised learning approach for representation learning across vision and other domains, highlighting its ability to learn robust representations with low dependence on labeled data.

As the deep learning revolution marches on, self-supervised learning has garnered increasing attention in recent years thanks to its remarkable representation learning ability and the low dependence on labeled data. Among these varied self-supervised techniques, masked modeling has emerged as a distinctive approach that involves predicting parts of the original data that are proportionally masked during training. This paradigm enables deep models to learn robust representations and has demonstrated exceptional performance in the context of computer vision, natural language processing, and other modalities. In this survey, we present a comprehensive review of the masked modeling framework and its methodology. We elaborate on the details of techniques within masked modeling, including diverse masking strategies, recovering targets, network architectures, and more. Then, we systematically investigate its wide-ranging applications across domains. Furthermore, we also explore the commonalities and differences between masked modeling methods in different fields. Toward the end of this paper, we conclude by discussing the limitations of current techniques and point out several potential avenues for advancing masked modeling research. A paper list project with this survey is available at \url{https://github.com/Lupin1998/Awesome-MIM}.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes