LG · Jun 30, 2021

DAEMA: Denoising Autoencoder with Mask Attention

Simon Tihon, Muhammad Usama Javaid, Damien Fourure, Nicolas Posocco, Thomas Peel

arXiv:2106.16057v19.218 citations · Has Code

Originality: Incremental advance

AI Analysis

This paper addresses missing data imputation for real-world machine learning applications and represents an incremental improvement over existing deep learning approaches.

The paper tackles the problem of missing data imputation by proposing DAEMA, a denoising autoencoder with a mask attention mechanism, which achieves superior performance to state-of-the-art algorithms on real-world datasets under various missingness settings.

Missing data is a recurrent and challenging problem, especially when using machine learning algorithms for real-world applications. For this reason, missing data imputation has become an active research area, in which recent deep learning approaches have achieved state-of-the-art results. We propose DAEMA (Denoising Autoencoder with Mask Attention), an algorithm based on a denoising autoencoder architecture with an attention mechanism. While most imputation algorithms use incomplete inputs as they would use complete data - up to basic preprocessing (e.g. mean imputation) - DAEMA leverages a mask-based attention mechanism to focus on the observed values of its inputs. We evaluate DAEMA both in terms of reconstruction capabilities and downstream prediction and show that it achieves superior performance to state-of-the-art algorithms on several publicly available real-world datasets under various missingness settings.
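To make the idea of mask-based attention more concrete, here is a minimal sketch of a forward pass, not the authors' implementation. It assumes a hypothetical layout in which several "slots" each propose a candidate latent vector from the observed values, and attention weights computed from the missingness mask alone decide how to blend them. All function names, the slot layout, and the random untrained weights are illustrative assumptions; training (a reconstruction loss on observed values) is not shown.

```python
import math
import random

def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    top = max(scores)
    exps = [math.exp(s - top) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def linear(weights, vec):
    """Multiply a (rows x cols) weight matrix by a vector."""
    return [sum(w * v for w, v in zip(row, vec)) for row in weights]

def random_matrix(rows, cols, rng):
    """Untrained placeholder weights, for illustration only."""
    return [[rng.uniform(-0.5, 0.5) for _ in range(cols)] for _ in range(rows)]

def corrupt(mask, drop_prob, rng):
    """Denoising step: hide some observed entries by zeroing their mask flag."""
    return [m if rng.random() >= drop_prob else 0.0 for m in mask]

def mask_attention_encode(x, mask, feat_weights, attn_weights):
    """Encode one sample into a latent vector (hypothetical layout).

    x: feature values; missing entries may hold any placeholder
    mask: 1.0 where a value is observed, 0.0 where it is missing
    feat_weights, attn_weights: one weight matrix per slot
    """
    # Zero out missing entries so placeholders never reach the encoder.
    visible = [v * m for v, m in zip(x, mask)]
    # Each slot proposes a candidate latent vector from the visible values.
    candidates = [[math.tanh(h) for h in linear(w, visible)] for w in feat_weights]
    # Attention scores depend only on which features are observed.
    scores = [linear(w, mask) for w in attn_weights]
    latent = []
    for h in range(len(candidates[0])):
        # Softmax across slots, separately for each latent dimension.
        weights = softmax([slot[h] for slot in scores])
        latent.append(sum(a * c[h] for a, c in zip(weights, candidates)))
    return latent

rng = random.Random(0)
n_features, n_latent, n_slots = 4, 3, 2
feat_weights = [random_matrix(n_latent, n_features, rng) for _ in range(n_slots)]
attn_weights = [random_matrix(n_latent, n_features, rng) for _ in range(n_slots)]
decoder = random_matrix(n_features, n_latent, rng)

x = [0.2, 99.0, -1.3, 0.7]        # 99.0 is an arbitrary placeholder for a missing value
mask = [1.0, 0.0, 1.0, 1.0]       # second feature is missing
noisy_mask = corrupt(mask, 0.2, rng)

latent = mask_attention_encode(x, noisy_mask, feat_weights, attn_weights)
reconstruction = linear(decoder, latent)
# Keep the originally observed values; fill only the missing ones.
imputed = [v if m == 1.0 else r for v, m, r in zip(x, mask, reconstruction)]
```

In this sketch the placeholder stored at a missing position has no influence on the latent vector, because the encoder only sees values multiplied by the mask. That contrasts with mean imputation, where the filled-in value is processed as if it had been observed.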
