LGMLMay 5, 2016

Rank Ordered Autoencoders

arXiv:1605.01749v11 citations
Originality Incremental advance
AI Analysis

This addresses the challenge of unsupervised sparse representation learning in autoencoders, offering an implicit sparsity control method that could benefit machine learning practitioners, though it appears incremental as it builds on existing autoencoder frameworks.

The authors tackled the problem of learning sparse representations in autoencoders without extra hyperparameters by introducing a method that orders hidden unit activations and progressively reconstructs the input, leading to high sparsity and minimal reconstruction error. Results on CIFAR10 patches showed rapid convergence, extremely sparse activations, and robustness to overfitting.

A new method for the unsupervised learning of sparse representations using autoencoders is proposed and implemented by ordering the output of the hidden units by their activation value and progressively reconstructing the input in this order. This can be done efficiently in parallel with the use of cumulative sums and sorting only slightly increasing the computational costs. Minimizing the difference of this progressive reconstruction with respect to the input can be seen as minimizing the number of active output units required for the reconstruction of the input. The model thus learns to reconstruct optimally using the least number of active output units. This leads to high sparsity without the need for extra hyperparameters, the amount of sparsity is instead implicitly learned by minimizing this progressive reconstruction error. Results of the trained model are given for patches of the CIFAR10 dataset, showing rapid convergence of features and extremely sparse output activations while maintaining a minimal reconstruction error and showing extreme robustness to overfitting. Additionally the reconstruction as function of number of active units is presented which shows the autoencoder learns a rank order code over the input where the highest ranked units correspond to the highest decrease in reconstruction error.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes