ML CV LG NEFeb 13, 2014

Zero-bias autoencoders and the benefits of co-adapting features

Kishore Konda, Roland Memisevic, David Krueger

arXiv:1402.3337v598 citations

Originality Highly original

AI Analysis

This addresses a bottleneck in representation learning for high-dimensional data, offering a novel solution that could improve autoencoder performance in domains like image or text processing.

The paper tackled the problem of autoencoders failing on high intrinsic dimensionality data due to negative hidden biases, and proposed a new activation function that decouples representation and sparsity roles, enabling successful learning without extra regularization.

Regularized training of an autoencoder typically results in hidden unit biases that take on large negative values. We show that negative biases are a natural result of using a hidden layer whose responsibility is to both represent the input data and act as a selection mechanism that ensures sparsity of the representation. We then show that negative biases impede the learning of data distributions whose intrinsic dimensionality is high. We also propose a new activation function that decouples the two roles of the hidden layer and that allows us to learn representations on data with very high intrinsic dimensionality, where standard autoencoders typically fail. Since the decoupled activation function acts like an implicit regularizer, the model can be trained by minimizing the reconstruction error of training data, without requiring any additional regularization.

View on arXiv PDF

Similar