Geometric instability of out of distribution data across autoencoder architecture
This work addresses robustness issues in autoencoders for machine learning practitioners, though it is incremental as it focuses on analysis rather than proposing new solutions.
The study investigated the geometric instability of autoencoders when processing out-of-distribution data, finding that reconstructed images appear as generalized characters but latent representations vary significantly across architectures and inputs.
We study the map learned by a family of autoencoders trained on MNIST, and evaluated on ten different data sets created by the random selection of pixel values according to ten different distributions. Specifically, we study the eigenvalues of the Jacobians defined by the weight matrices of the autoencoder at each training and evaluation point. For high enough latent dimension, we find that each autoencoder reconstructs all the evaluation data sets as similar \emph{generalized characters}, but that this reconstructed \emph{generalized character} changes across autoencoder. Eigenvalue analysis shows that even when the reconstructed image appears to be an MNIST character for all out of distribution data sets, not all have latent representations that are close to the latent representation of MNIST characters. All told, the eigenvalue analysis demonstrated a great deal of geometric instability of the autoencoder both as a function on out of distribution inputs, and across architectures on the same set of inputs.