ML LG MEJan 2, 2025

Deep Discrete Encoders: Identifiable Deep Generative Models for Rich Data with Discrete Latent Layers

arXiv:2501.01414v27.83 citationsh-index: 2J Am Stat Assoc

Originality Incremental advance

AI Analysis

This work addresses the need for interpretable and statistically sound generative models in high-stakes applications, representing an incremental improvement with a focus on theoretical guarantees.

The paper tackles the problem of non-identifiability and lack of interpretability in deep generative models by proposing Deep Discrete Encoders (DDEs), which are interpretable models with discrete latent layers, and demonstrates their effectiveness through extensive simulations and applications to diverse datasets.

In the era of generative AI, deep generative models (DGMs) with latent representations have gained tremendous popularity. Despite their impressive empirical performance, the statistical properties of these models remain underexplored. DGMs are often overparametrized, non-identifiable, and uninterpretable black boxes, raising serious concerns when deploying them in high-stakes applications. Motivated by this, we propose interpretable deep generative models for rich data types with discrete latent layers, called Deep Discrete Encoders (DDEs). A DDE is a directed graphical model with multiple binary latent layers. Theoretically, we propose transparent identifiability conditions for DDEs, which imply progressively smaller sizes of the latent layers as they go deeper. Identifiability ensures consistent parameter estimation and inspires an interpretable design of the deep architecture. Computationally, we propose a scalable estimation pipeline of a layerwise nonlinear spectral initialization followed by a penalized stochastic approximation EM algorithm. This procedure can efficiently estimate models with exponentially many latent components. Extensive simulation studies for high-dimensional data and deep architectures validate our theoretical results and demonstrate the excellent performance of our algorithms. We apply DDEs to three diverse real datasets with different data types to perform hierarchical topic modeling, image representation learning, and response time modeling in educational testing.

View on arXiv PDF

Similar