ME LGMar 25

Minimal Sufficient Representations for Self-interpretable Deep Neural Networks

arXiv:2603.2404114.0h-index: 3

Predicted impact top 63% in ME · last 90 daysOriginality Highly original

AI Analysis

This addresses the challenge of making overparameterized deep networks interpretable for researchers and practitioners in fields like biomedicine and vision, offering a novel integration of interpretability with statistical guarantees.

The paper tackles the problem of interpretability in deep neural networks by introducing DeepIn, a self-interpretable framework that identifies minimal representations, achieving up to 30% error reduction on real-world datasets while preserving predictive performance.

Deep neural networks (DNNs) achieve remarkable predictive performance but remain difficult to interpret, largely due to overparameterization that obscures the minimal structure required for interpretation. Here we introduce DeepIn, a self-interpretable neural network framework that adaptively identifies and learns the minimal representation necessary for preserving the full expressive capacity of standard DNNs. We show that DeepIn can correctly identify the minimal representation dimension, select relevant variables, and recover the minimal sufficient network architecture for prediction. The resulting estimator achieves optimal non-asymptotic error rates that adapt to the learned minimal dimension, demonstrating that recovering minimal sufficient structure fundamentally improves generalization error. Building on these guarantees, we further develop hypothesis testing procedures for both selected variables and learned representations, bridging deep representation learning with formal statistical inference. Across biomedical and vision benchmarks, DeepIn improves both predictive accuracy and interpretability, reducing error by up to 30% on real-world datasets while automatically uncovering human-interpretable discriminative patterns. Our results suggest that interpretability and statistical rigor can be embedded directly into deep architectures without sacrificing performance.

View on arXiv PDF

Similar