LGMay 19, 2025

Koopman Autoencoders Learn Neural Representation Dynamics

Nishant Suresh Aswani, Saif Eddin Jabari

arXiv:2505.12809v17.11 citationsh-index: 19

Originality Incremental advance

AI Analysis

This addresses the problem of understanding and editing neural network dynamics for researchers, though it appears incremental as it builds on existing autoencoder and dynamical systems concepts.

The paper tackled modeling neural network internal transformations using dynamical systems theory by introducing Koopman autoencoders, which learn surrogate models that predict representation evolution and enable targeted class unlearning in tasks like Yin-Yang and MNIST.

This paper explores a simple question: can we model the internal transformations of a neural network using dynamical systems theory? We introduce Koopman autoencoders to capture how neural representations evolve through network layers, treating these representations as states in a dynamical system. Our approach learns a surrogate model that predicts how neural representations transform from input to output, with two key advantages. First, by way of lifting the original states via an autoencoder, it operates in a linear space, making editing the dynamics straightforward. Second, it preserves the topologies of the original representations by regularizing the autoencoding objective. We demonstrate that these surrogate models naturally replicate the progressive topological simplification observed in neural networks. As a practical application, we show how our approach enables targeted class unlearning in the Yin-Yang and MNIST classification tasks.

View on arXiv PDF

Similar