LGMay 19, 2025

Koopman Autoencoders Learn Neural Representation Dynamics

arXiv:2505.12809v11 citationsh-index: 19
Originality Incremental advance
AI Analysis

This addresses the problem of understanding and editing neural network dynamics for researchers, though it appears incremental as it builds on existing autoencoder and dynamical systems concepts.

The paper tackled modeling neural network internal transformations using dynamical systems theory by introducing Koopman autoencoders, which learn surrogate models that predict representation evolution and enable targeted class unlearning in tasks like Yin-Yang and MNIST.

This paper explores a simple question: can we model the internal transformations of a neural network using dynamical systems theory? We introduce Koopman autoencoders to capture how neural representations evolve through network layers, treating these representations as states in a dynamical system. Our approach learns a surrogate model that predicts how neural representations transform from input to output, with two key advantages. First, by way of lifting the original states via an autoencoder, it operates in a linear space, making editing the dynamics straightforward. Second, it preserves the topologies of the original representations by regularizing the autoencoding objective. We demonstrate that these surrogate models naturally replicate the progressive topological simplification observed in neural networks. As a practical application, we show how our approach enables targeted class unlearning in the Yin-Yang and MNIST classification tasks.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes