LG AISep 11, 2023

Uncovering mesa-optimization algorithms in Transformers

Johannes von Oswald, Maximilian Schlegel, Alexander Meulemans, Seijin Kobayashi, Eyvind Niklasson, Nicolas Zucchet, Nino Scherrer, Nolan Miller, Mark Sandler, Blaise Agüera y Arcas, Max Vladymyrov, Razvan Pascanu

DeepMind

arXiv:2309.05858v232.6101 citationsh-index: 75

Originality Highly original

AI Analysis

This explains the poorly understood origins of in-context learning in Transformers, which is foundational for improving model design.

The paper investigated how autoregressive Transformers develop in-context learning capabilities through standard next-token prediction training, discovering that this process creates a subsidiary gradient-based optimization algorithm that enables strong generalization on unseen sequences.

Some autoregressive models exhibit in-context learning capabilities: being able to learn as an input sequence is processed, without undergoing any parameter changes, and without being explicitly trained to do so. The origins of this phenomenon are still poorly understood. Here we analyze a series of Transformer models trained to perform synthetic sequence prediction tasks, and discover that standard next-token prediction error minimization gives rise to a subsidiary learning algorithm that adjusts the model as new inputs are revealed. We show that this process corresponds to gradient-based optimization of a principled objective function, which leads to strong generalization performance on unseen sequences. Our findings explain in-context learning as a product of autoregressive loss minimization and inform the design of new optimization-based Transformer layers.

View on arXiv PDF

Similar