LGAICVNEMay 13, 2024

The Platonic Representation Hypothesis

arXiv:2405.07987v5355 citationsh-index: 11ICML
Originality Incremental advance
AI Analysis

This addresses the fundamental question of how AI models represent reality, with implications for understanding generalization and model alignment across the field.

The paper argues that representations in AI models, particularly deep networks, are converging across time, domains, and data modalities, hypothesizing this leads toward a shared statistical model of reality termed the 'platonic representation'.

We argue that representations in AI models, particularly deep networks, are converging. First, we survey many examples of convergence in the literature: over time and across multiple domains, the ways by which different neural networks represent data are becoming more aligned. Next, we demonstrate convergence across data modalities: as vision models and language models get larger, they measure distance between datapoints in a more and more alike way. We hypothesize that this convergence is driving toward a shared statistical model of reality, akin to Plato's concept of an ideal reality. We term such a representation the platonic representation and discuss several possible selective pressures toward it. Finally, we discuss the implications of these trends, their limitations, and counterexamples to our analysis.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes