COMP-PHDIS-NNLGOct 24, 2018

Toward an AI Physicist for Unsupervised Learning

arXiv:1810.10525v474 citationsHas Code
Originality Highly original
AI Analysis

This addresses the challenge of unsupervised learning in complex environments, offering a foundational approach that could impact all of ML/AI, though it is incremental in applying physics-inspired methods to AI.

The paper tackles the problem of improving unsupervised learning by introducing a novel paradigm that learns and manipulates theories, inspired by physics strategies, resulting in significantly faster learning and prediction errors about a billion times smaller than a standard neural net, with exact recovery of theory parameters.

We investigate opportunities and challenges for improving unsupervised machine learning using four common strategies with a long history in physics: divide-and-conquer, Occam's razor, unification and lifelong learning. Instead of using one model to learn everything, we propose a novel paradigm centered around the learning and manipulation of *theories*, which parsimoniously predict both aspects of the future (from past observations) and the domain in which these predictions are accurate. Specifically, we propose a novel generalized-mean-loss to encourage each theory to specialize in its comparatively advantageous domain, and a differentiable description length objective to downweight bad data and "snap" learned theories into simple symbolic formulas. Theories are stored in a "theory hub", which continuously unifies learned theories and can propose theories when encountering new environments. We test our implementation, the toy "AI Physicist" learning agent, on a suite of increasingly complex physics environments. From unsupervised observation of trajectories through worlds involving random combinations of gravity, electromagnetism, harmonic motion and elastic bounces, our agent typically learns faster and produces mean-squared prediction errors about a billion times smaller than a standard feedforward neural net of comparable complexity, typically recovering integer and rational theory parameters exactly. Our agent successfully identifies domains with different laws of motion also for a nonlinear chaotic double pendulum in a piecewise constant force field.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes