LGMay 30, 2022

Agnostic Physics-Driven Deep Learning

Benjamin Scellier, Siddhartha Mishra, Yoshua Bengio, Yann Ollivier

arXiv:2205.15021v115.119 citationsh-index: 57

Originality Highly original

AI Analysis

This work addresses the challenge of implementing gradient-based learning in unknown or poorly characterized physical systems, offering a novel approach that could impact hardware design and biological learning mechanisms.

The paper tackles the problem of enabling statistical learning in physical systems without gradient computations by introducing Agnostic Equilibrium Propagation (Aeqprop), which uses energy minimization and nudging to perform true gradient steps, broadening hardware applicability to any system with controllable parameters.

This work establishes that a physical system can perform statistical learning without gradient computations, via an Agnostic Equilibrium Propagation (Aeqprop) procedure that combines energy minimization, homeostatic control, and nudging towards the correct response. In Aeqprop, the specifics of the system do not have to be known: the procedure is based only on external manipulations, and produces a stochastic gradient descent without explicit gradient computations. Thanks to nudging, the system performs a true, order-one gradient step for each training sample, in contrast with order-zero methods like reinforcement or evolutionary strategies, which rely on trial and error. This procedure considerably widens the range of potential hardware for statistical learning to any system with enough controllable parameters, even if the details of the system are poorly known. Aeqprop also establishes that in natural (bio)physical systems, genuine gradient-based statistical learning may result from generic, relatively simple mechanisms, without backpropagation and its requirement for analytic knowledge of partial derivatives.

View on arXiv PDF

Similar