CTAILGNov 28, 2017

Backprop as Functor: A compositional perspective on supervised learning

arXiv:1711.10455v3127 citations
Originality Incremental advance
AI Analysis

This work offers a foundational, compositional framework for understanding supervised learning algorithms, potentially benefiting researchers in machine learning theory and category theory.

The paper tackles the problem of providing a structural perspective on backpropagation by defining a category where update rules in supervised learning can be composed, showing that gradient descent under specific conditions defines a monoidal functor, which generalizes neural networks.

A supervised learning algorithm searches over a set of functions $A \to B$ parametrised by a space $P$ to find the best approximation to some ideal function $f\colon A \to B$. It does this by taking examples $(a,f(a)) \in A\times B$, and updating the parameter according to some rule. We define a category where these update rules may be composed, and show that gradient descent---with respect to a fixed step size and an error function satisfying a certain property---defines a monoidal functor from a category of parametrised functions to this category of update rules. This provides a structural perspective on backpropagation, as well as a broad generalisation of neural networks.

Code Implementations3 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes