NCCVMar 26, 2020

Going in circles is the way forward: the role of recurrence in visual inference

arXiv:2003.12128v396 citations
AI Analysis

This is a conceptual argument for shifting paradigms in visual recognition models, potentially impacting both neuroscience and AI engineering, but it is incremental as it builds on known insights about RNNs without presenting new empirical results.

The paper argues that recurrent neural networks (RNNs) are more general than feedforward neural networks (FNNs) for visual inference, challenging the common reliance on feedforward models in computational neuroscience and computer vision, and highlights five potential benefits of recurrence such as increased computational depth and hardware compression.

Biological visual systems exhibit abundant recurrent connectivity. State-of-the-art neural network models for visual recognition, by contrast, rely heavily or exclusively on feedforward computation. Any finite-time recurrent neural network (RNN) can be unrolled along time to yield an equivalent feedforward neural network (FNN). This important insight suggests that computational neuroscientists may not need to engage recurrent computation, and that computer-vision engineers may be limiting themselves to a special case of FNN if they build recurrent models. Here we argue, to the contrary, that FNNs are a special case of RNNs and that computational neuroscientists and engineers should engage recurrence to understand how brains and machines can (1) achieve greater and more flexible computational depth, (2) compress complex computations into limited hardware, (3) integrate priors and priorities into visual inference through expectation and attention, (4) exploit sequential dependencies in their data for better inference and prediction, and (5) leverage the power of iterative computation.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes