CVLGSep 2, 2019

Reusing Convolutional Activations from Frame to Frame to Speed up Training and Inference

arXiv:1909.05632v2
Originality Synthesis-oriented
AI Analysis

This incremental improvement addresses efficiency for domains like video processing and time series analysis.

The paper tackles the problem of redundant computation in convolutional neural networks for sequential data by reusing activations from previous frames, resulting in computational savings for both training and inference.

When processing similar frames in succession, we can take advantage of the locality of the convolution operation to reevaluate only portions of the image that changed from the previous frame. By saving the output of a layer of convolutions and calculating the change from frame to frame, we can reuse previous activations and save computational resources that would otherwise be wasted recalculating convolutions whose outputs we have already observed. This technique can be applied to many domains, such as processing videos from stationary video cameras, studying the effects of occluding or distorting sections of images, applying convolution to multiple frames of audio or time series data, or playing Atari games. Furthermore, this technique can be applied to speed up both training and inference.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes