CVDec 2, 2021

Event Neural Networks

arXiv:2112.00891v210 citations
Originality Highly original
AI Analysis

This addresses efficiency issues for video processing tasks like pose recognition and object detection, offering a novel method for known bottlenecks.

The paper tackles the problem of computational redundancy in video inference by proposing Event Neural Networks (EvNets), which leverage long-term memory in neurons to achieve roughly an order-of-magnitude reduction in computational costs with minimal accuracy loss.

Video data is often repetitive; for example, the contents of adjacent frames are usually strongly correlated. Such redundancy occurs at multiple levels of complexity, from low-level pixel values to textures and high-level semantics. We propose Event Neural Networks (EvNets), which leverage this redundancy to achieve considerable computation savings during video inference. A defining characteristic of EvNets is that each neuron has state variables that provide it with long-term memory, which allows low-cost, high-accuracy inference even in the presence of significant camera motion. We show that it is possible to transform a wide range of neural networks into EvNets without re-training. We demonstrate our method on state-of-the-art architectures for both high- and low-level visual processing, including pose recognition, object detection, optical flow, and image enhancement. We observe roughly an order-of-magnitude reduction in computational costs compared to conventional networks, with minimal reductions in model accuracy.

Code Implementations2 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes