CVIVNov 28, 2019

Motion Equivariance OF Event-based Camera Data with the Temporal Normalization Transform

arXiv:1911.12801v12 citations
Originality Synthesis-oriented
AI Analysis

This work addresses motion robustness in event-based vision, which is incremental as it builds on existing equivariance studies for a specific domain.

The paper tackles the problem of CNN robustness to unseen motions in event-based camera object recognition by exploring equivariance properties, aiming to produce predictable feature deformations under input transformations.

In this work, we focus on using convolution neural networks (CNN) to perform object recognition on the event data. In object recognition, it is important for a neural network to be robust to the variations of the data during testing. For traditional cameras, translations are well handled because CNNs are naturally equivariant to translations. However, because event cameras record the change of light intensity of an image, the geometric shape of event volumes will not only depend on the objects but also on their relative motions with respect to the camera. The deformation of the events caused by motions causes the CNN to be less robust to unseen motions during inference. To address this problem, we would like to explore the equivariance property of CNNs, a well-studied area that demonstrates to produce predictable deformation of features under certain transformations of the input image.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes