CVFeb 14, 2020

End-to-end Learning of Object Motion Estimation from Retinal Events for Event-based Object Tracking

arXiv:2002.05911v161 citations
AI Analysis

This addresses object tracking for event camera applications, offering an incremental improvement with a novel representation and network.

The paper tackles object motion estimation for event-based object tracking by proposing a deep neural network that regresses a 5-DoF parametric motion model, showing superiority in challenging environments like fast motion and low illumination compared to state-of-the-art methods.

Event cameras, which are asynchronous bio-inspired vision sensors, have shown great potential in computer vision and artificial intelligence. However, the application of event cameras to object-level motion estimation or tracking is still in its infancy. The main idea behind this work is to propose a novel deep neural network to learn and regress a parametric object-level motion/transform model for event-based object tracking. To achieve this goal, we propose a synchronous Time-Surface with Linear Time Decay (TSLTD) representation, which effectively encodes the spatio-temporal information of asynchronous retinal events into TSLTD frames with clear motion patterns. We feed the sequence of TSLTD frames to a novel Retinal Motion Regression Network (RMRNet) to perform an end-to-end 5-DoF object motion regression. Our method is compared with state-of-the-art object tracking methods, that are based on conventional cameras or event cameras. The experimental results show the superiority of our method in handling various challenging environments such as fast motion and low illumination conditions.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes