CVAISep 18, 2024

EventAug: Multifaceted Spatio-Temporal Data Augmentation Methods for Event-based Learning

arXiv:2409.11813v1h-index: 14
Originality Synthesis-oriented
AI Analysis

This addresses data scarcity for event camera applications, though it is incremental as it adapts existing augmentation ideas to a specific domain.

The paper tackles data deficiency and limited diversity in event-based learning by introducing EventAug, a systematic spatio-temporal data augmentation scheme, which improves model robustness and achieves a 4.87% accuracy gain on DVS128 Gesture.

The event camera has demonstrated significant success across a wide range of areas due to its low time latency and high dynamic range. However, the community faces challenges such as data deficiency and limited diversity, often resulting in over-fitting and inadequate feature learning. Notably, the exploration of data augmentation techniques in the event community remains scarce. This work aims to address this gap by introducing a systematic augmentation scheme named EventAug to enrich spatial-temporal diversity. In particular, we first propose Multi-scale Temporal Integration (MSTI) to diversify the motion speed of objects, then introduce Spatial-salient Event Mask (SSEM) and Temporal-salient Event Mask (TSEM) to enrich object variants. Our EventAug can facilitate models learning with richer motion patterns, object variants and local spatio-temporal relations, thus improving model robustness to varied moving speeds, occlusions, and action disruptions. Experiment results show that our augmentation method consistently yields significant improvements across different tasks and backbones (e.g., a 4.87% accuracy gain on DVS128 Gesture). Our code will be publicly available for this community.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes