CVFeb 13

Unbiased Gradient Estimation for Event Binning via Functional Backpropagation

arXiv:2602.12590v1h-index: 24Has Code
Originality Incremental advance
AI Analysis

This addresses a key bottleneck for researchers and engineers in event-based vision, enabling more efficient learning from raw events, though it is incremental as it builds on existing binning methods.

The paper tackles the problem of biased gradient estimation in event-based vision due to discontinuous binning functions, proposing a framework for unbiased gradient estimation via functional backpropagation that improves learning efficiency. Results include a 3.2% lower RMS error in egomotion estimation, 9.4% lower EPE in optical flow, and 5.1% lower RMS error in SLAM.

Event-based vision encodes dynamic scenes as asynchronous spatio-temporal spikes called events. To leverage conventional image processing pipelines, events are typically binned into frames. However, binning functions are discontinuous, which truncates gradients at the frame level and forces most event-based algorithms to rely solely on frame-based features. Attempts to directly learn from raw events avoid this restriction but instead suffer from biased gradient estimation due to the discontinuities of the binning operation, ultimately limiting their learning efficiency. To address this challenge, we propose a novel framework for unbiased gradient estimation of arbitrary binning functions by synthesizing weak derivatives during backpropagation while keeping the forward output unchanged. The key idea is to exploit integration by parts: lifting the target functions to functionals yields an integral form of the derivative of the binning function during backpropagation, where the cotangent function naturally arises. By reconstructing this cotangent function from the sampled cotangent vector, we compute weak derivatives that provably match long-range finite differences of both smooth and non-smooth targets. Experimentally, our method improves simple optimization-based egomotion estimation with 3.2\% lower RMS error and 1.57$\times$ faster convergence. On complex downstream tasks, we achieve 9.4\% lower EPE in self-supervised optical flow, and 5.1\% lower RMS error in SLAM, demonstrating broad benefits for event-based visual perception. Source code can be found at https://github.com/chjz1024/EventFBP.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes