CVDec 4, 2025

IE2Video: Adapting Pretrained Diffusion Models for Event-Based Video Reconstruction

arXiv:2512.05240v1h-index: 4
Originality Incremental advance
AI Analysis

This work addresses power constraints in surveillance, robotics, and wearable systems by enabling low-power event-based sensing while maintaining standard video output, though it is incremental as it adapts existing diffusion models.

The paper tackles the problem of reconstructing RGB video from sparse RGB keyframes and continuous event streams to reduce power consumption in continuous video monitoring, achieving a 33% improvement in perceptual quality over an autoregressive baseline.

Continuous video monitoring in surveillance, robotics, and wearable systems faces a fundamental power constraint: conventional RGB cameras consume substantial energy through fixed-rate capture. Event cameras offer sparse, motion-driven sensing with low power consumption, but produce asynchronous event streams rather than RGB video. We propose a hybrid capture paradigm that records sparse RGB keyframes alongside continuous event streams, then reconstructs full RGB video offline -- reducing capture power consumption while maintaining standard video output for downstream applications. We introduce the Image and Event to Video (IE2Video) task: reconstructing RGB video sequences from a single initial frame and subsequent event camera data. We investigate two architectural strategies: adapting an autoregressive model (HyperE2VID) for RGB generation, and injecting event representations into a pretrained text-to-video diffusion model (LTX) via learned encoders and low-rank adaptation. Our experiments demonstrate that the diffusion-based approach achieves 33\% better perceptual quality than the autoregressive baseline (0.283 vs 0.422 LPIPS). We validate our approach across three event camera datasets (BS-ERGB, HS-ERGB far/close) at varying sequence lengths (32-128 frames), demonstrating robust cross-dataset generalization with strong performance on unseen capture configurations.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes