ROCVSep 16, 2025

Unleashing the Power of Discrete-Time State Representation: Ultrafast Target-based IMU-Camera Spatial-Temporal Calibration

arXiv:2509.12846v11 citationsh-index: 6Has Code
Originality Incremental advance
AI Analysis

This work addresses a bottleneck in visual-inertial fusion for applications like robot navigation and augmented reality, offering an incremental improvement in calibration efficiency.

The paper tackles the high computational cost of continuous-time state representation in IMU-camera spatial-temporal calibration by proposing a novel discrete-time method, achieving extremely efficient calibration with potential time savings of one minute per device, which could total 2083 work days for one million devices.

Visual-inertial fusion is crucial for a large amount of intelligent and autonomous applications, such as robot navigation and augmented reality. To bootstrap and achieve optimal state estimation, the spatial-temporal displacements between IMU and cameras must be calibrated in advance. Most existing calibration methods adopt continuous-time state representation, more specifically the B-spline. Despite these methods achieve precise spatial-temporal calibration, they suffer from high computational cost caused by continuous-time state representation. To this end, we propose a novel and extremely efficient calibration method that unleashes the power of discrete-time state representation. Moreover, the weakness of discrete-time state representation in temporal calibration is tackled in this paper. With the increasing production of drones, cellphones and other visual-inertial platforms, if one million devices need calibration around the world, saving one minute for the calibration of each device means saving 2083 work days in total. To benefit both the research and industry communities, our code will be open-source.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes