ROMar 22

Motion as a Sensing Modality for Metric Scale in Monocular Visual-Inertial Odometry

arXiv:2603.2674012.1h-index: 1
AI Analysis

For practitioners of monocular VIO, this work provides a theoretical and practical basis for improving metric scale accuracy through trajectory design.

Monocular VIO cannot recover metric scale from vision alone; the paper shows that translational acceleration from curved motion is essential for scale observability. Experiments demonstrate that figure-eight motion reduces scale error to 4.8% compared to 9.2% for straight-line motion.

Monocular visual-inertial odometry (VIO) cannot recover metric scale from vision alone; scale must be resolved through inertial measurements. We present a trajectory-dependent observability analysis showing that translational acceleration, produced by curvature, not constant-speed straight-line travel, is the fundamental source that couples scale to the inertial state. This relationship is formalized through the gravity-acceleration asymmetry in the IMU model, from which we derive rank conditions on the observability matrix and propose a lightweight excitation metric computable from raw IMU data. Controlled experiments on a differential-drive robot with a monocular camera and consumer-grade IMU validate the theory, with straight-line motion yielding 9.2% scale error, circular motion 6.4%, and figure-eight motion 4.8%, with excitation spanning four orders of magnitude. These results establish trajectory design as a practical mechanism for improving metric scale recovery.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes