CVJun 8, 2025

UNO: Unified Self-Supervised Monocular Odometry for Platform-Agnostic Deployment

arXiv:2506.07013v14 citationsh-index: 8CAAI Trans Intell Technol
Originality Incremental advance
AI Analysis

This addresses the need for adaptable pose estimation in applications such as autonomous vehicles, drones, and mobile robots, though it appears incremental as it builds on existing self-supervised methods.

The paper tackles the problem of robust monocular visual odometry across diverse platforms and environments by introducing UNO, a unified framework that achieves state-of-the-art performance on benchmarks like KITTI, EuRoC-MAV, and TUM-RGBD.

This work presents UNO, a unified monocular visual odometry framework that enables robust and adaptable pose estimation across diverse environments, platforms, and motion patterns. Unlike traditional methods that rely on deployment-specific tuning or predefined motion priors, our approach generalizes effectively across a wide range of real-world scenarios, including autonomous vehicles, aerial drones, mobile robots, and handheld devices. To this end, we introduce a Mixture-of-Experts strategy for local state estimation, with several specialized decoders that each handle a distinct class of ego-motion patterns. Moreover, we introduce a fully differentiable Gumbel-Softmax module that constructs a robust inter-frame correlation graph, selects the optimal expert decoder, and prunes erroneous estimates. These cues are then fed into a unified back-end that combines pre-trained, scale-independent depth priors with a lightweight bundling adjustment to enforce geometric consistency. We extensively evaluate our method on three major benchmark datasets: KITTI (outdoor/autonomous driving), EuRoC-MAV (indoor/aerial drones), and TUM-RGBD (indoor/handheld), demonstrating state-of-the-art performance.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes