ROMar 6

KISS-IMU: Self-supervised Inertial Odometry with Motion-balanced Learning and Uncertainty-aware Inference

arXiv:2603.06205v1
Predicted impact top 69% in RO · last 90 daysOriginality Highly original
AI Analysis

This work addresses the scalability and generalization limitations of inertial odometry for robotic systems by removing the dependency on ground truth data during training, which is a significant problem for deploying robots in unseen and diverse environments.

This paper proposes KISS-IMU, a self-supervised inertial odometry framework that eliminates the need for ground truth data by using LiDAR-based ICP registration and pose graph optimization as a supervisory signal. It achieves robust performance across diverse motion patterns and scenarios by employing motion-aware balanced training and uncertainty-driven adaptive weighting during inference.

Inertial measurement units (IMUs), which provide high-frequency linear acceleration and angular velocity measurements, serve as fundamental sensing modalities in robotic systems. Recent advances in deep neural networks have led to remarkable progress in inertial odometry. However, the heavy reliance on ground truth data during training fundamentally limits scalability and generalization to unseen and diverse environments. We propose KISS-IMU, a novel self-supervised inertial odometry framework that eliminates ground truth dependency by leveraging simple LiDAR-based ICP registration and pose graph optimization as a supervisory signal. Our approach embodies two key principles: keeping the IMU stable through motion-aware balanced training and keeping the IMU strong through uncertainty-driven adaptive weighting during inference. To evaluate performance across diverse motion patterns and scenarios, we conducted comprehensive experiments on various real-world platforms, including quadruped robots. Importantly, we train only the IMU network in a self-supervised manner, with LiDAR serving solely as a lightweight supervisory signal rather than requiring additional learnable processes. This design enables the framework to ensure robustness without relying on joint multi-modal learning or ground truth supervision. The supplementary materials are available at https://sparolab.github.io/research/kiss_imu.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes