CVROJul 5, 2025

VISC: mmWave Radar Scene Flow Estimation using Pervasive Visual-Inertial Supervision

arXiv:2507.03938v1h-index: 14IROS
Originality Incremental advance
AI Analysis

This addresses the problem of expensive and limited LiDAR data for autonomous vehicles by leveraging more accessible sensors, though it is incremental in improving supervision methods.

The paper tackles scene flow estimation for mmWave radar by using pervasive visual-inertial supervision from smart vehicles, achieving results that outperform state-of-the-art LiDAR-based methods in smoke-filled environments.

This work proposes a mmWave radar's scene flow estimation framework supervised by data from a widespread visual-inertial (VI) sensor suite, allowing crowdsourced training data from smart vehicles. Current scene flow estimation methods for mmWave radar are typically supervised by dense point clouds from 3D LiDARs, which are expensive and not widely available in smart vehicles. While VI data are more accessible, visual images alone cannot capture the 3D motions of moving objects, making it difficult to supervise their scene flow. Moreover, the temporal drift of VI rigid transformation also degenerates the scene flow estimation of static points. To address these challenges, we propose a drift-free rigid transformation estimator that fuses kinematic model-based ego-motions with neural network-learned results. It provides strong supervision signals to radar-based rigid transformation and infers the scene flow of static points. Then, we develop an optical-mmWave supervision extraction module that extracts the supervision signals of radar rigid transformation and scene flow. It strengthens the supervision by learning the scene flow of dynamic points with the joint constraints of optical and mmWave radar measurements. Extensive experiments demonstrate that, in smoke-filled environments, our method even outperforms state-of-the-art (SOTA) approaches using costly LiDARs.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes