CVApr 24

Depth-Aware Rover: A Study of Edge AI and Monocular Vision for Real-World Implementation

arXiv:2604.223317.0h-index: 1
AI Analysis

For robotics researchers, this paper provides an incremental comparison of depth estimation methods in a specific rover application.

The study compares stereo and monocular depth estimation for rover navigation, finding monocular depth estimation with UniDepthV2 more robust and cost-effective in real-world deployment, achieving 0.1 FPS for depth and 10 FPS for detection.

This study analyses simulated and real-world implementations of depth-aware rover navigation, highlighting the transition from stereo vision to monocular depth estimation using edge AI. A Unity-based lunar terrain simulator with stereo cameras and OpenCV's StereoSGBM was used to generate disparity maps. A physical rover built on Raspberry Pi 4 employed UniDepthV2 for monocular metric depth estimation and YOLO12n for real-time object detection. While stereo vision yielded higher accuracy in simulation, the monocular approach proved more robust and cost-effective in real-world deployment, achieving 0.1 FPS for depth and 10 FPS for detection.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes