CVApr 23, 2024

Multi-Session SLAM with Differentiable Wide-Baseline Pose Optimization

arXiv:2404.15263v18 citationsh-index: 7Has CodeCVPR
Originality Incremental advance
AI Analysis

This addresses the challenge of connecting disjoint video sequences for applications like robotics or augmented reality, representing an incremental improvement with a novel differentiable solver.

The paper tackles the problem of Multi-Session SLAM by introducing a system that tracks camera motion across multiple disjoint videos under a global reference, achieving accurate and robust performance compared to existing approaches.

We introduce a new system for Multi-Session SLAM, which tracks camera motion across multiple disjoint videos under a single global reference. Our approach couples the prediction of optical flow with solver layers to estimate camera pose. The backbone is trained end-to-end using a novel differentiable solver for wide-baseline two-view pose. The full system can connect disjoint sequences, perform visual odometry, and global optimization. Compared to existing approaches, our design is accurate and robust to catastrophic failures. Code is available at github.com/princeton-vl/MultiSlam_DiffPose

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes