CVROSep 21, 2025

SLAM-Former: Putting SLAM into One Transformer

arXiv:2509.16909v13 citationsh-index: 2
Originality Incremental advance
AI Analysis

This addresses SLAM for robotics and computer vision applications, presenting a novel neural approach that is incremental in integrating transformers into SLAM.

The paper tackles the problem of Simultaneous Localization and Mapping (SLAM) by integrating full SLAM capabilities into a single transformer, achieving superior or highly competitive performance compared to state-of-the-art dense SLAM methods.

We present SLAM-Former, a novel neural approach that integrates full SLAM capabilities into a single transformer. Similar to traditional SLAM systems, SLAM-Former comprises both a frontend and a backend that operate in tandem. The frontend processes sequential monocular images in real-time for incremental mapping and tracking, while the backend performs global refinement to ensure a geometrically consistent result. This alternating execution allows the frontend and backend to mutually promote one another, enhancing overall system performance. Comprehensive experimental results demonstrate that SLAM-Former achieves superior or highly competitive performance compared to state-of-the-art dense SLAM methods.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes