ROAIMar 9

Long-Short Term Agents for Pure-Vision Bronchoscopy Robotic Autonomy

arXiv:2603.07909v1
Predicted impact top 30% in RO · last 90 daysOriginality Highly original
AI Analysis

This work addresses the problem of accurate intraoperative navigation for robot-assisted endoluminal intervention, specifically for bronchoscopy, by eliminating the need for external localization technologies, which simplifies hardware and reduces vulnerability to anatomical mismatch.

This paper presents a vision-only autonomy framework for long-horizon bronchoscopic navigation using preoperative CT-derived virtual targets and live endoscopic video. The system achieved 100% success to all planned segmental targets in a phantom, 80% success to the eighth generation ex vivo, and comparable in vivo navigation performance to an expert bronchoscopist.

Accurate intraoperative navigation is essential for robot-assisted endoluminal intervention, but remains difficult because of limited endoscopic field of view and dynamic artifacts. Existing navigation platforms often rely on external localization technologies, such as electromagnetic tracking or shape sensing, which increase hardware complexity and remain vulnerable to intraoperative anatomical mismatch. We present a vision-only autonomy framework that performs long-horizon bronchoscopic navigation using preoperative CT-derived virtual targets and live endoscopic video, without external tracking during navigation. The framework uses hierarchical long-short agents: a short-term reactive agent for continuous low-latency motion control, and a long-term strategic agent for decision support at anatomically ambiguous points. When their recommendations conflict, a world-model critic predicts future visual states for candidate actions and selects the action whose predicted state best matches the target view. We evaluated the system in a high-fidelity airway phantom, three ex vivo porcine lungs, and a live porcine model. The system reached all planned segmental targets in the phantom, maintained 80\% success to the eighth generation ex vivo, and achieved in vivo navigation performance comparable to the expert bronchoscopist. These results support the preclinical feasibility of sensor-free autonomous bronchoscopic navigation.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes