IVCVNov 27, 2025

ColonAdapter: Geometry Estimation Through Foundation Model Adaptation for Colonoscopy

arXiv:2511.22250v11 citations
Originality Incremental advance
AI Analysis

This work addresses the problem of inaccurate geometry estimation in colonoscopy for medical imaging applications, representing an incremental improvement by fine-tuning existing models for a specific domain.

The paper tackled the challenge of estimating 3D geometry from monocular colonoscopy images by adapting geometric foundation models to handle clinical scenes with specularity and textureless regions, achieving state-of-the-art performance in camera pose estimation, monocular depth prediction, and dense 3D point map reconstruction.

Estimating 3D geometry from monocular colonoscopy images is challenging due to non-Lambertian surfaces, moving light sources, and large textureless regions. While recent 3D geometric foundation models eliminate the need for multi-stage pipelines, their performance deteriorates in clinical scenes. These models are primarily trained on natural scene datasets and struggle with specularity and homogeneous textures typical in colonoscopy, leading to inaccurate geometry estimation. In this paper, we present ColonAdapter, a self-supervised fine-tuning framework that adapts geometric foundation models for colonoscopy geometry estimation. Our method leverages pretrained geometric priors while tailoring them to clinical data. To improve performance in low-texture regions and ensure scale consistency, we introduce a Detail Restoration Module (DRM) and a geometry consistency loss. Furthermore, a confidence-weighted photometric loss enhances training stability in clinical environments. Experiments on both synthetic and real datasets demonstrate that our approach achieves state-of-the-art performance in camera pose estimation, monocular depth prediction, and dense 3D point map reconstruction, without requiring ground-truth intrinsic parameters.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes