CVAug 7, 2025

Robust Image Stitching with Optimal Plane

arXiv:2508.05903v13 citationsh-index: 18Has CodeIEEE Trans Vis Comput Graph
Originality Incremental advance
AI Analysis

This work addresses image stitching challenges for computer vision applications, offering incremental advancements in robustness and naturalness.

The paper tackles the problem of robust and natural image stitching by proposing RopStitch, an unsupervised deep framework that incorporates a dual-branch architecture for content perception and uses virtual optimal planes to resolve alignment-structure conflicts, achieving significant improvements in scene robustness and content naturalness across diverse datasets.

We present \textit{RopStitch}, an unsupervised deep image stitching framework with both robustness and naturalness. To ensure the robustness of \textit{RopStitch}, we propose to incorporate the universal prior of content perception into the image stitching model by a dual-branch architecture. It separately captures coarse and fine features and integrates them to achieve highly generalizable performance across diverse unseen real-world scenes. Concretely, the dual-branch model consists of a pretrained branch to capture semantically invariant representations and a learnable branch to extract fine-grained discriminative features, which are then merged into a whole by a controllable factor at the correlation level. Besides, considering that content alignment and structural preservation are often contradictory to each other, we propose a concept of virtual optimal planes to relieve this conflict. To this end, we model this problem as a process of estimating homography decomposition coefficients, and design an iterative coefficient predictor and minimal semantic distortion constraint to identify the optimal plane. This scheme is finally incorporated into \textit{RopStitch} by warping both views onto the optimal plane bidirectionally. Extensive experiments across various datasets demonstrate that \textit{RopStitch} significantly outperforms existing methods, particularly in scene robustness and content naturalness. The code is available at {\color{red}https://github.com/MmelodYy/RopStitch}.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes