CVApr 16, 2025

CodingHomo: Bootstrapping Deep Homography With Video Coding

arXiv:2504.12165v14 citationsh-index: 10Has CodeIEEE transactions on circuits and systems for video technology (Print)
Originality Incremental advance
AI Analysis

This work addresses homography estimation for computer vision applications, presenting an incremental improvement through a novel integration of video coding techniques.

The paper tackles the challenge of accurate homography estimation in complex motions by introducing CodingHomo, an unsupervised framework that leverages video coding motion vectors, resulting in improved performance over state-of-the-art unsupervised methods with enhanced robustness and generalizability.

Homography estimation is a fundamental task in computer vision with applications in diverse fields. Recent advances in deep learning have improved homography estimation, particularly with unsupervised learning approaches, offering increased robustness and generalizability. However, accurately predicting homography, especially in complex motions, remains a challenge. In response, this work introduces a novel method leveraging video coding, particularly by harnessing inherent motion vectors (MVs) present in videos. We present CodingHomo, an unsupervised framework for homography estimation. Our framework features a Mask-Guided Fusion (MGF) module that identifies and utilizes beneficial features among the MVs, thereby enhancing the accuracy of homography prediction. Additionally, the Mask-Guided Homography Estimation (MGHE) module is presented for eliminating undesired features in the coarse-to-fine homography refinement process. CodingHomo outperforms existing state-of-the-art unsupervised methods, delivering good robustness and generalizability. The code and dataset are available at: \href{github}{https://github.com/liuyike422/CodingHomo

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes