CVNov 25, 2025

ShelfRectNet: Single View Shelf Image Rectification with Homography Estimation

arXiv:2511.20335v1
Originality Incremental advance
AI Analysis

This provides an incremental improvement for retail applications by enabling better shelf monitoring and product alignment from arbitrary angles.

The paper tackles the problem of rectifying shelf images from a single viewpoint by estimating a homography matrix, achieving a mean corner error of 1.298 pixels and demonstrating competitive accuracy and speed compared to existing methods.

Estimating homography from a single image remains a challenging yet practically valuable task, particularly in domains like retail, where only one viewpoint is typically available for shelf monitoring and product alignment. In this paper, we present a deep learning framework that predicts a 4-point parameterized homography matrix to rectify shelf images captured from arbitrary angles. Our model leverages a ConvNeXt-based backbone for enhanced feature representation and adopts normalized coordinate regression for improved stability. To address data scarcity and promote generalization, we introduce a novel augmentation strategy by modeling and sampling synthetic homographies. Our method achieves a mean corner error of 1.298 pixels on the test set. When compared with both classical computer vision and deep learning-based approaches, our method demonstrates competitive performance in both accuracy and inference speed. Together, these results establish our approach as a robust and efficient solution for realworld single-view rectification. To encourage further research in this domain, we will make our dataset, ShelfRectSet, and code publicly available

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes