CVDec 26, 2018

RegNet: Learning the Optimization of Direct Image-to-Image Pose Registration

arXiv:1812.10212v118 citations
Originality Incremental advance
AI Analysis

This addresses a problem in computer vision for applications like robotics and AR/VR by improving pose registration robustness and speed, though it is incremental as it builds on deep learning approaches.

The paper tackles the limited convergence range and sensitivity to lighting in direct image-to-image pose registration by proposing RegNet, an end-to-end network that learns feature representations and partial derivatives, resulting in more robust and faster convergence with fewer iterations for large-baseline image pairs.

Direct image-to-image alignment that relies on the optimization of photometric error metrics suffers from limited convergence range and sensitivity to lighting conditions. Deep learning approaches has been applied to address this problem by learning better feature representations using convolutional neural networks, yet still require a good initialization. In this paper, we demonstrate that the inaccurate numerical Jacobian limits the convergence range which could be improved greatly using learned approaches. Based on this observation, we propose a novel end-to-end network, RegNet, to learn the optimization of image-to-image pose registration. By jointly learning feature representation for each pixel and partial derivatives that replace handcrafted ones (e.g., numerical differentiation) in the optimization step, the neural network facilitates end-to-end optimization. The energy landscape is constrained on both the feature representation and the learned Jacobian, hence providing more flexibility for the optimization as a consequence leads to more robust and faster convergence. In a series of experiments, including a broad ablation study, we demonstrate that RegNet is able to converge for large-baseline image pairs with fewer iterations.

Code Implementations2 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes