CVJan 24, 2017

Improved Descriptors for Patch Matching and Reconstruction

arXiv:1701.06854v43 citations
Originality Incremental advance
AI Analysis

This work addresses the need for more accurate descriptors in computer vision tasks like 3D reconstruction, though it appears incremental as it builds on existing ConvNet methods.

The authors tackled the problem of local image descriptor learning for patch matching and 3D reconstruction by proposing a convolutional neural network-based approach, which outperformed state-of-the-art descriptors on multiple public datasets.

We propose a convolutional neural network (ConvNet) based approach for learning local image descriptors which can be used for significantly improved patch matching and 3D reconstructions. A multi-resolution ConvNet is used for learning keypoint descriptors. We also propose a new dataset consisting of an order of magnitude more number of scenes, images, and positive and negative correspondences compared to the currently available Multi-View Stereo (MVS) [18] dataset. The new dataset also has better coverage of the overall viewpoint, scale, and lighting changes in comparison to the MVS dataset. We evaluate our approach on publicly available datasets, such as Oxford Affine Covariant Regions Dataset (ACRD) [12], MVS [18], Synthetic [6] and Strecha [15] datasets to quantify the image descriptor performance. Scenes from the Oxford ACRD, MVS and Synthetic datasets are used for evaluating the patch matching performance of the learnt descriptors while the Strecha dataset is used to evaluate the 3D reconstruction task. Experiments show that the proposed descriptor outperforms the current state-of-the-art descriptors in both the evaluation tasks.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes