CVLGJun 24, 2020

DISK: Learning local features with policy gradient

arXiv:2006.13566v2570 citations
AI Analysis

This addresses the problem of sparse keypoint selection and matching in computer vision, offering a novel approach that could improve feature extraction for tasks like image matching.

The paper tackles the challenge of learning local feature frameworks end-to-end by introducing DISK, a method that uses reinforcement learning to optimize for correct feature matches, achieving state-of-the-art results on three public benchmarks.

Local feature frameworks are difficult to learn in an end-to-end fashion, due to the discreteness inherent to the selection and matching of sparse keypoints. We introduce DISK (DIScrete Keypoints), a novel method that overcomes these obstacles by leveraging principles from Reinforcement Learning (RL), optimizing end-to-end for a high number of correct feature matches. Our simple yet expressive probabilistic model lets us keep the training and inference regimes close, while maintaining good enough convergence properties to reliably train from scratch. Our features can be extracted very densely while remaining discriminative, challenging commonly held assumptions about what constitutes a good keypoint, as showcased in Fig. 1, and deliver state-of-the-art results on three public benchmarks.

Code Implementations2 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes