CVApr 13, 2024

DeDoDe v2: Analyzing and Improving the DeDoDe Keypoint Detector

Johan Edstedt, Georg Bökman, Zhenjun Zhao

arXiv:2404.08928v121.535 citationsh-index: 9Has Code2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW)

Originality Synthesis-oriented

AI Analysis

This work incrementally improves a keypoint detector for computer vision tasks like pose estimation.

The paper tackled issues with the DeDoDe keypoint detector, including clustering, sensitivity to rotations, and evaluation challenges, resulting in improved pose estimation performance from 75.9 to 78.3 mAA on the IMC2022 benchmark.

In this paper, we analyze and improve into the recently proposed DeDoDe keypoint detector. We focus our analysis on some key issues. First, we find that DeDoDe keypoints tend to cluster together, which we fix by performing non-max suppression on the target distribution of the detector during training. Second, we address issues related to data augmentation. In particular, the DeDoDe detector is sensitive to large rotations. We fix this by including 90-degree rotations as well as horizontal flips. Finally, the decoupled nature of the DeDoDe detector makes evaluation of downstream usefulness problematic. We fix this by matching the keypoints with a pretrained dense matcher (RoMa) and evaluating two-view pose estimates. We find that the original long training is detrimental to performance, and therefore propose a much shorter training schedule. We integrate all these improvements into our proposed detector DeDoDe v2 and evaluate it with the original DeDoDe descriptor on the MegaDepth-1500 and IMC2022 benchmarks. Our proposed detector significantly increases pose estimation results, notably from 75.9 to 78.3 mAA on the IMC2022 challenge. Code and weights are available at https://github.com/Parskatt/DeDoDe

View on arXiv PDF Code

Similar