CVAIOct 21, 2022

Real-time Detection of 2D Tool Landmarks with Synthetic Training Data

arXiv:2210.11991v13 citationsh-index: 19
Originality Incremental advance
AI Analysis

This addresses the problem of reducing manual labeling effort for tool landmark detection in computer vision, though it is incremental as it builds on existing keypoint detection approaches.

The paper tackles real-time 2D landmark detection for physical tools like hammers and screwdrivers by training a deep learning model on synthetic data, achieving better performance than existing methods when trained synthetically.

In this paper a deep learning architecture is presented that can, in real time, detect the 2D locations of certain landmarks of physical tools, such as a hammer or screwdriver. To avoid the labor of manual labeling, the network is trained on synthetically generated data. Training computer vision models on computer generated images, while still achieving good accuracy on real images, is a challenge due to the difference in domain. The proposed method uses an advanced rendering method in combination with transfer learning and an intermediate supervision architecture to address this problem. It is shown that the model presented in this paper, named Intermediate Heatmap Model (IHM), generalizes to real images when trained on synthetic data. To avoid the need for an exact textured 3D model of the tool in question, it is shown that the model will generalize to an unseen tool when trained on a set of different 3D models of the same type of tool. IHM is compared to two existing approaches to keypoint detection and it is shown that it outperforms those at detecting tool landmarks, trained on synthetic data.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes