CVApr 14, 2015

Sketch-based 3D Shape Retrieval using Convolutional Neural Networks

arXiv:1504.03504v1381 citations
Originality Highly original
AI Analysis

This work solves the problem of sketch-based 3D shape retrieval for graphics and computer vision applications, offering a novel method that reduces ambiguity and improves accuracy, though it is incremental in its domain-specific focus.

The paper tackles the problem of retrieving 3D shapes from 2D sketches by addressing issues with subjective 'best views' and hand-crafted features in existing methods, proposing a minimalist approach with only two predefined views and Siamese CNNs to learn features, resulting in significant outperformance over state-of-the-art approaches across all metrics on three benchmark datasets.

Retrieving 3D models from 2D human sketches has received considerable attention in the areas of graphics, image retrieval, and computer vision. Almost always in state of the art approaches a large amount of "best views" are computed for 3D models, with the hope that the query sketch matches one of these 2D projections of 3D models using predefined features. We argue that this two stage approach (view selection -- matching) is pragmatic but also problematic because the "best views" are subjective and ambiguous, which makes the matching inputs obscure. This imprecise nature of matching further makes it challenging to choose features manually. Instead of relying on the elusive concept of "best views" and the hand-crafted features, we propose to define our views using a minimalism approach and learn features for both sketches and views. Specifically, we drastically reduce the number of views to only two predefined directions for the whole dataset. Then, we learn two Siamese Convolutional Neural Networks (CNNs), one for the views and one for the sketches. The loss function is defined on the within-domain as well as the cross-domain similarities. Our experiments on three benchmark datasets demonstrate that our method is significantly better than state of the art approaches, and outperforms them in all conventional metrics.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes