CVNEJan 30, 2015

Sketch-a-Net that Beats Humans

arXiv:1501.07873v354 citations
Originality Highly original
AI Analysis

It solves the problem of accurate sketch recognition for applications like image retrieval and human-computer interaction, representing a significant advancement over existing methods.

The paper tackled sketch recognition by proposing a multi-scale multi-channel deep neural network framework that explicitly embeds sketch characteristics, achieving performance surpassing humans on the largest human sketch dataset to date.

We propose a multi-scale multi-channel deep neural network framework that, for the first time, yields sketch recognition performance surpassing that of humans. Our superior performance is a result of explicitly embedding the unique characteristics of sketches in our model: (i) a network architecture designed for sketch rather than natural photo statistics, (ii) a multi-channel generalisation that encodes sequential ordering in the sketching process, and (iii) a multi-scale network ensemble with joint Bayesian fusion that accounts for the different levels of abstraction exhibited in free-hand sketches. We show that state-of-the-art deep networks specifically engineered for photos of natural objects fail to perform well on sketch recognition, regardless whether they are trained using photo or sketch. Our network on the other hand not only delivers the best performance on the largest human sketch dataset to date, but also is small in size making efficient training possible using just CPUs.

Code Implementations3 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes