CVIRLGMLJan 18, 2020

Stacked Adversarial Network for Zero-Shot Sketch based Image Retrieval

arXiv:2001.06657v133 citations
AI Analysis

This addresses the practical limitation in sketch-based image retrieval where not all classes are available during training, benefiting applications like visual search with incomplete data.

The paper tackles the problem of retrieving images from sketches of unseen classes by proposing a Stacked Adversarial Network combined with a Siamese Network, which reduces sketch-based image retrieval to an image-to-image retrieval problem and shows significant improvements in both standard and generalized zero-shot learning settings on TU-Berlin and Sketchy databases.

Conventional approaches to Sketch-Based Image Retrieval (SBIR) assume that the data of all the classes are available during training. The assumption may not always be practical since the data of a few classes may be unavailable, or the classes may not appear at the time of training. Zero-Shot Sketch-Based Image Retrieval (ZS-SBIR) relaxes this constraint and allows the algorithm to handle previously unseen classes during the test. This paper proposes a generative approach based on the Stacked Adversarial Network (SAN) and the advantage of Siamese Network (SN) for ZS-SBIR. While SAN generates a high-quality sample, SN learns a better distance metric compared to that of the nearest neighbor search. The capability of the generative model to synthesize image features based on the sketch reduces the SBIR problem to that of an image-to-image retrieval problem. We evaluate the efficacy of our proposed approach on TU-Berlin, and Sketchy database in both standard ZSL and generalized ZSL setting. The proposed method yields a significant improvement in standard ZSL as well as in a more challenging generalized ZSL setting (GZSL) for SBIR.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes