IRCVNov 27, 2025

UNION: A Lightweight Target Representation for Efficient Zero-Shot Image-Guided Retrieval with Optional Textual Queries

arXiv:2511.22253v1
Originality Incremental advance
AI Analysis

This work addresses efficient zero-shot retrieval for tasks like composed image retrieval and sketch-based image retrieval, offering a generalizable solution with minimal training data, though it is incremental as it builds on existing vision-language models.

The paper tackled the problem of image-guided retrieval with optional text (IGROT) under low-data supervision by introducing UNION, a lightweight target representation that fuses image embeddings with a null-text prompt, achieving competitive results such as a CIRCO mAP@50 of 38.5 and Sketchy mAP@200 of 82.7 with only 5,000 training samples.

Image-Guided Retrieval with Optional Text (IGROT) is a general retrieval setting where a query consists of an anchor image, with or without accompanying text, aiming to retrieve semantically relevant target images. This formulation unifies two major tasks: Composed Image Retrieval (CIR) and Sketch-Based Image Retrieval (SBIR). In this work, we address IGROT under low-data supervision by introducing UNION, a lightweight and generalisable target representation that fuses the image embedding with a null-text prompt. Unlike traditional approaches that rely on fixed target features, UNION enhances semantic alignment with multimodal queries while requiring no architectural modifications to pretrained vision-language models. With only 5,000 training samples - from LlavaSCo for CIR and Training-Sketchy for SBIR - our method achieves competitive results across benchmarks, including CIRCO mAP@50 of 38.5 and Sketchy mAP@200 of 82.7, surpassing many heavily supervised baselines. This demonstrates the robustness and efficiency of UNION in bridging vision and language across diverse query types.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes