CL · Aug 19, 2023

I3: Intent-Introspective Retrieval Conditioned on Instructions

arXiv:2308.10025v2 · 7 citations · h-index: 38
Originality: Highly original
AI Analysis

The paper addresses the limited generalization of retrieval systems, targeting users who need flexible, intent-aware search across domains. It is assessed as a novel method rather than an incremental improvement.

The paper tackles the challenge of dense retrieval models struggling across diverse tasks with distinct search intents by introducing I3, a unified retrieval system that uses instructions to describe intents and achieves state-of-the-art zero-shot performance on the BEIR benchmark without task-specific training.

Recent studies indicate that dense retrieval models struggle to perform well on a wide variety of retrieval tasks that lack dedicated training data, as different retrieval tasks often entail distinct search intents. To address this challenge, in this work we leverage instructions to flexibly describe retrieval intents and introduce I3, a unified retrieval system that performs Intent-Introspective retrieval across various tasks, conditioned on Instructions without any task-specific training. I3 innovatively incorporates a pluggable introspector in a parameter-isolated manner to comprehend specific retrieval intents by jointly reasoning over the input query and instruction, and seamlessly integrates the introspected intent into the original retrieval model for intent-aware retrieval. Furthermore, we propose progressively-pruned intent learning. It utilizes extensive LLM-generated data to train I3 phase-by-phase, embodying two key designs: progressive structure pruning and drawback extrapolation-based data refinement. Extensive experiments show that in the BEIR benchmark, I3 significantly outperforms baseline methods designed with task-specific retrievers, achieving state-of-the-art zero-shot performance without any task-specific tuning.
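To make the idea of a pluggable, parameter-isolated introspector concrete, here is a minimal toy sketch. It is not the authors' implementation: the bag-of-words `encode` function merely stands in for a frozen dense retriever, and `introspect`, `intent_aware_query`, `retrieve`, and the additive merge with a `weight` parameter are all hypothetical names and design choices introduced only for illustration. The point it tries to show is that the base encoder is never modified, and that unplugging the introspector (no instruction) reproduces the base behaviour exactly.

```python
import math
from collections import Counter


def encode(text):
    """Frozen toy encoder: L2-normalised bag of words.

    Assumption: this stands in for the original dense retrieval model.
    """
    counts = Counter(text.lower().split())
    norm = math.sqrt(sum(c * c for c in counts.values())) or 1.0
    return {tok: c / norm for tok, c in counts.items()}


def dot(a, b):
    """Inner product of two sparse vectors stored as dicts."""
    return sum(w * b.get(tok, 0.0) for tok, w in a.items())


def introspect(query, instruction):
    """Hypothetical introspector: reads the query and instruction together
    and returns an intent vector. A real system would use a trained module."""
    return encode(query + " " + instruction)


def intent_aware_query(query, instruction=None, weight=1.0):
    """Merge the introspected intent into the base query representation.

    The additive merge is an assumption made for this sketch.
    """
    base = encode(query)
    if instruction is None:
        return base  # introspector unplugged: base behaviour is unchanged
    merged = dict(base)
    for tok, w in introspect(query, instruction).items():
        merged[tok] = merged.get(tok, 0.0) + weight * w
    return merged


def retrieve(query, docs, instruction=None):
    """Rank documents by inner product with the (intent-aware) query."""
    q = intent_aware_query(query, instruction)
    return sorted(docs, key=lambda d: dot(q, encode(d)), reverse=True)


if __name__ == "__main__":
    docs = ["python snake habitat guide", "python programming tutorial guide"]
    print(retrieve("python guide", docs))
    print(retrieve("python guide", docs, instruction="find a programming tutorial"))
```

In this toy setup the same query can rank documents differently depending on the instruction, while the encoder itself stays untouched. The paper's actual introspector, training procedure, and integration mechanism are not specified in the abstract and are not reproduced here.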

