IRAIDCApr 18, 2024

De-DSI: Decentralised Differentiable Search Index

arXiv:2404.12237v23 citationsh-index: 3EuroMLSys@EuroSys
Originality Incremental advance
AI Analysis

This addresses the challenge of scalable and efficient information retrieval in decentralized systems, offering a novel approach that could reduce reliance on intermediaries, though it appears incremental in its adaptation of existing DSI concepts.

The study tackled the problem of decentralized information retrieval by introducing De-DSI, a framework that combines large language models with decentralized differentiable search indices, achieving retrieval success comparable to centralized methods while distributing computational complexity across a network.

This study introduces De-DSI, a novel framework that fuses large language models (LLMs) with genuine decentralization for information retrieval, particularly employing the differentiable search index (DSI) concept in a decentralized setting. Focused on efficiently connecting novel user queries with document identifiers without direct document access, De-DSI operates solely on query-docid pairs. To enhance scalability, an ensemble of DSI models is introduced, where the dataset is partitioned into smaller shards for individual model training. This approach not only maintains accuracy by reducing the number of data each model needs to handle but also facilitates scalability by aggregating outcomes from multiple models. This aggregation uses a beam search to identify top docids and applies a softmax function for score normalization, selecting documents with the highest scores for retrieval. The decentralized implementation demonstrates that retrieval success is comparable to centralized methods, with the added benefit of the possibility of distributing computational complexity across the network. This setup also allows for the retrieval of multimedia items through magnet links, eliminating the need for platforms or intermediaries.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes