IR, Jan 24, 2022

Patapsco: A Python Framework for Cross-Language Information Retrieval Experiments

arXiv:2201.09996v1, 18 citations
AI Analysis

Patapsco gives information retrieval researchers a tool for running reproducible CLIR experiments. The contribution is incremental, however, since it builds on existing frameworks by adding cross-language support.

The authors address the lack of software frameworks for cross-language information retrieval (CLIR) with Patapsco, a Python framework built to manage the complexity of multi-language experiments. They report results on standard CLIR collections under multiple settings.

While there are high-quality software frameworks for information retrieval experimentation, they do not explicitly support cross-language information retrieval (CLIR). To fill this gap, we have created Patapsco, a Python CLIR framework. This framework specifically addresses the complexity that comes with running experiments in multiple languages. Patapsco is designed to be extensible to many language pairs, to be scalable to large document collections, and to support reproducible experiments driven by a configuration file. We include Patapsco results on standard CLIR collections using multiple settings.

Code Implementations: 1 repo