IRCLApr 11, 2024

Extending Translate-Train for ColBERT-X to African Language CLIR

arXiv:2404.08134v11 citationsh-index: 32Fire
Originality Synthesis-oriented
AI Analysis

This work addresses cross-language information retrieval for African languages, but it is incremental as it applies existing methods to a new dataset.

The authors tackled cross-language information retrieval for African languages by using machine translation to translate documents and training passages, and employing ColBERT-X as the retrieval model, achieving results submitted for the CIRAL CLIR tasks at FIRE 2023.

This paper describes the submission runs from the HLTCOE team at the CIRAL CLIR tasks for African languages at FIRE 2023. Our submissions use machine translation models to translate the documents and the training passages, and ColBERT-X as the retrieval model. Additionally, we present a set of unofficial runs that use an alternative training procedure with a similar training setting.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes