Extending Translate-Train for ColBERT-X to African Language CLIR
This work addresses cross-language information retrieval for African languages, though its contribution is incremental: it applies existing methods to a new dataset.
The authors tackle cross-language information retrieval for African languages by machine-translating the documents and the training passages and using ColBERT-X as the retrieval model. The resulting runs were submitted to the CIRAL CLIR tasks at FIRE 2023.
This paper describes the runs submitted by the HLTCOE team to the CIRAL CLIR tasks for African languages at FIRE 2023. Our submissions use machine translation models to translate the documents and the training passages, with ColBERT-X as the retrieval model. Additionally, we present a set of unofficial runs that use an alternative training procedure under a similar training setting.
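As a rough illustration of the retrieval side, the sketch below shows late-interaction (MaxSim) scoring of the kind used by ColBERT-style models, the family ColBERT-X belongs to. It is not the HLTCOE implementation: the function names `maxsim_score` and `rank` and the hand-written toy embeddings are hypothetical, and a real system would take token embeddings from a multilingual encoder and use an index rather than exhaustive scoring.

```python
import math


def normalize(vec):
    """Scale a vector to unit length so dot products equal cosine similarity."""
    norm = math.sqrt(sum(x * x for x in vec))
    return [x / norm for x in vec] if norm > 0 else list(vec)


def maxsim_score(query_embs, doc_embs):
    """Late-interaction score: for each query token embedding, take its best
    cosine match among the document token embeddings, then sum over query tokens."""
    q = [normalize(v) for v in query_embs]
    d = [normalize(v) for v in doc_embs]
    score = 0.0
    for qv in q:
        score += max(sum(a * b for a, b in zip(qv, dv)) for dv in d)
    return score


def rank(query_embs, docs):
    """Score every document (hypothetical dict of doc_id -> token embeddings)
    and return (doc_id, score) pairs sorted from best to worst."""
    scored = [(doc_id, maxsim_score(query_embs, embs)) for doc_id, embs in docs.items()]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


if __name__ == "__main__":
    # Toy 2-dimensional "token embeddings", purely illustrative.
    query = [[1.0, 0.0], [0.0, 1.0]]
    docs = {
        "d1": [[1.0, 0.0], [0.0, 1.0]],  # matches both query tokens
        "d2": [[1.0, 0.0]],              # matches only the first query token
    }
    for doc_id, score in rank(query, docs):
        print(doc_id, round(score, 3))
```

In a translate-train setting such as the one described above, the training passages would be machine-translated before the encoder is fine-tuned; the scoring function itself is unchanged by that choice.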