CL · Aug 13, 2025

UWBa at SemEval-2025 Task 7: Multilingual and Crosslingual Fact-Checked Claim Retrieval

arXiv:2508.09517v1 · 1 citation · h-index: 13
Originality: Synthesis-oriented
AI Analysis

This work addresses fact-checked claim retrieval across languages, but its contribution is incremental: it applies existing embedding models without major methodological innovation.

The paper tackles multilingual and crosslingual fact-checked claim retrieval with a zero-shot system that uses large language models to produce text embeddings. It placed 7th in the monolingual subtask and 9th in the cross-lingual subtask.

This paper presents a zero-shot system for fact-checked claim retrieval. We employed several state-of-the-art large language models to obtain text embeddings, then combined the models to obtain the best possible result. Our approach achieved 7th place in the monolingual subtask and 9th place in the cross-lingual subtask. We used only English translations as input to the text embedding models, since multilingual models did not achieve satisfactory results. We identified the most relevant claims for each post by computing the cosine similarity between post and claim embeddings. Overall, the best results were obtained by the NVIDIA NV-Embed-v2 model. For some languages, we benefited from model combinations (NV-Embed & GPT or Mistral).
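The retrieval step described in the abstract (embed posts and claims, rank claims by cosine similarity, optionally combine several models) can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: the embeddings are tiny hand-made vectors standing in for real model outputs, the function names are hypothetical, and combining models by a weighted average of their similarity matrices is an assumption, since the abstract does not say how the models were combined.

```python
import numpy as np


def l2_normalize(x: np.ndarray) -> np.ndarray:
    """Scale each row to unit length (guarding against zero vectors)."""
    norms = np.linalg.norm(x, axis=1, keepdims=True)
    return x / np.clip(norms, 1e-12, None)


def cosine_similarity_matrix(posts: np.ndarray, claims: np.ndarray) -> np.ndarray:
    """Return a (num_posts, num_claims) matrix of cosine similarities."""
    return l2_normalize(posts) @ l2_normalize(claims).T


def combine_similarities(sims, weights=None) -> np.ndarray:
    """Weighted average of per-model similarity matrices.

    ASSUMPTION: averaging similarities is one plausible way to combine
    models; the source does not specify the combination method.
    """
    stacked = np.stack(sims)  # (num_models, num_posts, num_claims)
    w = np.ones(len(sims)) if weights is None else np.asarray(weights, dtype=float)
    w = w / w.sum()
    return np.tensordot(w, stacked, axes=1)  # (num_posts, num_claims)


def top_k_claims(sim: np.ndarray, k: int) -> np.ndarray:
    """Indices of the k most similar claims for each post, best first."""
    k = min(k, sim.shape[1])
    return np.argsort(-sim, axis=1, kind="stable")[:, :k]


# Toy embeddings from two hypothetical models with different dimensions.
posts_a = np.array([[1.0, 0.0], [0.0, 1.0]])
claims_a = np.array([[1.0, 0.1], [0.1, 1.0], [-1.0, 0.0]])

posts_b = np.array([[1.0, 0.0, 0.0], [0.0, 1.0, 0.0]])
claims_b = np.array([[0.9, 0.1, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]])

sim_a = cosine_similarity_matrix(posts_a, claims_a)
sim_b = cosine_similarity_matrix(posts_b, claims_b)
combined = combine_similarities([sim_a, sim_b])

ranked = top_k_claims(combined, k=2)
print(ranked)
```

Because each model's similarity matrix has the same shape (posts by claims), models with different embedding dimensions can be combined at the score level without aligning their vector spaces.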

