CLOct 10, 2025

Hierarchical Indexing with Knowledge Enrichment for Multilingual Video Corpus Retrieval

arXiv:2510.09553v1h-index: 2NLPCC
Originality Incremental advance
AI Analysis

This addresses the challenge of multilingual video corpus retrieval for medical applications, offering a scalable solution for specialized collections.

The paper tackles the problem of retrieving relevant instructional videos from multilingual medical archives by developing a multi-stage framework that integrates multilingual semantics, domain terminology, and efficient long-form processing, achieving state-of-the-art performance on the mVCR test set.

Retrieving relevant instructional videos from multilingual medical archives is crucial for answering complex, multi-hop questions across language boundaries. However, existing systems either compress hour-long videos into coarse embeddings or incur prohibitive costs for fine-grained matching. We tackle the Multilingual Video Corpus Retrieval (mVCR) task in the NLPCC-2025 M4IVQA challenge with a multi-stage framework that integrates multilingual semantics, domain terminology, and efficient long-form processing. Video subtitles are divided into semantically coherent chunks, enriched with concise knowledge-graph (KG) facts, and organized into a hierarchical tree whose node embeddings are generated by a language-agnostic multilingual encoder. At query time, the same encoder embeds the input question; a coarse-to-fine tree search prunes irrelevant branches, and only the top-ranked chunks are re-scored by a lightweight large language model (LLM). This design avoids exhaustive cross-encoder scoring while preserving chunk-level precision. Experiments on the mVCR test set demonstrate state-of-the-art performance, and ablation studies confirm the complementary contributions of KG enrichment, hierarchical indexing, and targeted LLM re-ranking. The proposed method offers an accurate and scalable solution for multilingual retrieval in specialized medical video collections.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes