CLApr 23, 2022

WikiMulti: a Corpus for Cross-Lingual Summarization

arXiv:2204.11104v14 citationsh-index: 13Has Code
Originality Synthesis-oriented
AI Analysis

This work provides a resource for researchers in cross-lingual summarization, but it is incremental as it focuses on dataset creation and baseline evaluation.

The authors introduced WikiMulti, a new dataset for cross-lingual summarization based on Wikipedia articles in 15 languages, and evaluated existing methods as baselines for further studies.

Cross-lingual summarization (CLS) is the task to produce a summary in one particular language for a source document in a different language. We introduce WikiMulti - a new dataset for cross-lingual summarization based on Wikipedia articles in 15 languages. As a set of baselines for further studies, we evaluate the performance of existing cross-lingual abstractive summarization methods on our dataset. We make our dataset publicly available here: https://github.com/tikhonovpavel/wikimulti

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes