CLMar 29, 2023

Summarizing Indian Languages using Multilingual Transformers based Models

arXiv:2303.16657v112 citationsh-index: 11
Originality Synthesis-oriented
AI Analysis

This work addresses summarization in low-resource Indian languages, but it is incremental as it applies existing models to available datasets.

The study evaluated multilingual transformer models (IndicBART and mT5) for summarizing Indian languages, reporting ROUGE-1 to ROUGE-4 scores as performance metrics.

With the advent of multilingual models like mBART, mT5, IndicBART etc., summarization in low resource Indian languages is getting a lot of attention now a days. But still the number of datasets is low in number. In this work, we (Team HakunaMatata) study how these multilingual models perform on the datasets which have Indian languages as source and target text while performing summarization. We experimented with IndicBART and mT5 models to perform the experiments and report the ROUGE-1, ROUGE-2, ROUGE-3 and ROUGE-4 scores as a performance metric.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes