CLMar 29, 2023

Summarizing Indian Languages using Multilingual Transformers based Models

arXiv:2303.16657v11.312 citationsh-index: 11

Originality Synthesis-oriented

AI Analysis

This work addresses summarization in low-resource Indian languages, but it is incremental as it applies existing models to available datasets.

The study evaluated multilingual transformer models (IndicBART and mT5) for summarizing Indian languages, reporting ROUGE-1 to ROUGE-4 scores as performance metrics.

With the advent of multilingual models like mBART, mT5, IndicBART etc., summarization in low resource Indian languages is getting a lot of attention now a days. But still the number of datasets is low in number. In this work, we (Team HakunaMatata) study how these multilingual models perform on the datasets which have Indian languages as source and target text while performing summarization. We experimented with IndicBART and mT5 models to perform the experiments and report the ROUGE-1, ROUGE-2, ROUGE-3 and ROUGE-4 scores as a performance metric.

View on arXiv PDF

Similar