CLJun 20, 2023

One model to rule them all: ranking Slovene summarizers

arXiv:2306.11518v21.33 citationsh-index: 24

Originality Synthesis-oriented

AI Analysis

This work addresses the challenge of model selection for text summarization in a less-resourced language, but it is incremental as it builds on existing models without introducing a new paradigm.

The authors tackled the problem of selecting the best text summarization model for Slovene by proposing SloMetaSum, a system that uses a neural network to recommend the most suitable model based on input text properties, achieving successful automation of manual model selection.

Text summarization is an essential task in natural language processing, and researchers have developed various approaches over the years, ranging from rule-based systems to neural networks. However, there is no single model or approach that performs well on every type of text. We propose a system that recommends the most suitable summarization model for a given text. The proposed system employs a fully connected neural network that analyzes the input content and predicts which summarizer should score the best in terms of ROUGE score for a given input. The meta-model selects among four different summarization models, developed for the Slovene language, using different properties of the input, in particular its Doc2Vec document representation. The four Slovene summarization models deal with different challenges associated with text summarization in a less-resourced language. We evaluate the proposed SloMetaSum model performance automatically and parts of it manually. The results show that the system successfully automates the step of manually selecting the best model.

View on arXiv PDF

Similar