CLMar 5, 2021

Multi-document Summarization using Semantic Role Labeling and Semantic Graph for Indonesian News Article

arXiv:2103.03736v15 citations
Originality Synthesis-oriented
AI Analysis

This work addresses summarization for Indonesian news, an incremental improvement over existing methods in a domain-specific context.

The paper tackled multi-document summarization for Indonesian news articles by proposing a system that uses semantic role labeling and a semantic graph, replacing SVOA extraction with an SRL model and a genetic algorithm with a decision tree classifier. The result was improved performance, achieving 0.313 average ROUGE-2 recall for 100-word summaries and 0.394 for 200-word summaries.

In this paper, we proposed a multi-document summarization system using semantic role labeling (SRL) and semantic graph for Indonesian news articles. In order to improve existing summarizer, our system modified summarizer that employed subject, predicate, object, and adverbial (SVOA) extraction for predicate argument structure (PAS) extraction. SVOA extraction is replaced with SRL model for Indonesian. We also replace the genetic algorithm to identify important PAS with the decision tree classifier since the summarizer without genetic algorithm gave better performance. The decision tree model is employed to identify important PAS. The decision tree model with 10 features achieved better performance than decision tree with 4 sentence features. Experiments and evaluations are conducted to generate 100 words summary and 200 words summary. The evaluation shows the proposed model get 0.313 average ROUGE-2 recall in 100 words summary and 0.394 average ROUGE-2 recall in 200 words summary.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes