IR, CL | Jan 11, 2012

Bengali text summarization by sentence extraction

arXiv:1201.2240v1 | 76 citations
Originality: Synthesis-oriented
AI Analysis

This work addresses the need for text summarization tools in Bengali, an under-resourced language, but it is incremental because it applies existing sentence extraction methods to a new language.

The paper tackles the problem of automatic text summarization for Bengali by developing a method that extracts important sentences from documents to produce summaries, addressing the lack of existing techniques for this language.

Text summarization is the process of producing an abstract or a summary by selecting a significant portion of the information from one or more texts. In automatic text summarization, a text is given to the computer, and the computer returns a shorter, less redundant extract or abstract of the original text(s). Many techniques have been developed for summarizing English text, but very few attempts have been made at Bengali text summarization. This paper presents a method for Bengali text summarization that extracts important sentences from a Bengali document to produce a summary.
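To make the idea of summarization by sentence extraction concrete, here is a minimal Python sketch. It is not the paper's method, which the abstract does not detail. It assumes a generic scheme: split the text into sentences at the Bengali danda (।) or Latin terminators, score each sentence by the average document frequency of its words, and return the top-scoring sentences in their original order. The function names `split_sentences` and `summarize` and the parameter `max_sentences` are hypothetical.

```python
import re
from collections import Counter


def split_sentences(text):
    """Split text at the Bengali danda (।) or at '.', '!' and '?'."""
    parts = re.split(r"[।.!?]+", text)
    return [p.strip() for p in parts if p.strip()]


def summarize(text, max_sentences=2):
    """Return up to max_sentences extracted sentences, in document order.

    Assumption: a sentence is scored by the average frequency of its
    whitespace-separated words across the whole document. This is a generic
    baseline, not the scoring used in the paper.
    """
    sentences = split_sentences(text)
    tokenized = [s.split() for s in sentences]

    # Count how often each word occurs in the whole document.
    freq = Counter(word for words in tokenized for word in words)

    # Average word frequency per sentence (every sentence has >= 1 word).
    scores = [sum(freq[w] for w in words) / len(words) for words in tokenized]

    # Pick the highest-scoring sentences; ties go to the earlier sentence.
    ranked = sorted(range(len(sentences)), key=lambda i: (-scores[i], i))
    chosen = sorted(ranked[:max_sentences])
    return [sentences[i] for i in chosen]


if __name__ == "__main__":
    sample = "cats chase mice. dogs bark. cats chase dogs. birds sing"
    for sentence in summarize(sample, max_sentences=2):
        print(sentence)
```

A practical system for Bengali would likely add stop-word removal, stemming, and positional features. The sketch leaves these out to keep the extraction step itself visible.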

