IRAICLOct 11, 2012

Artex is AnotheR TEXt summarizer

arXiv:1210.3312v119 citations
Originality Synthesis-oriented
AI Analysis

This is an incremental improvement for text summarization tasks, potentially benefiting researchers and practitioners in natural language processing.

The paper tackles automatic text summarization by introducing Artex, which ranks sentences using inner products with document and lexical vectors and assembles top-ranked sentences without rule-based post-processing, achieving interesting results on multilingual datasets from DUC and TAC.

This paper describes Artex, another algorithm for Automatic Text Summarization. In order to rank sentences, a simple inner product is calculated between each sentence, a document vector (text topic) and a lexical vector (vocabulary used by a sentence). Summaries are then generated by assembling the highest ranked sentences. No ruled-based linguistic post-processing is necessary in order to obtain summaries. Tests over several datasets (coming from Document Understanding Conferences (DUC), Text Analysis Conferences (TAC), evaluation campaigns, etc.) in French, English and Spanish have shown that summarizer achieves interesting results.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes