CL · Oct 22, 2020

Summarizing Utterances from Japanese Assembly Minutes using Political Sentence-BERT-based Method for QA Lab-PoliInfo-2 Task of NTCIR-15

arXiv:2010.12077v1 · 6 citations
Originality: Synthesis-oriented
AI Analysis

This work addresses the need for efficient summarization of lengthy political discussions in Japanese, though it appears incremental as it adapts existing methods to a specific domain.

The paper tackles the problem of summarizing utterances from Japanese political meetings. The authors build a Japanese Political Sentence-BERT model and use it to summarize utterances without labeled data in the NTCIR-15 QA Lab-PoliInfo-2 task.

Political meetings involve many discussions, and their transcripts contain a large number of utterances on various topics. To follow speakers' intentions or opinions on a given topic, we would need to read all of them. To avoid this costly and time-consuming process for often lengthy discussions, NLP researchers work on generating concise summaries of utterances. The summarization subtask of the QA Lab-PoliInfo-2 task at NTCIR-15 addresses this problem for Japanese utterances in assembly minutes, and our team (SKRA) participated in this subtask. As a first step toward summarizing utterances, we created a new pre-trained sentence embedding model, the Japanese Political Sentence-BERT. With this model, we summarize utterances without labelled data. This paper describes our approach to solving the task and discusses its results.
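The abstract does not spell out how sentence embeddings are turned into a summary, so the following is only a minimal sketch of one common label-free approach, centroid-based extractive selection, and should not be read as the SKRA team's actual method. The `embed` function is a hypothetical toy stand-in (whitespace bag-of-words counts) for a real sentence encoder such as a Sentence-BERT model. Real Japanese text would also need a proper tokenizer, since it is not whitespace-delimited. The names `embed`, `cosine`, and `summarize` are illustrative assumptions.

```python
import math
from collections import Counter


def embed(sentence):
    """Hypothetical stand-in for a sentence encoder: bag-of-words counts."""
    return Counter(sentence.lower().split())


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def summarize(sentences, k=1):
    """Pick the k sentences closest to the centroid of all sentence vectors.

    No labelled data is used: the centroid acts as a rough proxy for the
    overall topic, and selected sentences keep their original order.
    """
    vectors = [embed(s) for s in sentences]
    centroid = Counter()
    for v in vectors:
        centroid.update(v)  # sum of vectors; scaling does not affect cosine
    scores = [cosine(v, centroid) for v in vectors]
    top = sorted(range(len(sentences)), key=lambda i: scores[i], reverse=True)[:k]
    return [sentences[i] for i in sorted(top)]


if __name__ == "__main__":
    utterances = [
        "the budget plan needs review",
        "we should review the budget plan for schools",
        "lunch was served at noon",
    ]
    print(summarize(utterances, k=1))
```

Swapping `embed` for a dense encoder would only require replacing the sparse `Counter` arithmetic with vector operations; the selection logic stays the same.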

