CLOct 22, 2020

Summarizing Utterances from Japanese Assembly Minutes using Political Sentence-BERT-based Method for QA Lab-PoliInfo-2 Task of NTCIR-15

Daiki Shirafuji, Hiromichi Kameya, Rafal Rzepka, Kenji Araki

arXiv:2010.12077v10.56 citations

Originality Synthesis-oriented

AI Analysis

This work addresses the need for efficient summarization of lengthy political discussions in Japanese, though it appears incremental as it adapts existing methods to a specific domain.

The paper tackled the problem of summarizing Japanese political meeting utterances by creating a Japanese Political Sentence-BERT model, achieving results in the NTCIR-15 QA Lab-PoliInfo-2 task without labeled data.

There are many discussions held during political meetings, and a large number of utterances for various topics is included in their transcripts. We need to read all of them if we want to follow speakers\' intentions or opinions about a given topic. To avoid such a costly and time-consuming process to grasp often longish discussions, NLP researchers work on generating concise summaries of utterances. Summarization subtask in QA Lab-PoliInfo-2 task of the NTCIR-15 addresses this problem for Japanese utterances in assembly minutes, and our team (SKRA) participated in this subtask. As a first step for summarizing utterances, we created a new pre-trained sentence embedding model, i.e. the Japanese Political Sentence-BERT. With this model, we summarize utterances without labelled data. This paper describes our approach to solving the task and discusses its results.

View on arXiv PDF

Similar