LGCLJun 24, 2021

Unsupervised Topic Segmentation of Meetings with BERT Embeddings

arXiv:2106.12978v161 citations
Originality Incremental advance
AI Analysis

This addresses the challenge of topic segmentation in meetings, which is important for applications like summarization and analysis, but the approach is incremental as it builds on previous unsupervised methods with pre-trained models.

The paper tackled the problem of topic segmentation for meeting transcripts by introducing an unsupervised approach using BERT embeddings, achieving a 15.5% reduction in error rate over existing unsupervised methods on two datasets.

Topic segmentation of meetings is the task of dividing multi-person meeting transcripts into topic blocks. Supervised approaches to the problem have proven intractable due to the difficulties in collecting and accurately annotating large datasets. In this paper we show how previous unsupervised topic segmentation methods can be improved using pre-trained neural architectures. We introduce an unsupervised approach based on BERT embeddings that achieves a 15.5% reduction in error rate over existing unsupervised approaches applied to two popular datasets for meeting transcripts.

Code Implementations2 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes