How Domain Terminology Affects Meeting Summarization Performance
This paper addresses the challenge of improving meeting summarization for organizations that deal with domain-specific content. Its contribution is incremental: it analyzes an existing bottleneck rather than introducing a new method.
The paper examines how domain terminology affects meeting summarization performance. The authors create gold-standard jargon annotations on a meeting corpus and compare system performance with and without these terms, finding that domain terminology has a substantial impact.
Meetings are essential to modern organizations. Numerous meetings are held and recorded daily, more than anyone can fully review. A meeting summarization system that identifies salient utterances in the transcripts and automatically generates meeting minutes can help, enabling users to rapidly search and sift through large meeting collections. To date, the impact of domain terminology on meeting summarization performance remains understudied, even though meetings are rich in domain knowledge. In this paper, we create gold-standard annotations for domain terminology, known as jargon terms, on a sizable meeting corpus. We then analyze the performance of a meeting summarization system with and without jargon terms. Our findings reveal that domain terminology can have a substantial impact on summarization performance. We publicly release all domain terminology annotations to advance research in meeting summarization.