CL · AI · Feb 5, 2025

MEETING DELEGATE: Benchmarking LLMs on Attending Meetings on Our Behalf

arXiv:2502.04376v1 · 2 citations · h-index: 28 · Proceedings of the Fourth Workshop on Bridging Human-Computer Interaction and Natural Language Processing (HCI+NLP)
Originality: Synthesis-oriented
AI Analysis

This work addresses time-consuming, inefficient meetings in workplace teams. Its contribution is incremental: it benchmarks existing LLMs rather than introducing new methods.

The paper asks whether LLMs can effectively act as delegates for meeting participants. It develops a prototype system and a benchmark built from real meeting transcripts, and finds that about 60% of responses address at least one key point from the ground truth. Improvements are still needed to reduce irrelevant or repetitive content and to tolerate transcription errors.

In contemporary workplaces, meetings are essential for exchanging ideas and ensuring team alignment but often face challenges such as time consumption, scheduling conflicts, and inefficient participation. Recent advancements in Large Language Models (LLMs) have demonstrated their strong capabilities in natural language generation and reasoning, prompting the question: can LLMs effectively delegate participants in meetings? To explore this, we develop a prototype LLM-powered meeting delegate system and create a comprehensive benchmark using real meeting transcripts. Our evaluation reveals that GPT-4/4o maintain balanced performance between active and cautious engagement strategies. In contrast, Gemini 1.5 Pro tends to be more cautious, while Gemini 1.5 Flash and Llama3-8B/70B display more active tendencies. Overall, about 60% of responses address at least one key point from the ground truth. However, improvements are needed to reduce irrelevant or repetitive content and enhance tolerance for transcription errors commonly found in real-world settings. Additionally, we implement the system in practical settings and collect real-world feedback from demos. Our findings underscore the potential and challenges of utilizing LLMs as meeting delegates, offering valuable insights into their practical application for alleviating the burden of meetings.

