CLAICVJun 3, 2025

DIAMOND: An LLM-Driven Agent for Context-Aware Baseball Highlight Summarization

arXiv:2506.02351v12 citationsProceedings of the 1st Workshop for Research on Agent Language Models (REALM 2025)
Originality Incremental advance
AI Analysis

This work addresses the need for scalable and rich highlight summarization in sports, though it is incremental as it builds on existing methods with a hybrid approach.

The paper tackled the problem of generating context-aware baseball highlight summaries by introducing DIAMOND, an LLM-driven agent that integrates structured sports analytics with natural language reasoning, improving F1-score from 42.9% to 84.8% on Korean Baseball Organization League games.

Traditional approaches -- such as Win Probability Added (WPA)-based ranking or computer vision-driven event detection -- can identify scoring plays but often miss strategic depth, momentum shifts, and storyline progression. Manual curation remains the gold standard but is resource-intensive and not scalable. We introduce DIAMOND, an LLM-driven agent for context-aware baseball highlight summarization that integrates structured sports analytics with natural language reasoning. DIAMOND leverages sabermetric features -- Win Expectancy, WPA, and Leverage Index -- to quantify play importance, while an LLM module enhances selection based on contextual narrative value. This hybrid approach ensures both quantitative rigor and qualitative richness, surpassing the limitations of purely statistical or vision-based systems. Evaluated on five diverse Korean Baseball Organization League games, DIAMOND improves F1-score from 42.9% (WPA-only) to 84.8%, outperforming both commercial and statistical baselines. Though limited in scale, our results highlight the potential of modular, interpretable agent-based frameworks for event-level summarization in sports and beyond.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes