CVAIDec 11, 2025

SoccerMaster: A Vision Foundation Model for Soccer Understanding

arXiv:2512.11016v16 citations
Originality Incremental advance
AI Analysis

This work addresses the domain-specific complexity of soccer understanding for researchers and practitioners, offering a scalable and superior alternative to isolated models.

The paper tackles the problem of soccer visual understanding by proposing SoccerMaster, a unified vision foundation model that outperforms task-specific expert models across diverse downstream tasks, such as athlete detection and event classification.

Soccer understanding has recently garnered growing research interest due to its domain-specific complexity and unique challenges. Unlike prior works that typically rely on isolated, task-specific expert models, this work aims to propose a unified model to handle diverse soccer visual understanding tasks, ranging from fine-grained perception (e.g., athlete detection) to semantic reasoning (e.g., event classification). Specifically, our contributions are threefold: (i) we present SoccerMaster, the first soccer-specific vision foundation model that unifies diverse understanding tasks within a single framework via supervised multi-task pretraining; (ii) we develop an automated data curation pipeline to generate scalable spatial annotations, and integrate them with various existing soccer video datasets to construct SoccerFactory, a comprehensive pretraining data resource; and (iii) we conduct extensive evaluations demonstrating that SoccerMaster consistently outperforms task-specific expert models across diverse downstream tasks, highlighting its breadth and superiority. The data, code, and model will be publicly available.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes