CVOct 15, 2025

Synchronization of Multiple Videos

arXiv:2510.14051v13 citationsh-index: 21Has Code
Originality Incremental advance
AI Analysis

This addresses synchronization issues for multiple generative AI videos depicting the same action, representing an incremental advancement in video processing.

The paper tackles the challenge of synchronizing videos from different scenes or generative AI sources by proposing Temporal Prototype Learning (TPL), which improves synchronization accuracy, efficiency, and robustness across diverse datasets.

Synchronizing videos captured simultaneously from multiple cameras in the same scene is often easy and typically requires only simple time shifts. However, synchronizing videos from different scenes or, more recently, generative AI videos, poses a far more complex challenge due to diverse subjects, backgrounds, and nonlinear temporal misalignment. We propose Temporal Prototype Learning (TPL), a prototype-based framework that constructs a shared, compact 1D representation from high-dimensional embeddings extracted by any of various pretrained models. TPL robustly aligns videos by learning a unified prototype sequence that anchors key action phases, thereby avoiding exhaustive pairwise matching. Our experiments show that TPL improves synchronization accuracy, efficiency, and robustness across diverse datasets, including fine-grained frame retrieval and phase classification tasks. Importantly, TPL is the first approach to mitigate synchronization issues in multiple generative AI videos depicting the same action. Our code and a new multiple video synchronization dataset are available at https://bgu-cs-vil.github.io/TPL/

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes