CVAIDec 6, 2024

MotionShop: Zero-Shot Motion Transfer in Video Diffusion Models with Mixture of Score Guidance

arXiv:2412.05355v19 citationsh-index: 11
Originality Highly original
AI Analysis

This work addresses motion transfer in video generation for AI and creative applications, presenting a novel method for a known bottleneck.

The authors tackled the problem of zero-shot motion transfer in video diffusion models by proposing Mixture of Score Guidance (MSG), which decomposes motion and content scores to preserve scene composition and enable creative transformations, achieving successful handling of diverse scenarios including single/multiple objects and complex camera motions.

In this work, we propose the first motion transfer approach in diffusion transformer through Mixture of Score Guidance (MSG), a theoretically-grounded framework for motion transfer in diffusion models. Our key theoretical contribution lies in reformulating conditional score to decompose motion score and content score in diffusion models. By formulating motion transfer as a mixture of potential energies, MSG naturally preserves scene composition and enables creative scene transformations while maintaining the integrity of transferred motion patterns. This novel sampling operates directly on pre-trained video diffusion models without additional training or fine-tuning. Through extensive experiments, MSG demonstrates successful handling of diverse scenarios including single object, multiple objects, and cross-object motion transfer as well as complex camera motion transfer. Additionally, we introduce MotionBench, the first motion transfer dataset consisting of 200 source videos and 1000 transferred motions, covering single/multi-object transfers, and complex camera motions.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes