CV GR LGFeb 4, 2025

Movie Weaver: Tuning-Free Multi-Concept Video Personalization with Anchored Prompts

Feng Liang, Haoyu Ma, Zecheng He, Tingbo Hou, Ji Hou, Kunpeng Li, Xiaoliang Dai, Felix Juefei-Xu, Samaneh Azadi, Animesh Sinha, Peizhao Zhang, Peter Vajda

arXiv:2502.07802v124.822 citationsh-index: 26CVPR

Originality Incremental advance

AI Analysis

This work addresses the limitation of single-concept personalization in video generation, enabling broader applications like combining faces, bodies, and animals, though it is incremental as it builds on existing personalization techniques.

The paper tackles the problem of multi-concept video personalization, where prior methods suffer from identity blending when integrating multiple reference images, and introduces anchored prompts and concept embeddings to achieve accurate referencing and order encoding, resulting in improved identity preservation and overall quality compared to existing methods.

Video personalization, which generates customized videos using reference images, has gained significant attention. However, prior methods typically focus on single-concept personalization, limiting broader applications that require multi-concept integration. Attempts to extend these models to multiple concepts often lead to identity blending, which results in composite characters with fused attributes from multiple sources. This challenge arises due to the lack of a mechanism to link each concept with its specific reference image. We address this with anchored prompts, which embed image anchors as unique tokens within text prompts, guiding accurate referencing during generation. Additionally, we introduce concept embeddings to encode the order of reference images. Our approach, Movie Weaver, seamlessly weaves multiple concepts-including face, body, and animal images-into one video, allowing flexible combinations in a single model. The evaluation shows that Movie Weaver outperforms existing methods for multi-concept video personalization in identity preservation and overall quality.

View on arXiv PDF

Similar