CVOct 30, 2024

LumiSculpt: Enabling Consistent Portrait Lighting in Video Generation

Yuxin Zhang, Dandan Zheng, Biao Gong, Shiwen Wang, Jingdong Chen, Ming Yang, Weiming Dong, Changsheng Xu

arXiv:2410.22979v25 citationsh-index: 17

Originality Incremental advance

AI Analysis

This addresses the problem of limited lighting control in video generation for applications like portrait videos, though it appears incremental by building on existing T2I models.

The paper tackles the challenge of disentangling and modeling coherent lighting conditions in video generation by proposing LumiSculpt, which enables precise and consistent lighting control in text-to-video models, achieving high-quality results as demonstrated experimentally.

Lighting plays a pivotal role in ensuring the naturalness and aesthetic quality of video generation. However, the impact of lighting is deeply coupled with other factors of videos, e.g., objects and scenes. Thus, it remains challenging to disentangle and model coherent lighting conditions independently, limiting the flexibility to control lighting in video generation. In this paper, inspired by the established controllable T2I models, we propose LumiSculpt, which enables precise and consistent lighting control in T2V generation models. LumiSculpt equips the video generation with new interactive capabilities, allowing the input of reference image sequences with customized lighting conditions. Furthermore, the core learnable plug-and-play module of LumiSculpt facilitates direct control over the intensity, position and trajectory of an assumed light source in video diffusion models. To effectively train LumiSculpt and address the issue of insufficient lighting data, we construct LumiHuman, a new lightweight and flexible dataset for portrait lighting of images and videos. Experimental results demonstrate that LumiSculpt achieves precise and high-quality lighting control in video generation. The analysis demonstrates the flexibility of LumiHuman.

View on arXiv PDF

Similar