CVMar 25, 2024

FlashFace: Human Image Personalization with High-fidelity Identity Preservation

arXiv:2403.17008v132 citationsh-index: 12
Originality Incremental advance
AI Analysis

This work provides a practical tool for users to customize photos with improved identity accuracy, though it is incremental in enhancing existing human photo personalization techniques.

The authors tackled the problem of personalizing human photos with high-fidelity identity preservation using reference face images and text prompts, achieving better detail retention and instruction following compared to prior methods.

This work presents FlashFace, a practical tool with which users can easily personalize their own photos on the fly by providing one or a few reference face images and a text prompt. Our approach is distinguishable from existing human photo customization methods by higher-fidelity identity preservation and better instruction following, benefiting from two subtle designs. First, we encode the face identity into a series of feature maps instead of one image token as in prior arts, allowing the model to retain more details of the reference faces (e.g., scars, tattoos, and face shape ). Second, we introduce a disentangled integration strategy to balance the text and image guidance during the text-to-image generation process, alleviating the conflict between the reference faces and the text prompts (e.g., personalizing an adult into a "child" or an "elder"). Extensive experimental results demonstrate the effectiveness of our method on various applications, including human image personalization, face swapping under language prompts, making virtual characters into real people, etc. Project Page: https://jshilong.github.io/flashface-page.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes