CVAIGRLGJul 13, 2023

HyperDreamBooth: HyperNetworks for Fast Personalization of Text-to-Image Models

Microsoft
arXiv:2307.06949v2259 citationsh-index: 35
AI Analysis

This addresses the time and storage bottlenecks for users needing fast, efficient personalization of AI-generated images, representing a strong incremental improvement over existing methods.

The paper tackles the problem of slow and memory-intensive personalization of text-to-image models by proposing HyperDreamBooth, a hypernetwork that generates personalized weights from a single image, achieving personalization in roughly 20 seconds (25x faster than DreamBooth) and reducing model size by 10,000x while maintaining quality.

Personalization has emerged as a prominent aspect within the field of generative AI, enabling the synthesis of individuals in diverse contexts and styles, while retaining high-fidelity to their identities. However, the process of personalization presents inherent challenges in terms of time and memory requirements. Fine-tuning each personalized model needs considerable GPU time investment, and storing a personalized model per subject can be demanding in terms of storage capacity. To overcome these challenges, we propose HyperDreamBooth - a hypernetwork capable of efficiently generating a small set of personalized weights from a single image of a person. By composing these weights into the diffusion model, coupled with fast finetuning, HyperDreamBooth can generate a person's face in various contexts and styles, with high subject details while also preserving the model's crucial knowledge of diverse styles and semantic modifications. Our method achieves personalization on faces in roughly 20 seconds, 25x faster than DreamBooth and 125x faster than Textual Inversion, using as few as one reference image, with the same quality and style diversity as DreamBooth. Also our method yields a model that is 10,000x smaller than a normal DreamBooth model. Project page: https://hyperdreambooth.github.io

Code Implementations2 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes