CVJun 24, 2024

FaceScore: Benchmarking and Enhancing Face Quality in Human Generation

arXiv:2406.17100v29 citationsHas Code
AI Analysis

This addresses a specific bottleneck in diffusion models for practical applications like image generation, but it is incremental as it builds on existing methods and metrics.

The paper tackles the problem of low-quality, unrealistic human faces in text-to-image generation by diffusion models, developing a novel metric called FaceScore that aligns better with human judgments and using it to enhance face generation through preference learning, resulting in improved face quality as verified in experiments.

Diffusion models (DMs) have achieved significant success in generating imaginative images given textual descriptions. However, they are likely to fall short when it comes to real-life scenarios with intricate details. The low-quality, unrealistic human faces in text-to-image generation are one of the most prominent issues, hindering the wide application of DMs in practice. Targeting addressing such an issue, we first assess the face quality of generations from popular pre-trained DMs with the aid of human annotators and then evaluate the alignment between existing metrics with human judgments. Observing that existing metrics can be unsatisfactory for quantifying face quality, we develop a novel metric named FaceScore (FS) by fine-tuning the widely used ImageReward on a dataset of (win, loss) face pairs cheaply crafted by an inpainting pipeline of DMs. Extensive studies reveal FS enjoys a superior alignment with humans. On the other hand, FS opens up the door for enhancing DMs for better face generation. With FS offering image ratings, we can easily perform preference learning algorithms to refine DMs like SDXL. Comprehensive experiments verify the efficacy of our approach for improving face quality. The code is released at https://github.com/OPPO-Mente-Lab/FaceScore.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes