CVAILGSep 25, 2023

UnitedHuman: Harnessing Multi-Source Data for High-Resolution Human Generation

arXiv:2309.14335v118 citationsh-index: 32
Originality Incremental advance
AI Analysis

This work improves human generation for applications like graphics and virtual reality, but it is incremental as it builds on existing GAN frameworks with specific enhancements for data alignment.

The paper tackles the problem of generating high-resolution humans by addressing insufficient local details in holistic datasets, proposing a method that uses multi-source data to achieve superior quality compared to single-dataset approaches.

Human generation has achieved significant progress. Nonetheless, existing methods still struggle to synthesize specific regions such as faces and hands. We argue that the main reason is rooted in the training data. A holistic human dataset inevitably has insufficient and low-resolution information on local parts. Therefore, we propose to use multi-source datasets with various resolution images to jointly learn a high-resolution human generative model. However, multi-source data inherently a) contains different parts that do not spatially align into a coherent human, and b) comes with different scales. To tackle these challenges, we propose an end-to-end framework, UnitedHuman, that empowers continuous GAN with the ability to effectively utilize multi-source data for high-resolution human generation. Specifically, 1) we design a Multi-Source Spatial Transformer that spatially aligns multi-source images to full-body space with a human parametric model. 2) Next, a continuous GAN is proposed with global-structural guidance and CutMix consistency. Patches from different datasets are then sampled and transformed to supervise the training of this scale-invariant generative model. Extensive experiments demonstrate that our model jointly learned from multi-source data achieves superior quality than those learned from a holistic dataset.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes