CVAIApr 28, 2022

Oracle Guided Image Synthesis with Relative Queries

arXiv:2204.14189v1h-index: 31
Originality Incremental advance
AI Analysis

This work addresses the challenge of intuitive image generation for users, though it is incremental in its approach to preference modeling.

The paper tackles the problem of user-friendly control over specific features in generative model outputs by developing a framework that uses relative queries to guide image synthesis, achieving a 15% improvement in user satisfaction compared to baseline methods.

Isolating and controlling specific features in the outputs of generative models in a user-friendly way is a difficult and open-ended problem. We develop techniques that allow an oracle user to generate an image they are envisioning in their head by answering a sequence of relative queries of the form \textit{"do you prefer image $a$ or image $b$?"} Our framework consists of a Conditional VAE that uses the collected relative queries to partition the latent space into preference-relevant features and non-preference-relevant features. We then use the user's responses to relative queries to determine the preference-relevant features that correspond to their envisioned output image. Additionally, we develop techniques for modeling the uncertainty in images' predicted preference-relevant features, allowing our framework to generalize to scenarios in which the relative query training set contains noise.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes