AI HCFeb 20, 2023

Affect-Conditioned Image Generation

Francisco Ibarrola, Rohan Lulham, Kazjon Grace

arXiv:2302.09742v15.45 citationsh-index: 18Has Code

Originality Incremental advance

AI Analysis

This work addresses the challenge for creators in computational co-creativity contexts who struggle to convey nuanced affective language in text prompts, offering a novel approach to enhance control over image generation.

The paper tackles the difficulty of generating images that evoke specific affective impressions using text-to-image models by introducing a method for conditioning image generation on desired affect, quantified through a psychometrically validated three-component approach, and demonstrates its application across various generative models with quantitative and qualitative analysis.

In creativity support and computational co-creativity contexts, the task of discovering appropriate prompts for use with text-to-image generative models remains difficult. In many cases the creator wishes to evoke a certain impression with the image, but the task of conferring that succinctly in a text prompt poses a challenge: affective language is nuanced, complex, and model-specific. In this work we introduce a method for generating images conditioned on desired affect, quantified using a psychometrically validated three-component approach, that can be combined with conditioning on text descriptions. We first train a neural network for estimating the affect content of text and images from semantic embeddings, and then demonstrate how this can be used to exert control over a variety of generative models. We show examples of how affect modifies the outputs, provide quantitative and qualitative analysis of its capabilities, and discuss possible extensions and use cases.

View on arXiv PDF Code

Similar