CVJun 24, 2019

GANalyze: Toward Visual Definitions of Cognitive Image Properties

arXiv:1906.10112v1348 citationsHas Code
Originality Synthesis-oriented
AI Analysis

This work addresses the lack of concrete visual definitions for cognitive image properties, which is a problem for researchers in psychology and computer vision, though it is incremental as it applies existing GAN methods to a new domain.

The paper tackles the problem of defining cognitive image properties like memorability, aesthetics, and emotional valence by using GANs to generate visual definitions, and it verifies through behavioral experiments that the discovered manipulations causally affect human memory performance.

We introduce a framework that uses Generative Adversarial Networks (GANs) to study cognitive properties like memorability, aesthetics, and emotional valence. These attributes are of interest because we do not have a concrete visual definition of what they entail. What does it look like for a dog to be more or less memorable? GANs allow us to generate a manifold of natural-looking images with fine-grained differences in their visual attributes. By navigating this manifold in directions that increase memorability, we can visualize what it looks like for a particular generated image to become more or less memorable. The resulting ``visual definitions" surface image properties (like ``object size") that may underlie memorability. Through behavioral experiments, we verify that our method indeed discovers image manipulations that causally affect human memory performance. We further demonstrate that the same framework can be used to analyze image aesthetics and emotional valence. Visit the GANalyze website at http://ganalyze.csail.mit.edu/.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes