CYAICVFeb 5, 2018

Dream Formulations and Deep Neural Networks: Humanistic Themes in the Iconology of the Machine-Learned Image

arXiv:1802.01274v113 citations
Originality Synthesis-oriented
AI Analysis

It addresses the problem of interpreting machine-learned images for researchers in computer vision and the humanities, proposing interdisciplinary approaches, but it is incremental in applying existing theories to new contexts.

This paper explores the interpretability of deep learning image recognition by comparing it to humanistic theories of visual perception, such as Panofsky's iconology and Rosch's categorization, finding surprising similarities that suggest benefits from arts-sciences collaboration.

This paper addresses the interpretability of deep learning-enabled image recognition processes in computer vision science in relation to theories in art history and cognitive psychology on the vision-related perceptual capabilities of humans. Examination of what is determinable about the machine-learned image in comparison to humanistic theories of visual perception, particularly in regard to art historian Erwin Panofsky's methodology for image analysis and psychologist Eleanor Rosch's theory of graded categorization according to prototypes, finds that there are surprising similarities between the two that suggest that researchers in the arts and the sciences would have much to benefit from closer collaborations. Utilizing the examples of Google's DeepDream and the Machine Learning and Perception Lab at Georgia Tech's Grad-CAM: Gradient-weighted Class Activation Mapping programs, this study suggests that a revival of art historical research in iconography and formalism in the age of AI is essential for shaping the future navigation and interpretation of all machine-learned images, given the rapid developments in image recognition technologies.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes