CV, LG | Aug 22, 2024

Finding Closure: A Closer Look at the Gestalt Law of Closure in Convolutional Neural Networks

Yuyan Zhang, Derya Soydaner, Lisa Koßmann, Fatemeh Behrad, Johan Wagemans

arXiv:2408.12460v1 | 5.23 citations | h-index: 9

Originality: Incremental advance

AI Analysis

This work addresses neural network interpretability and comparability to human vision, a question of interest to researchers in both AI and psychology. It is an incremental advance: it builds on prior studies of the Closure effect but applies a more systematic approach.

The paper investigates whether convolutional neural networks (CNNs) rely on the Gestalt law of closure, similar to human visual perception, by testing various CNNs on curated datasets for modal and amodal completion. The results show that VGG16 and DenseNet-121 exhibit the closure effect, while other CNNs yield variable outcomes.

The human brain has an inherent ability to fill in gaps to perceive figures as complete wholes, even when parts are missing or fragmented. This phenomenon is known as Closure in psychology, one of the Gestalt laws of perceptual organization, explaining how the human brain interprets visual stimuli. Given the importance of Closure for human object recognition, we investigate whether neural networks rely on a similar mechanism. Exploring this crucial human visual skill in neural networks has the potential to highlight their comparability to humans. Recent studies have examined the Closure effect in neural networks. However, they typically focus on a limited selection of Convolutional Neural Networks (CNNs) and have not reached a consensus on their capability to perform Closure. To address these gaps, we present a systematic framework for investigating the Closure principle in neural networks. We introduce well-curated datasets designed to test for Closure effects, including both modal and amodal completion. We then conduct experiments on various CNNs employing different measurements. Our comprehensive analysis reveals that VGG16 and DenseNet-121 exhibit the Closure effect, while other CNNs show variable results. We interpret these findings by blending insights from psychology and neural network research, offering a unique perspective that enhances transparency in understanding neural networks. Our code and dataset will be made available on GitHub.
