CVMar 30, 2019

Evaluating CNNs on the Gestalt Principle of Closure

Gregor Ehrensperger, Sebastian Stabinger, Antonio Rodríguez Sánchez

arXiv:1904.00285v12.67 citations

Originality Synthesis-oriented

AI Analysis

This addresses the problem of understanding CNNs' visual perception capabilities compared to humans, but it is incremental as it applies existing methods to new datasets.

The paper evaluated AlexNet and GoogLeNet on classifying Kanizsa triangles based on the Gestalt principle of closure, finding that this task is challenging for these CNNs but they show some adaptation to the effect.

Deep convolutional neural networks (CNNs) are widely known for their outstanding performance in classification and regression tasks over high-dimensional data. This made them a popular and powerful tool for a large variety of applications in industry and academia. Recent publications show that seemingly easy classifaction tasks (for humans) can be very challenging for state of the art CNNs. An attempt to describe how humans perceive visual elements is given by the Gestalt principles. In this paper we evaluate AlexNet and GoogLeNet regarding their performance on classifying the correctness of the well known Kanizsa triangles, which heavily rely on the Gestalt principle of closure. Therefore we created various datasets containing valid as well as invalid variants of the Kanizsa triangle. Our findings suggest that perceiving objects by utilizing the principle of closure is very challenging for the applied network architectures but they appear to adapt to the effect of closure.

View on arXiv PDF

Similar