CV AI LG NCApr 16, 2020

Shortcut Learning in Deep Neural Networks

Robert Geirhos, Jörn-Henrik Jacobsen, Claudio Michaelis, Richard Zemel, Wieland Brendel, Matthias Bethge, Felix A. Wichmann

arXiv:2004.07780v557.13194 citationsHas Code

Originality Incremental advance

AI Analysis

This addresses a critical limitation in AI systems by highlighting a pervasive issue that affects the reliability and transferability of deep learning models across various applications.

The paper identifies shortcut learning as a fundamental problem in deep neural networks, where models rely on superficial decision rules that fail in real-world scenarios, and proposes recommendations for improved benchmarking and robustness.

Deep learning has triggered the current rise of artificial intelligence and is the workhorse of today's machine intelligence. Numerous success stories have rapidly spread all over science, industry and society, but its limitations have only recently come into focus. In this perspective we seek to distill how many of deep learning's problems can be seen as different symptoms of the same underlying problem: shortcut learning. Shortcuts are decision rules that perform well on standard benchmarks but fail to transfer to more challenging testing conditions, such as real-world scenarios. Related issues are known in Comparative Psychology, Education and Linguistics, suggesting that shortcut learning may be a common characteristic of learning systems, biological and artificial alike. Based on these observations, we develop a set of recommendations for model interpretation and benchmarking, highlighting recent advances in machine learning to improve robustness and transferability from the lab to real-world applications.

View on arXiv PDF Code

Similar