LGCVNov 19, 2020

An Experimental Study of Semantic Continuity for Deep Learning Models

arXiv:2011.09789v23 citations
AI Analysis

This work tackles a foundational problem of model stability and reliability for all deep learning practitioners, with broad implications for various downstream applications.

This paper addresses semantic discontinuity in deep learning, where small input changes cause semantic shifts in output. The authors propose a semantic continuity constraint, demonstrating it reduces non-semantic information usage and improves adversarial robustness, interpretability, model transfer, and machine bias.

Deep learning models suffer from the problem of semantic discontinuity: small perturbations in the input space tend to cause semantic-level interference to the model output. We argue that the semantic discontinuity results from these inappropriate training targets and contributes to notorious issues such as adversarial robustness, interpretability, etc. We first conduct data analysis to provide evidence of semantic discontinuity in existing deep learning models, and then design a simple semantic continuity constraint which theoretically enables models to obtain smooth gradients and learn semantic-oriented features. Qualitative and quantitative experiments prove that semantically continuous models successfully reduce the use of non-semantic information, which further contributes to the improvement in adversarial robustness, interpretability, model transfer, and machine bias.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes