LG
Nov 21, 2022

Addressing Mistake Severity in Neural Networks with Semantic Knowledge

arXiv:2211.11880v1
4 citations
h-index: 4
Originality: Incremental advance
AI Analysis

This paper addresses the trustworthiness of embodied agents in dynamic environments. The advance is incremental because it builds on existing adversarial training methods.

The paper aims to reduce the severity of mistakes that neural networks make in challenging conditions, such as out-of-distribution inputs. It uses adversarial training to increase the semantic similarity between predictions and true labels for misclassified instances, and it reports lower mistake severity than both standard and adversarially trained models.

Robustness in deep neural networks and machine learning algorithms in general is an open research challenge. In particular, it is difficult to ensure algorithmic performance is maintained on out-of-distribution inputs or anomalous instances that cannot be anticipated at training time. Embodied agents will be deployed in these conditions, and are likely to make incorrect predictions. An agent will be viewed as untrustworthy unless it can maintain its performance in dynamic environments. Most robust training techniques aim to improve model accuracy on perturbed inputs; as an alternative form of robustness, we aim to reduce the severity of mistakes made by neural networks in challenging conditions. We leverage current adversarial training methods to generate targeted adversarial attacks during the training process in order to increase the semantic similarity between a model's predictions and true labels of misclassified instances. Results demonstrate that our approach performs better with respect to mistake severity compared to standard and adversarially trained models. We also find an intriguing role that non-robust features play with regard to semantic similarity.
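The abstract does not say how mistake severity is measured. As a minimal sketch, assuming one common choice (the height of the lowest common ancestor of the true and predicted labels in a class hierarchy), the idea can be illustrated as follows. The `PARENT` hierarchy and all function names are hypothetical and are not taken from the paper.

```python
# Hypothetical toy class hierarchy, child -> parent. Not from the paper.
PARENT = {
    "tabby": "cat", "siamese": "cat",
    "beagle": "dog",
    "pickup": "truck",
    "cat": "animal", "dog": "animal",
    "truck": "vehicle",
    "animal": "entity", "vehicle": "entity",
}


def ancestors(label):
    """Return the path [label, parent, ..., root]."""
    path = [label]
    while path[-1] in PARENT:
        path.append(PARENT[path[-1]])
    return path


def mistake_severity(true_label, predicted_label):
    """Assumed metric: number of steps up from the true label to the
    lowest ancestor it shares with the prediction. 0 means correct."""
    predicted_ancestors = set(ancestors(predicted_label))
    for height, node in enumerate(ancestors(true_label)):
        if node in predicted_ancestors:
            return height
    raise ValueError("labels share no common ancestor")


def mean_mistake_severity(true_labels, predicted_labels):
    """Average severity over misclassified instances only."""
    severities = [
        mistake_severity(t, p)
        for t, p in zip(true_labels, predicted_labels)
        if t != p
    ]
    return sum(severities) / len(severities) if severities else 0.0
```

Under this assumed metric, predicting "siamese" for a "tabby" image scores 1, while predicting "pickup" scores 3. Two models with identical accuracy can therefore differ in mean mistake severity, which is the quantity the abstract says the approach improves.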
