CVAINov 16, 2023

Trustworthy Large Models in Vision: A Survey

arXiv:2311.09680v51 citationsh-index: 28
Originality Synthesis-oriented
AI Analysis

It fills a gap by providing a systematic survey on trustworthy LMs in computer vision, which is incremental as it builds on existing NLP-focused literature.

This survey addresses the problem of untrustworthy behavior in large models (LMs) applied to computer vision, summarizing four key concerns—human misuse, vulnerability, inherent issues, and interpretability—to promote their reliable usage.

The rapid progress of Large Models (LMs) has recently revolutionized various fields of deep learning with remarkable grades, ranging from Natural Language Processing (NLP) to Computer Vision (CV). However, LMs are increasingly challenged and criticized by academia and industry due to their powerful performance but untrustworthy behavior, which urgently needs to be alleviated by reliable methods. Despite the abundance of literature on trustworthy LMs in NLP, a systematic survey specifically delving into the trustworthiness of LMs in CV remains absent. In order to mitigate this gap, we summarize four relevant concerns that obstruct the trustworthy usage in vision of LMs in this survey, including 1) human misuse, 2) vulnerability, 3) inherent issue and 4) interpretability. By highlighting corresponding challenge, countermeasures, and discussion in each topic, we hope this survey will facilitate readers' understanding of this field, promote alignment of LMs with human expectations and enable trustworthy LMs to serve as welfare rather than disaster for human society.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes