LGFeb 15

In Transformer We Trust? A Perspective on Transformer Architecture Failure Modes

arXiv:2602.14318v1
Originality Synthesis-oriented
AI Analysis

This work addresses the need for understanding transformer reliability in safety-critical domains, but it is incremental as it synthesizes existing insights rather than introducing new methods.

The paper critically examines the trustworthiness of transformer models by reviewing their reliability in interpretability, robustness, fairness, and privacy across high-stakes applications like healthcare and autonomous systems, identifying recurring vulnerabilities and open challenges.

Transformer architectures have revolutionized machine learning across a wide range of domains, from natural language processing to scientific computing. However, their growing deployment in high-stakes applications, such as computer vision, natural language processing, healthcare, autonomous systems, and critical areas of scientific computing including climate modeling, materials discovery, drug discovery, nuclear science, and robotics, necessitates a deeper and more rigorous understanding of their trustworthiness. In this work, we critically examine the foundational question: \textitHow trustworthy are transformer models?} We evaluate their reliability through a comprehensive review of interpretability, explainability, robustness against adversarial attacks, fairness, and privacy. We systematically examine the trustworthiness of transformer-based models in safety-critical applications spanning natural language processing, computer vision, and science and engineering domains, including robotics, medicine, earth sciences, materials science, fluid dynamics, nuclear science, and automated theorem proving; highlighting high-impact areas where these architectures are central and analyzing the risks associated with their deployment. By synthesizing insights across these diverse areas, we identify recurring structural vulnerabilities, domain-specific risks, and open research challenges that limit the reliable deployment of transformers.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes