LG · AI · ML · Oct 15, 2024

A Theoretical Survey on Foundation Models

arXiv:2410.11444v21 citations · h-index: 7
Originality: Synthesis-oriented
AI Analysis

The survey tackles the problem of interpreting black-box foundation models for AI researchers and practitioners, but its contribution is incremental: it surveys existing methods rather than introducing new ones.

This survey reviews interpretable methods for understanding foundation models, addressing limitations of existing explainable approaches by focusing on faithfulness and resource efficiency, and identifies future research directions based on these interpretations.

Understanding the inner mechanisms of black-box foundation models (FMs) is essential yet challenging in artificial intelligence and its applications. Over the last decade, the long-running focus has been on their explainability, leading to the development of post-hoc explainable methods that rationalize specific decisions already made by black-box FMs. However, these explainable methods have certain limitations in terms of faithfulness and resource requirements. Consequently, a new class of interpretable methods should be considered to unveil the underlying mechanisms of FMs in an accurate, comprehensive, heuristic, and resource-light way. This survey reviews the interpretable methods that comply with these principles and have been successfully applied to FMs. These methods are deeply rooted in machine learning theory, covering the analysis of generalization performance, expressive capability, and dynamic behavior. They provide a thorough interpretation of the entire workflow of FMs, ranging from inference capability and training dynamics to ethical implications. Ultimately, drawing upon these interpretations, this review identifies the next frontier research directions for FMs.
