LGAIDec 9, 2023

Deeper Understanding of Black-box Predictions via Generalized Influence Functions

arXiv:2312.05586v27 citationsh-index: 7Has Code
Originality Highly original
AI Analysis

This provides a foundational tool for optimizing models and enhancing AI interpretability, addressing a bottleneck in understanding black-box predictions for researchers and practitioners.

The paper tackles the inaccuracy of influence functions in large-scale models by introducing generalized influence functions that precisely estimate target parameters' influence while nullifying nuisance changes, achieving more accurate results than other methods by updating only 5% of the model across tasks.

Influence functions (IFs) elucidate how training data changes model behavior. However, the increasing size and non-convexity in large-scale models make IFs inaccurate. We suspect that the fragility comes from the first-order approximation which may cause nuisance changes in parameters irrelevant to the examined data. However, simply computing influence from the chosen parameters can be misleading, as it fails to nullify the hidden effects of unselected parameters on the analyzed data. Thus, our approach introduces generalized IFs, precisely estimating target parameters' influence while nullifying nuisance gradient changes on fixed parameters. We identify target update parameters closely associated with the input data by the output- and gradient-based parameter selection methods. We verify the generalized IFs with various alternatives of IFs on the class removal and label change tasks. The experiments align with the "less is more" philosophy, demonstrating that updating only 5\% of the model produces more accurate results than other influence functions across all tasks. We believe our proposal works as a foundational tool for optimizing models, conducting data analysis, and enhancing AI interpretability beyond the limitation of IFs. Codes are available at https://github.com/hslyu/GIF.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes