AI CL CV LG SCJun 25, 2024

Large Language Models are Interpretable Learners

Ruochen Wang, Si Si, Felix Yu, Dorothea Wiesmann, Cho-Jui Hsieh, Inderjit Dhillon

arXiv:2406.17224v110.79 citationsHas Code

Originality Highly original

AI Analysis

This addresses the problem of building human-centric predictive models that are both accurate and interpretable, offering a novel approach to bridge the gap between symbolic rules and neural networks.

The paper tackles the trade-off between expressiveness and interpretability in predictive models by combining Large Language Models (LLMs) with symbolic programs, resulting in LLM-based Symbolic Programs (LSPs) that achieve superior performance compared to traditional neurosymbolic programs and vanilla automatic prompt tuning methods.

The trade-off between expressiveness and interpretability remains a core challenge when building human-centric predictive models for classification and decision-making. While symbolic rules offer interpretability, they often lack expressiveness, whereas neural networks excel in performance but are known for being black boxes. In this paper, we show a combination of Large Language Models (LLMs) and symbolic programs can bridge this gap. In the proposed LLM-based Symbolic Programs (LSPs), the pretrained LLM with natural language prompts provides a massive set of interpretable modules that can transform raw input into natural language concepts. Symbolic programs then integrate these modules into an interpretable decision rule. To train LSPs, we develop a divide-and-conquer approach to incrementally build the program from scratch, where the learning process of each step is guided by LLMs. To evaluate the effectiveness of LSPs in extracting interpretable and accurate knowledge from data, we introduce IL-Bench, a collection of diverse tasks, including both synthetic and real-world scenarios across different modalities. Empirical results demonstrate LSP's superior performance compared to traditional neurosymbolic programs and vanilla automatic prompt tuning methods. Moreover, as the knowledge learned by LSP is a combination of natural language descriptions and symbolic rules, it is easily transferable to humans (interpretable), and other LLMs, and generalizes well to out-of-distribution samples.

View on arXiv PDF Code

Similar