AICLCVLGSCJun 25, 2024

Large Language Models are Interpretable Learners

arXiv:2406.17224v18 citations
Originality Highly original
AI Analysis

This addresses the problem of building human-centric predictive models that are both accurate and interpretable, offering a novel approach to bridge the gap between symbolic rules and neural networks.

The paper tackles the trade-off between expressiveness and interpretability in predictive models by combining Large Language Models (LLMs) with symbolic programs, resulting in LLM-based Symbolic Programs (LSPs) that achieve superior performance compared to traditional neurosymbolic programs and vanilla automatic prompt tuning methods.

The trade-off between expressiveness and interpretability remains a core challenge when building human-centric predictive models for classification and decision-making. While symbolic rules offer interpretability, they often lack expressiveness, whereas neural networks excel in performance but are known for being black boxes. In this paper, we show a combination of Large Language Models (LLMs) and symbolic programs can bridge this gap. In the proposed LLM-based Symbolic Programs (LSPs), the pretrained LLM with natural language prompts provides a massive set of interpretable modules that can transform raw input into natural language concepts. Symbolic programs then integrate these modules into an interpretable decision rule. To train LSPs, we develop a divide-and-conquer approach to incrementally build the program from scratch, where the learning process of each step is guided by LLMs. To evaluate the effectiveness of LSPs in extracting interpretable and accurate knowledge from data, we introduce IL-Bench, a collection of diverse tasks, including both synthetic and real-world scenarios across different modalities. Empirical results demonstrate LSP's superior performance compared to traditional neurosymbolic programs and vanilla automatic prompt tuning methods. Moreover, as the knowledge learned by LSP is a combination of natural language descriptions and symbolic rules, it is easily transferable to humans (interpretable), and other LLMs, and generalizes well to out-of-distribution samples.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes