AICLJun 11, 2024

Large Language Models for Constrained-Based Causal Discovery

arXiv:2406.07378v126 citations
Originality Incremental advance
AI Analysis

This work addresses the challenge of causal discovery in complex systems like the economy or brain, offering a potential complementary tool for researchers, though it appears incremental as it builds on existing methods like the PC algorithm.

The paper tackles the problem of generating causal graphs by using Large Language Models (LLMs) as an alternative to domain experts, framing conditional independence queries as prompts and applying the PC algorithm, with results showing improved performance through a voting schema that controls error rates.

Causality is essential for understanding complex systems, such as the economy, the brain, and the climate. Constructing causal graphs often relies on either data-driven or expert-driven approaches, both fraught with challenges. The former methods, like the celebrated PC algorithm, face issues with data requirements and assumptions of causal sufficiency, while the latter demand substantial time and domain knowledge. This work explores the capabilities of Large Language Models (LLMs) as an alternative to domain experts for causal graph generation. We frame conditional independence queries as prompts to LLMs and employ the PC algorithm with the answers. The performance of the LLM-based conditional independence oracle on systems with known causal graphs shows a high degree of variability. We improve the performance through a proposed statistical-inspired voting schema that allows some control over false-positive and false-negative rates. Inspecting the chain-of-thought argumentation, we find causal reasoning to justify its answer to a probabilistic query. We show evidence that knowledge-based CIT could eventually become a complementary tool for data-driven causal discovery.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes