LG MLOct 28, 2024

LLM-initialized Differentiable Causal Discovery

Shiv Kampani, David Hidary, Constantijn van der Poel, Martin Ganahl, Brenda Miao

arXiv:2410.21141v110.46 citationsh-index: 1

Originality Incremental advance

AI Analysis

This work addresses the problem of limited interpretability and integration of prior knowledge in causal discovery for scientific domains, representing an incremental improvement by combining LLM priors with DCD methods.

The paper tackles the challenge of improving interpretability and incorporating domain knowledge in differentiable causal discovery (DCD) by proposing LLM-DCD, which uses a Large Language Model (LLM) to initialize the optimization of DCD's maximum likelihood objective, resulting in higher accuracy on benchmarking datasets compared to state-of-the-art alternatives.

The discovery of causal relationships between random variables is an important yet challenging problem that has applications across many scientific domains. Differentiable causal discovery (DCD) methods are effective in uncovering causal relationships from observational data; however, these approaches often suffer from limited interpretability and face challenges in incorporating domain-specific prior knowledge. In contrast, Large Language Models (LLMs)-based causal discovery approaches have recently been shown capable of providing useful priors for causal discovery but struggle with formal causal reasoning. In this paper, we propose LLM-DCD, which uses an LLM to initialize the optimization of the maximum likelihood objective function of DCD approaches, thereby incorporating strong priors into the discovery method. To achieve this initialization, we design our objective function to depend on an explicitly defined adjacency matrix of the causal graph as its only variational parameter. Directly optimizing the explicitly defined adjacency matrix provides a more interpretable approach to causal discovery. Additionally, we demonstrate higher accuracy on key benchmarking datasets of our approach compared to state-of-the-art alternatives, and provide empirical evidence that the quality of the initialization directly impacts the quality of the final output of our DCD approach. LLM-DCD opens up new opportunities for traditional causal discovery methods like DCD to benefit from future improvements in the causal reasoning capabilities of LLMs.

View on arXiv PDF

Similar