MEAIMSEMNov 9, 2020

DoWhy: An End-to-End Library for Causal Inference

arXiv:2011.04216v1261 citationsHas Code
AI Analysis

This library addresses the need for more comprehensive causal inference tools in data science and research, though it is incremental as it builds on existing frameworks like causal graphs.

The authors tackled the challenge of causal inference by developing DoWhy, an open-source Python library that integrates causal assumptions as first-class citizens, enabling users to model, identify, estimate, and refute causal effects through a structured four-step API, with features like robustness checks for unobserved confounding.

In addition to efficient statistical estimators of a treatment's effect, successful application of causal inference requires specifying assumptions about the mechanisms underlying observed data and testing whether they are valid, and to what extent. However, most libraries for causal inference focus only on the task of providing powerful statistical estimators. We describe DoWhy, an open-source Python library that is built with causal assumptions as its first-class citizens, based on the formal framework of causal graphs to specify and test causal assumptions. DoWhy presents an API for the four steps common to any causal analysis---1) modeling the data using a causal graph and structural assumptions, 2) identifying whether the desired effect is estimable under the causal model, 3) estimating the effect using statistical estimators, and finally 4) refuting the obtained estimate through robustness checks and sensitivity analyses. In particular, DoWhy implements a number of robustness checks including placebo tests, bootstrap tests, and tests for unoberved confounding. DoWhy is an extensible library that supports interoperability with other implementations, such as EconML and CausalML for the the estimation step. The library is available at https://github.com/microsoft/dowhy

Code Implementations5 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes