OTLGAPMLSep 18, 2019

Uncovering Sociological Effect Heterogeneity using Machine Learning

arXiv:1909.09138v16 citations
Originality Synthesis-oriented
AI Analysis

This addresses the challenge for sociologists in discovering new subgroups for effect heterogeneity, though it is incremental as it adapts existing machine learning methods to sociological data.

The paper tackles the problem of identifying subgroups with varying treatment effects in sociology by using causal trees with honest estimation and matching, applied to college effects on wages, and finds that this data-driven approach can uncover meaningful heterogeneity beyond conventional methods.

Individuals do not respond uniformly to treatments, events, or interventions. Sociologists routinely partition samples into subgroups to explore how the effects of treatments vary by covariates like race, gender, and socioeconomic status. In so doing, analysts determine the key subpopulations based on theoretical priors. Data-driven discoveries are also routine, yet the analyses by which sociologists typically go about them are problematic and seldom move us beyond our expectations, and biases, to explore new meaningful subgroups. Emerging machine learning methods allow researchers to explore sources of variation that they may not have previously considered, or envisaged. In this paper, we use causal trees to recursively partition the sample and uncover sources of treatment effect heterogeneity. We use honest estimation, splitting the sample into a training sample to grow the tree and an estimation sample to estimate leaf-specific effects. Assessing a central topic in the social inequality literature, college effects on wages, we compare what we learn from conventional approaches for exploring variation in effects to causal trees. Given our use of observational data, we use leaf-specific matching and sensitivity analyses to address confounding and offer interpretations of effects based on observed and unobserved heterogeneity. We encourage researchers to follow similar practices in their work on variation in sociological effects.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes