LGJun 4

Causal Modeling of Selection in Evolution

Haoyue Dai, Zeyu Tang, Peter Spirtes, Kun Zhang

arXiv:2606.0568943.0

AI Analysis

For researchers in causal discovery and evolutionary biology, this work addresses a previously unrecognized gap in modeling selection processes.

The paper distinguishes between static and evolutionary selection, showing that existing causal discovery methods conflate them and fail under evolution. It introduces a new model for evolutionary selection and a procedure to identify it from data, validated experimentally.

Understanding potential selection in data is crucial for causal discovery; we argue that "selection" in common narratives takes two forms, which we term static and evolutionary selection, respectively. Static selection refers to a one-shot filtering process where observed data consist of a subset of the population of interest, as in survey volunteer bias. Evolutionary selection, in contrast, operates through repeated rounds of differential fitness in reproduction, where observed data constitute the latest generation shaped by a historical trajectory, as in immune adaptation, antibiotic resistance, and social norm emergence. Existing methods largely conflate these two forms and rely on an identical graphical model of selection. We show that this model is valid for static settings but fails to characterize data under evolution, yielding false discovery results. To address this, we introduce a new model that specifically characterizes evolutionary selection, and develop a sound and complete procedure for identifying such models from data across one or multiple environments or generations. Experimental results validate the method's ability to uncover the relevant mechanisms underlying evolution from data.

View on arXiv PDF

Similar