LGQMMay 20, 2025

TxPert: Leveraging Biochemical Relationships for Out-of-Distribution Transcriptomic Perturbation Prediction

arXiv:2505.14919v112 citationsh-index: 18Has Code
Originality Incremental advance
AI Analysis

This work addresses the challenge of generalizing perturbation predictions for disease understanding and therapy design, representing an incremental improvement with specific gains in OOD settings.

The paper tackles the problem of predicting cellular responses to genetic perturbations under out-of-distribution scenarios, such as unseen single or double perturbations and cell lines, by introducing TxPert, a method that leverages biochemical knowledge graphs to achieve state-of-the-art performance.

Accurately predicting cellular responses to genetic perturbations is essential for understanding disease mechanisms and designing effective therapies. Yet exhaustively exploring the space of possible perturbations (e.g., multi-gene perturbations or across tissues and cell types) is prohibitively expensive, motivating methods that can generalize to unseen conditions. In this work, we explore how knowledge graphs of gene-gene relationships can improve out-of-distribution (OOD) prediction across three challenging settings: unseen single perturbations; unseen double perturbations; and unseen cell lines. In particular, we present: (i) TxPert, a new state-of-the-art method that leverages multiple biological knowledge networks to predict transcriptional responses under OOD scenarios; (ii) an in-depth analysis demonstrating the impact of graphs, model architecture, and data on performance; and (iii) an expanded benchmarking framework that strengthens evaluation standards for perturbation modeling.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes