LGAIGNDec 31, 2024

Pan-infection Foundation Framework Enables Multiple Pathogen Prediction

arXiv:2501.01462v1h-index: 10
Originality Highly original
AI Analysis

This work addresses the need for accurate and generalizable diagnostic models to reduce inappropriate antibiotic prescriptions in clinical settings, representing a novel method for a known bottleneck.

The researchers tackled the problem of diagnosing bacterial and viral infections by curating a large host-response transcriptome dataset and building a pan-infection foundation model with an AUC of 0.97, then used knowledge distillation to create lightweight models for specific pathogens with AUCs ranging from 0.93 to 0.99.

Host-response-based diagnostics can improve the accuracy of diagnosing bacterial and viral infections, thereby reducing inappropriate antibiotic prescriptions. However, the existing cohorts with limited sample size and coarse infections types are unable to support the exploration of an accurate and generalizable diagnostic model. Here, we curate the largest infection host-response transcriptome data, including 11,247 samples across 89 blood transcriptome datasets from 13 countries and 21 platforms. We build a diagnostic model for pathogen prediction starting from a pan-infection model as foundation (AUC = 0.97) based on the pan-infection dataset. Then, we utilize knowledge distillation to efficiently transfer the insights from this "teacher" model to four lightweight pathogen "student" models, i.e., staphylococcal infection (AUC = 0.99), streptococcal infection (AUC = 0.94), HIV infection (AUC = 0.93), and RSV infection (AUC = 0.94), as well as a sepsis "student" model (AUC = 0.99). The proposed knowledge distillation framework not only facilitates the diagnosis of pathogens using pan-infection data, but also enables an across-disease study from pan-infection to sepsis. Moreover, the framework enables high-degree lightweight design of diagnostic models, which is expected to be adaptively deployed in clinical settings.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes