CVSep 3, 2025

RTGMFF: Enhanced fMRI-based Brain Disorder Diagnosis via ROI-driven Text Generation and Multimodal Feature Fusion

arXiv:2509.03214v13 citationsh-index: 12Has CodeBIBM
Originality Incremental advance
AI Analysis

This work addresses the problem of improving clinical diagnosis of brain disorders like ADHD and autism for patients and clinicians, representing an incremental advance by integrating text generation and frequency-spatial encoding into existing multimodal approaches.

The paper tackles the challenge of reliable fMRI-based brain disorder diagnosis by introducing RTGMFF, a framework that combines ROI-driven text generation with multimodal feature fusion, achieving notable gains in diagnostic accuracy, sensitivity, specificity, and area under the ROC curve on ADHD-200 and ABIDE benchmarks.

Functional magnetic resonance imaging (fMRI) is a powerful tool for probing brain function, yet reliable clinical diagnosis is hampered by low signal-to-noise ratios, inter-subject variability, and the limited frequency awareness of prevailing CNN- and Transformer-based models. Moreover, most fMRI datasets lack textual annotations that could contextualize regional activation and connectivity patterns. We introduce RTGMFF, a framework that unifies automatic ROI-level text generation with multimodal feature fusion for brain-disorder diagnosis. RTGMFF consists of three components: (i) ROI-driven fMRI text generation deterministically condenses each subject's activation, connectivity, age, and sex into reproducible text tokens; (ii) Hybrid frequency-spatial encoder fuses a hierarchical wavelet-mamba branch with a cross-scale Transformer encoder to capture frequency-domain structure alongside long-range spatial dependencies; and (iii) Adaptive semantic alignment module embeds the ROI token sequence and visual features in a shared space, using a regularized cosine-similarity loss to narrow the modality gap. Extensive experiments on the ADHD-200 and ABIDE benchmarks show that RTGMFF surpasses current methods in diagnostic accuracy, achieving notable gains in sensitivity, specificity, and area under the ROC curve. Code is available at https://github.com/BeistMedAI/RTGMFF.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes