CEMay 25

From Reports to Ontologies: Ontology-Guided Representation Learning for 12-Lead ECG

arXiv:2605.2588368.3
Predicted impact top 6% in CE · last 90 daysOriginality Incremental advance
AI Analysis

For researchers in medical AI, this work provides a method to leverage structured diagnostic codes for ECG representation learning, reducing reliance on paired clinical text.

MAR-ECG introduces an ontology-guided masked autoregressive framework for 12-lead ECG representation learning that uses SNOMED-CT cardiac graph alignment to supervise the encoder without paired clinical reports. On five downstream benchmarks, it outperforms a strong baseline with mean gains in the low-label regime and achieves performance competitive with multimodal ECG-text methods.

The 12-lead electrocardiogram (ECG) is a quasi-periodic, multi-channel signal with diagnostic content spanning timescales from millisecond waveform morphology to multi-second rhythm dynamics. Existing ECG representation learning relies on signal-only self-supervision or ECG-text multimodal alignment, neither of which exploits the structured diagnostic codes attached to every clinical recording. We present \textbf{MAR-ECG}, an ontology-guided masked autoregressive framework that supervises the encoder with a curated 40-node SNOMED-CT cardiac graph through \emph{graph alignment}, eliminating the need for paired clinical reports. MAR-ECG combines two complementary objectives. First, \emph{graph-smoothed contrastive learning} (GSCL) anchors the encoder's rhythm-pooled features to the SNOMED graph, softening supervision targets by ontology distance so that clinically related concepts reinforce one another rather than function as hard negatives. Second, \emph{multi-scale physiological supervision} complements GSCL with signal-derived patch auxiliaries that target rhythm-physiology statistics extracted automatically from the input, extending supervision beyond the patch tier at no annotation cost. Pretrained on ${\sim}40$K publicly available 12-lead ECGs with SNOMED-CT codes and evaluated by frozen linear probing on five downstream classification benchmarks, MAR-ECG consistently outperforms a strong masked-autoregressive baseline, with mean gains in the low-label regime. Despite the absence of paired clinical text, MAR-ECG achieves performance competitive with state-of-the-art multimodal ECG-text methods.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes