CVDec 9, 2024

HYATT-Net is Grand: A Hybrid Attention Network for Performant Anatomical Landmark Detection

arXiv:2412.06499v23 citationsh-index: 9Has Code
Originality Incremental advance
AI Analysis

This work addresses the challenge of balancing global context with computational efficiency in anatomical landmark detection for clinical applications, representing an incremental improvement over existing methods.

The paper tackles the problem of anatomical landmark detection in medical images by proposing HYATT-Net, a hybrid attention network that integrates CNNs and Transformers, achieving state-of-the-art performance in accuracy, robustness, and efficiency across five diverse datasets.

Anatomical landmark detection (ALD) from a medical image is crucial for a wide array of clinical applications. While existing methods achieve quite some success in ALD, they often struggle to balance global context with computational efficiency, particularly with high-resolution images, thereby leading to the rise of a natural question: where is the performance limit of ALD? In this paper, we aim to forge performant ALD by proposing a {\bf HY}brid {\bf ATT}ention {\bf Net}work (HYATT-Net) with the following designs: (i) A novel hybrid architecture that integrates CNNs and Transformers. Its core is the BiFormer module, utilizing Bi-Level Routing Attention for efficient attention to relevant image regions. This, combined with Attention Residual Module(ARM), enables precise local feature refinement guided by the global context. (ii) A Feature Fusion Correction Module that aggregates multi-scale features and thus mitigates a resolution loss. Deep supervision with a mean-square error loss on multi-resolution heatmaps optimizes the model. Experiments on five diverse datasets demonstrate state-of-the-art performance, surpassing existing methods in accuracy, robustness, and efficiency. The HYATT-Net provides a promising solution for accurate and efficient ALD in complex medical images. Our codes and data are already released at: \url{https://github.com/ECNUACRush/HYATT-Net}.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes