IVCVSep 19, 2025

FMD-TransUNet: Abdominal Multi-Organ Segmentation Based on Frequency Domain Multi-Axis Representation Learning and Dual Attention Mechanisms

arXiv:2509.16044v1h-index: 4
Originality Incremental advance
AI Analysis

This work addresses the challenge of precise abdominal multi-organ segmentation for clinical applications, representing an incremental improvement over existing deep learning methods.

The paper tackles the problem of accurately segmenting small, irregular, or complex abdominal organs in medical imaging by proposing FMD-TransUNet, which integrates frequency-domain multi-axis representation learning and dual attention mechanisms into the TransUNet framework. Experimental results on the Synapse dataset show it achieves an average DSC of 81.32% and HD of 16.35 mm, outperforming recent state-of-the-art methods with a 3.84% DSC increase and 15.34 mm HD decrease compared to the baseline.

Accurate abdominal multi-organ segmentation is critical for clinical applications. Although numerous deep learning-based automatic segmentation methods have been developed, they still struggle to segment small, irregular, or anatomically complex organs. Moreover, most current methods focus on spatial-domain analysis, often overlooking the synergistic potential of frequency-domain representations. To address these limitations, we propose a novel framework named FMD-TransUNet for precise abdominal multi-organ segmentation. It innovatively integrates the Multi-axis External Weight Block (MEWB) and the improved dual attention module (DA+) into the TransUNet framework. The MEWB extracts multi-axis frequency-domain features to capture both global anatomical structures and local boundary details, providing complementary information to spatial-domain representations. The DA+ block utilizes depthwise separable convolutions and incorporates spatial and channel attention mechanisms to enhance feature fusion, reduce redundant information, and narrow the semantic gap between the encoder and decoder. Experimental validation on the Synapse dataset shows that FMD-TransUNet outperforms other recent state-of-the-art methods, achieving an average DSC of 81.32\% and a HD of 16.35 mm across eight abdominal organs. Compared to the baseline model, the average DSC increased by 3.84\%, and the average HD decreased by 15.34 mm. These results demonstrate the effectiveness of FMD-TransUNet in improving the accuracy of abdominal multi-organ segmentation.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes