CVAIDec 14, 2025

Anatomy-Guided Representation Learning Using a Transformer-Based Network for Thyroid Nodule Segmentation in Ultrasound Images

arXiv:2512.12662v12 citations
Originality Incremental advance
AI Analysis

This work addresses the challenge of automated thyroid nodule segmentation for clinical diagnosis, but it is incremental as it builds on existing transformer and multi-task learning approaches.

The paper tackled the problem of accurate thyroid nodule segmentation in ultrasound images by proposing SSMT-Net, a semi-supervised multi-task transformer-based network, which outperformed state-of-the-art methods on TN3K and DDTI datasets with higher accuracy and robustness.

Accurate thyroid nodule segmentation in ultrasound images is critical for diagnosis and treatment planning. However, ambiguous boundaries between nodules and surrounding tissues, size variations, and the scarcity of annotated ultrasound data pose significant challenges for automated segmentation. Existing deep learning models struggle to incorporate contextual information from the thyroid gland and generalize effectively across diverse cases. To address these challenges, we propose SSMT-Net, a Semi-Supervised Multi-Task Transformer-based Network that leverages unlabeled data to enhance Transformer-centric encoder feature extraction capability in an initial unsupervised phase. In the supervised phase, the model jointly optimizes nodule segmentation, gland segmentation, and nodule size estimation, integrating both local and global contextual features. Extensive evaluations on the TN3K and DDTI datasets demonstrate that SSMT-Net outperforms state-of-the-art methods, with higher accuracy and robustness, indicating its potential for real-world clinical applications.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes