CVJun 19, 2025

Prompt-based Dynamic Token Pruning for Efficient Segmentation of Medical Images

arXiv:2506.16369v2h-index: 4
Originality Incremental advance
AI Analysis

This work addresses efficiency for medical image analysis in resource-constrained environments, though it is incremental as it builds on existing models.

The paper tackles the high computational cost of Vision Transformers in medical image segmentation by proposing a prompt-driven token pruning method, which reduces tokens by 35-55% while maintaining segmentation accuracy.

The high computational demands of Vision Transformers (ViTs) in processing a large number of tokens often constrain their practical application in analyzing medical images. This research proposes a Prompt-driven Adaptive Token ({\it PrATo}) pruning method to selectively reduce the processing of irrelevant tokens in the segmentation pipeline. The prompt-based spatial prior helps to rank the tokens according to their relevance. Tokens with low-relevance scores are down-weighted, ensuring that only the relevant ones are propagated for processing across subsequent stages. This data-driven pruning strategy improves segmentation accuracy and inference speed by allocating computational resources to essential regions. The proposed framework is integrated with several state-of-the-art models to facilitate the elimination of irrelevant tokens, thereby enhancing computational efficiency while preserving segmentation accuracy. The experimental results show a reduction of $\sim$ 35-55% tokens; thus reducing the computational costs relative to baselines. Cost-effective medical image processing, using our framework, facilitates real-time diagnosis by expanding its applicability in resource-constrained environments.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes