CVMar 21, 2025

CoRLD: Contrastive Representation Learning Of Deformable Shapes In Images

arXiv:2503.17162v26.21 citationsh-index: 4Has CodeIPMI

Originality Incremental advance

AI Analysis

This work addresses limitations in deformable shape analysis for medical image classification, offering a more flexible approach for applications like brain MRI and adrenal CT scans.

The paper tackles the problem of learning deformable shape representations without needing a known template during testing, and demonstrates that their CoRLD framework boosts classification accuracy on medical imaging datasets.

Deformable shape representations, parameterized by deformations relative to a given template, have proven effective for improved image analysis tasks. However, their broader applicability is hindered by two major challenges. First, existing methods mainly rely on a known template during testing, which is impractical and limits flexibility. Second, they often struggle to capture fine-grained, voxel-level distinctions between similar shapes (e.g., anatomical variations among healthy individuals, those with mild cognitive impairment, and diseased states). To address these limitations, we propose a novel framework - Contrastive Representation Learning of Deformable shapes (CoRLD) in learned deformation spaces and demonstrate its effectiveness in the context of image classification. Our CoRLD leverages a class-aware contrastive supervised learning objective in latent deformation spaces, promoting proximity among representations of similar classes while ensuring separation of dissimilar groups. In contrast to previous deep learning networks that require a reference image as input to predict deformation changes, our approach eliminates this dependency. Instead, template images are utilized solely as ground truth in the loss function during the training process, making our model more flexible and generalizable to a wide range of medical applications. We validate CoRLD on diverse datasets, including real brain magnetic resonance imaging (MRIs) and adrenal shapes derived from computed tomography (CT) scans. Experimental results show that our model effectively extracts deformable shape features, which can be easily integrated with existing classifiers to substantially boost the classification accuracy. Our code is available at GitHub.

View on arXiv PDF Code

Similar