CVDec 9, 2024

Is Self-Supervision Enough? Benchmarking Foundation Models Against End-to-End Training for Mitotic Figure Classification

Jonathan Ganz, Jonas Ammeling, Emely Rosbach, Ludwig Lausser, Christof A. Bertram, Katharina Breininger, Marc Aubreville

arXiv:2412.06365v22.0h-index: 22

Originality Synthesis-oriented

AI Analysis

This challenges assumptions about FMs' benefits for histopathology tasks, indicating incremental insights for medical imaging researchers.

The study tested whether foundation models (FMs) reduce labeled data needs and improve robustness for mitotic figure classification, finding that an end-to-end-trained ResNet50 baseline outperformed all FM-based classifiers across data amounts and showed no better domain robustness.

Foundation models (FMs), i.e., models trained on a vast amount of typically unlabeled data, have become popular and available recently for the domain of histopathology. The key idea is to extract semantically rich vectors from any input patch, allowing for the use of simple subsequent classification networks potentially reducing the required amounts of labeled data, and increasing domain robustness. In this work, we investigate to which degree this also holds for mitotic figure classification. Utilizing two popular public mitotic figure datasets, we compared linear probing of five publicly available FMs against models trained on ImageNet and a simple ResNet50 end-to-end-trained baseline. We found that the end-to-end-trained baseline outperformed all FM-based classifiers, regardless of the amount of data provided. Additionally, we did not observe the FM-based classifiers to be more robust against domain shifts, rendering both of the above assumptions incorrect.

View on arXiv PDF

Similar