IV CVFeb 15, 2021

Colored Kimia Path24 Dataset: Configurations and Benchmarks with Deep Embeddings

Sobhan Shafiei, Morteza Babaie, Shivam Kalra, H. R. Tizhoosh

arXiv:2102.07611v16.18 citations

Originality Synthesis-oriented

AI Analysis

This work addresses a specific problem in digital pathology by enhancing dataset utility for researchers, but it is incremental as it builds on an existing dataset with new color data.

The authors tackled the lack of color information in the Kimia Path24 digital pathology dataset by creating a color version, Kimia Path24C, and conducted experiments to benchmark it using deep learning models for image retrieval, achieving up to 95.92% accuracy with DenseNet.

The Kimia Path24 dataset has been introduced as a classification and retrieval dataset for digital pathology. Although it provides multi-class data, the color information has been neglected in the process of extracting patches. The staining information plays a major role in the recognition of tissue patterns. To address this drawback, we introduce the color version of Kimia Path24 by recreating sample patches from all 24 scans to propose Kimia Path24C. We run extensive experiments to determine the best configuration for selected patches. To provide preliminary results for setting a benchmark for the new dataset, we utilize VGG16, InceptionV3 and DenseNet-121 model as feature extractors. Then, we use these feature vectors to retrieve test patches. The accuracy of image retrieval using DenseNet was 95.92% while the highest accuracy using InceptionV3 and VGG16 reached 92.45% and 92%, respectively. We also experimented with "deep barcodes" and established that with a small loss in accuracy (e.g., 93.43% for binarized features for DenseNet instead of 95.92% when the features themselves are used), the search operations can be significantly accelerated.

View on arXiv PDF

Similar