CVIRLGJul 1, 2023

Towards Resource-Efficient Streaming of Large-Scale Medical Image Datasets for Deep Learning

arXiv:2307.00438v3h-index: 14
Originality Incremental advance
AI Analysis

This addresses the problem of resource inefficiency for researchers in medical imaging, though it is incremental as it builds on existing streaming and database concepts.

The authors tackled the challenge of high storage and bandwidth requirements for large-scale medical image datasets by proposing the Medical Image Streaming Toolkit (MIST), which enables streaming of images at different resolutions and formats from a single copy, reducing storage and bandwidth needs without impacting quality.

Large-scale medical imaging datasets have accelerated deep learning (DL) for medical image analysis. However, the large scale of these datasets poses a challenge for researchers, resulting in increased storage and bandwidth requirements for hosting and accessing them. Since different researchers have different use cases and require different resolutions or formats for DL, it is neither feasible to anticipate every researcher's needs nor practical to store data in multiple resolutions and formats. To that end, we propose the Medical Image Streaming Toolkit (MIST), a format-agnostic database that enables streaming of medical images at different resolutions and formats from a single high-resolution copy. We evaluated MIST across eight popular, large-scale medical imaging datasets spanning different body parts, modalities, and formats. Our results showed that our framework reduced the storage and bandwidth requirements for hosting and downloading datasets without impacting image quality. We demonstrate that MIST addresses the challenges posed by large-scale medical imaging datasets by building a data-efficient and format-agnostic database to meet the diverse needs of researchers and reduce barriers to DL research in medical imaging.

Code Implementations5 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes