CVMay 31, 2023

Enrichment of the NLST and NSCLC-Radiomics computed tomography collections with AI-derived annotations

arXiv:2306.00150v13 citationsHas Code
Originality Synthesis-oriented
AI Analysis

This work provides enriched datasets for researchers in cancer imaging, though it is incremental as it applies existing AI methods to new data.

The study tackled the lack of annotations in public cancer imaging datasets by using AI tools to automatically generate volumetric and slice-level annotations for two chest CT collections, making them publicly available in the NCI Imaging Data Commons with cloud-enabled notebooks for demonstration.

Public imaging datasets are critical for the development and evaluation of automated tools in cancer imaging. Unfortunately, many do not include annotations or image-derived features, complicating their downstream analysis. Artificial intelligence-based annotation tools have been shown to achieve acceptable performance and thus can be used to automatically annotate large datasets. As part of the effort to enrich public data available within NCI Imaging Data Commons (IDC), here we introduce AI-generated annotations for two collections of computed tomography images of the chest, NSCLC-Radiomics, and the National Lung Screening Trial. Using publicly available AI algorithms we derived volumetric annotations of thoracic organs at risk, their corresponding radiomics features, and slice-level annotations of anatomical landmarks and regions. The resulting annotations are publicly available within IDC, where the DICOM format is used to harmonize the data and achieve FAIR principles. The annotations are accompanied by cloud-enabled notebooks demonstrating their use. This study reinforces the need for large, publicly accessible curated datasets and demonstrates how AI can be used to aid in cancer imaging.

Code Implementations2 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes