CV AI LGNov 27, 2023

Variational Autoencoders for Feature Exploration and Malignancy Prediction of Lung Lesions

Benjamin Keel, Aaron Quyn, David Jayne, Samuel D. Relton

arXiv:2311.15719v12.85 citationsh-index: 19Has Code

Originality Incremental advance

AI Analysis

This addresses the need for interpretable models in clinical practice for early lung cancer diagnosis, though it is incremental as it builds on existing VAE methods.

The study tackled the problem of interpretable AI for lung cancer diagnosis by applying Variational Autoencoders (VAEs) to lung lesion data from CT scans, achieving state-of-the-art results with an AUC of 0.98 and 93.1% accuracy in malignancy prediction.

Lung cancer is responsible for 21% of cancer deaths in the UK and five-year survival rates are heavily influenced by the stage the cancer was identified at. Recent studies have demonstrated the capability of AI methods for accurate and early diagnosis of lung cancer from routine scans. However, this evidence has not translated into clinical practice with one barrier being a lack of interpretable models. This study investigates the application Variational Autoencoders (VAEs), a type of generative AI model, to lung cancer lesions. Proposed models were trained on lesions extracted from 3D CT scans in the LIDC-IDRI public dataset. Latent vector representations of 2D slices produced by the VAEs were explored through clustering to justify their quality and used in an MLP classifier model for lung cancer diagnosis, the best model achieved state-of-the-art metrics of AUC 0.98 and 93.1% accuracy. Cluster analysis shows the VAE latent space separates the dataset of malignant and benign lesions based on meaningful feature components including tumour size, shape, patient and malignancy class. We also include a comparative analysis of the standard Gaussian VAE (GVAE) and the more recent Dirichlet VAE (DirVAE), which replaces the prior with a Dirichlet distribution to encourage a more explainable latent space with disentangled feature representation. Finally, we demonstrate the potential for latent space traversals corresponding to clinically meaningful feature changes.

View on arXiv PDF Code

Similar