CVAILGNov 27, 2023

Variational Autoencoders for Feature Exploration and Malignancy Prediction of Lung Lesions

arXiv:2311.15719v15 citationsh-index: 19
Originality Incremental advance
AI Analysis

This addresses the need for interpretable models in clinical practice for early lung cancer diagnosis, though it is incremental as it builds on existing VAE methods.

The study tackled the problem of interpretable AI for lung cancer diagnosis by applying Variational Autoencoders (VAEs) to lung lesion data from CT scans, achieving state-of-the-art results with an AUC of 0.98 and 93.1% accuracy in malignancy prediction.

Lung cancer is responsible for 21% of cancer deaths in the UK and five-year survival rates are heavily influenced by the stage the cancer was identified at. Recent studies have demonstrated the capability of AI methods for accurate and early diagnosis of lung cancer from routine scans. However, this evidence has not translated into clinical practice with one barrier being a lack of interpretable models. This study investigates the application Variational Autoencoders (VAEs), a type of generative AI model, to lung cancer lesions. Proposed models were trained on lesions extracted from 3D CT scans in the LIDC-IDRI public dataset. Latent vector representations of 2D slices produced by the VAEs were explored through clustering to justify their quality and used in an MLP classifier model for lung cancer diagnosis, the best model achieved state-of-the-art metrics of AUC 0.98 and 93.1% accuracy. Cluster analysis shows the VAE latent space separates the dataset of malignant and benign lesions based on meaningful feature components including tumour size, shape, patient and malignancy class. We also include a comparative analysis of the standard Gaussian VAE (GVAE) and the more recent Dirichlet VAE (DirVAE), which replaces the prior with a Dirichlet distribution to encourage a more explainable latent space with disentangled feature representation. Finally, we demonstrate the potential for latent space traversals corresponding to clinically meaningful feature changes.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes