IVCVLGJul 19, 2023

Multi-modal Learning based Prediction for Disease

arXiv:2307.09823v12 citationsh-index: 20
Originality Incremental advance
AI Analysis

This work addresses the need for non-invasive NAFLD diagnosis to prevent advanced liver conditions, offering a simpler and potentially more robust method, though it is incremental in improving existing multi-modal techniques.

The paper tackles the problem of non-invasive prediction of non-alcoholic fatty liver disease (NAFLD) by proposing a multi-modal learning system (DeepFLD) that uses clinical metadata and facial images, achieving competitive results with facial images alone and outperforming metadata-only approaches.

Non alcoholic fatty liver disease (NAFLD) is the most common cause of chronic liver disease, which can be predicted accurately to prevent advanced fibrosis and cirrhosis. While, a liver biopsy, the gold standard for NAFLD diagnosis, is invasive, expensive, and prone to sampling errors. Therefore, non-invasive studies are extremely promising, yet they are still in their infancy due to the lack of comprehensive research data and intelligent methods for multi-modal data. This paper proposes a NAFLD diagnosis system (DeepFLDDiag) combining a comprehensive clinical dataset (FLDData) and a multi-modal learning based NAFLD prediction method (DeepFLD). The dataset includes over 6000 participants physical examinations, laboratory and imaging studies, extensive questionnaires, and facial images of partial participants, which is comprehensive and valuable for clinical studies. From the dataset, we quantitatively analyze and select clinical metadata that most contribute to NAFLD prediction. Furthermore, the proposed DeepFLD, a deep neural network model designed to predict NAFLD using multi-modal input, including metadata and facial images, outperforms the approach that only uses metadata. Satisfactory performance is also verified on other unseen datasets. Inspiringly, DeepFLD can achieve competitive results using only facial images as input rather than metadata, paving the way for a more robust and simpler non-invasive NAFLD diagnosis.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes