LGJun 8, 2025

Predicting Anthropometric Body Composition Variables Using 3D Optical Imaging and Machine Learning

arXiv:2506.14815v1h-index: 20
Originality Synthesis-oriented
AI Analysis

This provides a more accessible method for early disease diagnosis in healthcare by reducing reliance on expensive DXA scans, though it is incremental as it applies existing machine learning techniques to a new domain.

This work tackled the problem of predicting anthropometric body composition variables like Appendicular Lean Mass (ALM), Body Fat Percentage (BFP), and Bone Mineral Density (BMD) by using 3D optical imaging and machine learning as an alternative to costly DXA scans, achieving errors of ~8% for ALM and BMD with SVR and ~11% for BFP with Least Squares SVR when trained on 80% of the data.

Accurate prediction of anthropometric body composition variables, such as Appendicular Lean Mass (ALM), Body Fat Percentage (BFP), and Bone Mineral Density (BMD), is essential for early diagnosis of several chronic diseases. Currently, researchers rely on Dual-Energy X-ray Absorptiometry (DXA) scans to measure these metrics; however, DXA scans are costly and time-consuming. This work proposes an alternative to DXA scans by applying statistical and machine learning models on biomarkers (height, volume, left calf circumference, etc) obtained from 3D optical images. The dataset consists of 847 patients and was sourced from Pennington Biomedical Research Center. Extracting patients' data in healthcare faces many technical challenges and legal restrictions. However, most supervised machine learning algorithms are inherently data-intensive, requiring a large amount of training data. To overcome these limitations, we implemented a semi-supervised model, the $p$-Laplacian regression model. This paper is the first to demonstrate the application of a $p$-Laplacian model for regression. Our $p$-Laplacian model yielded errors of $\sim13\%$ for ALM, $\sim10\%$ for BMD, and $\sim20\%$ for BFP when the training data accounted for 10 percent of all data. Among the supervised algorithms we implemented, Support Vector Regression (SVR) performed the best for ALM and BMD, yielding errors of $\sim 8\%$ for both, while Least Squares SVR performed the best for BFP with $\sim 11\%$ error when trained on 80 percent of the data. Our findings position the $p$-Laplacian model as a promising tool for healthcare applications, particularly in a data-constrained environment.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes