CVApr 30, 2025

A simple and effective approach for body part recognition on CT scans based on projection estimation

arXiv:2504.21810v12 citationsh-index: 31Sci Rep
Originality Incremental advance
AI Analysis

This work addresses the need for high-quality medical dataset construction in radiology by providing an effective method for identifying body regions in CT scans, though it is incremental as it builds on existing projection techniques.

The study tackled the problem of body part recognition on CT scans, which is challenging due to volumetric data and incomplete metadata, by proposing a simple approach based on 2D X-ray-like estimation of 3D scans, achieving an F1-Score of 0.980 ± 0.016 and outperforming other methods.

It is well known that machine learning models require a high amount of annotated data to obtain optimal performance. Labelling Computed Tomography (CT) data can be a particularly challenging task due to its volumetric nature and often missing and$/$or incomplete associated meta-data. Even inspecting one CT scan requires additional computer software, or in the case of programming languages $-$ additional programming libraries. This study proposes a simple, yet effective approach based on 2D X-ray-like estimation of 3D CT scans for body region identification. Although body region is commonly associated with the CT scan, it often describes only the focused major body region neglecting other anatomical regions present in the observed CT. In the proposed approach, estimated 2D images were utilized to identify 14 distinct body regions, providing valuable information for constructing a high-quality medical dataset. To evaluate the effectiveness of the proposed method, it was compared against 2.5D, 3D and foundation model (MI2) based approaches. Our approach outperformed the others, where it came on top with statistical significance and F1-Score for the best-performing model EffNet-B0 of 0.980 $\pm$ 0.016 in comparison to the 0.840 $\pm$ 0.114 (2.5D DenseNet-161), 0.854 $\pm$ 0.096 (3D VoxCNN), and 0.852 $\pm$ 0.104 (MI2 foundation model). The utilized dataset comprised three different clinical centers and counted 15,622 CT scans (44,135 labels).

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes