CVJan 25, 2022

Comparison of Evaluation Metrics for Landmark Detection in CMR Images

arXiv:2201.10410v26 citationsHas Code
AI Analysis

This work is incremental, focusing on improving evaluation standards for landmark detection in cardiac imaging, which is important for researchers and clinicians in medical image analysis.

The study addressed the problem of evaluating landmark detection in cardiac MRI by comparing different metrics and identifying pitfalls in existing evaluation approaches, finding that metric choice significantly affects method rankings and highlighting the need for standardized evaluation protocols.

Cardiac Magnetic Resonance (CMR) images are widely used for cardiac diagnosis and ventricular assessment. Extracting specific landmarks like the right ventricular insertion points is of importance for spatial alignment and 3D modeling. The automatic detection of such landmarks has been tackled by multiple groups using Deep Learning, but relatively little attention has been paid to the failure cases of evaluation metrics in this field. In this work, we extended the public ACDC dataset with additional labels of the right ventricular insertion points and compare different variants of a heatmap-based landmark detection pipeline. In this comparison, we demonstrate very likely pitfalls of apparently simple detection and localisation metrics which highlights the importance of a clear detection strategy and the definition of an upper limit for localisation-based metrics. Our preliminary results indicate that a combination of different metrics is necessary, as they yield different winners for method comparison. Additionally, they highlight the need of a comprehensive metric description and evaluation standardisation, especially for the error cases where no metrics could be computed or where no lower/upper boundary of a metric exists. Code and labels: https://github.com/Cardio-AI/rvip_landmark_detection

Code Implementations4 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes