CVMTRL-SCIJan 14, 2025

Predicting Performance of Object Detection Models in Electron Microscopy Using Random Forests

arXiv:2501.08465v12 citationsh-index: 10Digital Discovery
Originality Synthesis-oriented
AI Analysis

It addresses the need for users to estimate model reliability and assess domain shifts in applied machine learning for defect detection in TEM images, though it is incremental as it adapts existing methods to a specific domain.

This study tackled the problem of quantifying prediction uncertainty for object detection models in electron microscopy by developing a random forest regression model to predict F1 scores, achieving a mean absolute error of 0.09 and an R² score of 0.77 on test data.

Quantifying prediction uncertainty when applying object detection models to new, unlabeled datasets is critical in applied machine learning. This study introduces an approach to estimate the performance of deep learning-based object detection models for quantifying defects in transmission electron microscopy (TEM) images, focusing on detecting irradiation-induced cavities in TEM images of metal alloys. We developed a random forest regression model that predicts the object detection F1 score, a statistical metric used to evaluate the ability to accurately locate and classify objects of interest. The random forest model uses features extracted from the predictions of the object detection model whose uncertainty is being quantified, enabling fast prediction on new, unlabeled images. The mean absolute error (MAE) for predicting F1 of the trained model on test data is 0.09, and the $R^2$ score is 0.77, indicating there is a significant correlation between the random forest regression model predicted and true defect detection F1 scores. The approach is shown to be robust across three distinct TEM image datasets with varying imaging and material domains. Our approach enables users to estimate the reliability of a defect detection and segmentation model predictions and assess the applicability of the model to their specific datasets, providing valuable information about possible domain shifts and whether the model needs to be fine-tuned or trained on additional data to be maximally effective for the desired use case.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes