CVJan 30, 2025

Simulation of microstructures and machine learning

arXiv:2501.18313v11 citationsh-index: 20Springer Proceedings in Mathematics & Statistics
Originality Incremental advance
AI Analysis

This work tackles data scarcity in industrial and materials science imaging, but it is incremental as it builds on existing synthetic data generation methods.

The paper addresses the challenge of limited training data for machine learning in image processing by proposing synthetic images generated from stochastic geometry models as a solution, applied to optical quality control and 3D concrete crack segmentation.

Machine learning offers attractive solutions to challenging image processing tasks. Tedious development and parametrization of algorithmic solutions can be replaced by training a convolutional neural network or a random forest with a high potential to generalize. However, machine learning methods rely on huge amounts of representative image data along with a ground truth, usually obtained by manual annotation. Thus, limited availability of training data is a critical bottleneck. We discuss two use cases: optical quality control in industrial production and segmenting crack structures in 3D images of concrete. For optical quality control, all defect types have to be trained but are typically not evenly represented in the training data. Additionally, manual annotation is costly and often inconsistent. It is nearly impossible in the second case: segmentation of crack systems in 3D images of concrete. Synthetic images, generated based on realizations of stochastic geometry models, offer an elegant way out. A wide variety of structure types can be generated. The within structure variation is naturally captured by the stochastic nature of the models and the ground truth is for free. Many new questions arise. In particular, which characteristics of the real image data have to be met to which degree of fidelity.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes