Explainable Binary Classification of Separable Shape Ensembles
This addresses the need for principled shape analysis in materials science and related fields, offering an incremental improvement through novel formalisms for pattern recognition.
The paper tackled the problem of quantifying differences in distributions of shapes from segmented curves in materials science, resulting in a method that enables explainable binary classification of shape ensembles with detection of discrepancies below visual inspection, building interpretable feature spaces in seconds without high-performance computation.
Materials scientists utilize image segmentation of micrographs to create large curve ensembles representing grain boundaries of material microstructures. Observations of these collections of shapes can facilitate inferences about material properties and manufacturing processes. We seek to bolster this application, and related engineering/scientific tasks, using novel pattern recognition formalisms and inference over large ensembles of segmented curves -- i.e., facilitate principled assessments for quantifying differences in distributions of shapes. To this end, we apply a composite integral operator to motivate accurate and efficient numerical representations of discrete planar curves over matrix manifolds. The main result is a rigid-invariant orthonormal decomposition of curve component functions into separable forms of scale variations and complementary features of undulation. We demonstrate how these separable shape tensors -- given thousands of curves in an ensemble -- can inform explainable binary classification of segmented images by utilizing a product maximum mean discrepancy to distinguish the shape distributions; absent labelled data, building interpretable feature spaces in seconds without high performance computation, and detecting discrepancies below cursory visual inspections.