LGDec 5, 2025

Enhancing Dimensionality Prediction in Hybrid Metal Halides via Feature Engineering and Class-Imbalance Mitigation

arXiv:2512.05367v1
Originality Synthesis-oriented
AI Analysis

This work addresses the challenge of imbalanced data in predicting material properties for researchers in materials science, but it is incremental as it applies existing techniques to a specific domain.

The paper tackled the problem of predicting structural dimensionality in hybrid metal halides using a machine learning framework with feature engineering and class-imbalance mitigation, achieving improved F1-scores for underrepresented classes through robust cross-validation.

We present a machine learning framework for predicting the structural dimensionality of hybrid metal halides (HMHs), including organic-inorganic perovskites, using a combination of chemically-informed feature engineering and advanced class-imbalance handling techniques. The dataset, consisting of 494 HMH structures, is highly imbalanced across dimensionality classes (0D, 1D, 2D, 3D), posing significant challenges to predictive modeling. This dataset was later augmented to 1336 via the Synthetic Minority Oversampling Technique (SMOTE) to mitigate the effects of the class imbalance. We developed interaction-based descriptors and integrated them into a multi-stage workflow that combines feature selection, model stacking, and performance optimization to improve dimensionality prediction accuracy. Our approach significantly improves F1-scores for underrepresented classes, achieving robust cross-validation performance across all dimensionalities.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes