MLSTAT-MECHDATA-ANAug 18, 2017

The Stochastic Replica Approach to Machine Learning: Stability and Parameter Optimization

arXiv:1708.05715v35 citations
Originality Incremental advance
AI Analysis

This work addresses classification and regression problems in machine learning by offering a method that avoids issues like class imbalance, though it appears incremental as it builds on existing statistical physics concepts.

The authors introduced a statistical physics-inspired supervised learning algorithm for classification and regression, based on stability of predictions using stochastic expansions, which achieved high accuracy on 10 diverse datasets and allowed automatic parameter optimization.

We introduce a statistical physics inspired supervised machine learning algorithm for classification and regression problems. The method is based on the invariances or stability of predicted results when known data is represented as expansions in terms of various stochastic functions. The algorithm predicts the classification/regression values of new data by combining (via voting) the outputs of these numerous linear expansions in randomly chosen functions. The few parameters (typically only one parameter is used in all studied examples) that this model has may be automatically optimized. The algorithm has been tested on 10 diverse training data sets of various types and feature space dimensions. It has been shown to consistently exhibit high accuracy and readily allow for optimization of parameters, while simultaneously avoiding pitfalls of existing algorithms such as those associated with class imbalance. We very briefly speculate on whether spatial coordinates in physical theories may be viewed as emergent "features" that enable a robust machine learning type description of data with generic low order smooth functions.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes