MLLGJan 25, 2022

Design choice and machine learning model performances

arXiv:2201.10239v213 citations
AI Analysis

It provides practical guidelines for practitioners in industrial settings, though it is incremental as it systematizes existing approaches without introducing new methods.

This paper tackles the lack of guidelines for jointly selecting experimental designs and machine learning models in industrial applications, finding that specific design-model combinations significantly impact predictive performance across various test functions and noise settings.

An increasing number of publications present the joint application of Design of Experiments (DOE) and machine learning (ML) as a methodology to collect and analyze data on a specific industrial phenomenon. However, the literature shows that the choice of the design for data collection and model for data analysis is often not driven by statistical or algorithmic advantages, thus there is a lack of studies which provide guidelines on what designs and ML models to jointly use for data collection and analysis. This article discusses the choice of design in relation to the ML model performances. A study is conducted that considers 12 experimental designs, 7 families of predictive models, 7 test functions that emulate physical processes, and 8 noise settings, both homoscedastic and heteroscedastic. The results of the research can have an immediate impact on the work of practitioners, providing guidelines for practical applications of DOE and ML.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes