ML LGJan 25, 2022

Design choice and machine learning model performances

Rosa Arboretti, Riccardo Ceccato, Luca Pegoraro, Luigi Salmaso

arXiv:2201.10239v213 citations

AI Analysis

It provides practical guidelines for practitioners in industrial settings, though it is incremental as it systematizes existing approaches without introducing new methods.

This paper tackles the lack of guidelines for jointly selecting experimental designs and machine learning models in industrial applications, finding that specific design-model combinations significantly impact predictive performance across various test functions and noise settings.

An increasing number of publications present the joint application of Design of Experiments (DOE) and machine learning (ML) as a methodology to collect and analyze data on a specific industrial phenomenon. However, the literature shows that the choice of the design for data collection and model for data analysis is often not driven by statistical or algorithmic advantages, thus there is a lack of studies which provide guidelines on what designs and ML models to jointly use for data collection and analysis. This article discusses the choice of design in relation to the ML model performances. A study is conducted that considers 12 experimental designs, 7 families of predictive models, 7 test functions that emulate physical processes, and 8 noise settings, both homoscedastic and heteroscedastic. The results of the research can have an immediate impact on the work of practitioners, providing guidelines for practical applications of DOE and ML.

View on arXiv PDF

Similar