NA LG FLU-DYNJul 9, 2024

Weak baselines and reporting biases lead to overoptimism in machine learning for fluid-related partial differential equations

arXiv:2407.07218v125.6164 citationsh-index: 6

Originality Synthesis-oriented

AI Analysis

This paper highlights reproducibility issues in ML for computational physics, which could mislead researchers and slow progress in applying ML to fluid-related PDEs.

The authors found that 79% of ML-based PDE solver papers claiming superiority over standard methods used weak baselines, and that reporting biases are widespread, leading to overoptimistic results in the field.

One of the most promising applications of machine learning (ML) in computational physics is to accelerate the solution of partial differential equations (PDEs). The key objective of ML-based PDE solvers is to output a sufficiently accurate solution faster than standard numerical methods, which are used as a baseline comparison. We first perform a systematic review of the ML-for-PDE solving literature. Of articles that use ML to solve a fluid-related PDE and claim to outperform a standard numerical method, we determine that 79% (60/76) compare to a weak baseline. Second, we find evidence that reporting biases, especially outcome reporting bias and publication bias, are widespread. We conclude that ML-for-PDE solving research is overoptimistic: weak baselines lead to overly positive results, while reporting biases lead to underreporting of negative results. To a large extent, these issues appear to be caused by factors similar to those of past reproducibility crises: researcher degrees of freedom and a bias towards positive results. We call for bottom-up cultural changes to minimize biased reporting as well as top-down structural reforms intended to reduce perverse incentives for doing so.

View on arXiv PDF

Similar