LG AIFeb 26

Model Agreement via Anchoring

Eric Eaton, Surbhi Goel, Marcel Hussing, Michael Kearns, Aaron Roth, Sikata Bela Sengupta, Jessica Sorrell

arXiv:2602.23360v11.4h-index: 69

Originality Incremental advance

AI Analysis

This work provides theoretical guarantees for reducing model disagreement, which is important for researchers and practitioners concerned with the robustness and reliability of machine learning models. It is an incremental theoretical contribution.

This paper tackles the problem of model disagreement, defined as the expected squared difference in predictions between two models trained independently. They introduce a technique called 'anchoring' to prove that disagreement can be driven to zero for stacked aggregation (with increasing models k), gradient boosting (with increasing iterations k), neural networks with architecture search (with increasing architecture size n), and regression trees (with increasing depth d).

Numerous lines of aim to control $\textit{model disagreement}$ -- the extent to which two machine learning models disagree in their predictions. We adopt a simple and standard notion of model disagreement in real-valued prediction problems, namely the expected squared difference in predictions between two models trained on independent samples, without any coordination of the training processes. We would like to be able to drive disagreement to zero with some natural parameter(s) of the training procedure using analyses that can be applied to existing training methodologies. We develop a simple general technique for proving bounds on independent model disagreement based on $\textit{anchoring}$ to the average of two models within the analysis. We then apply this technique to prove disagreement bounds for four commonly used machine learning algorithms: (1) stacked aggregation over an arbitrary model class (where disagreement is driven to 0 with the number of models $k$ being stacked) (2) gradient boosting (where disagreement is driven to 0 with the number of iterations $k$) (3) neural network training with architecture search (where disagreement is driven to 0 with the size $n$ of the architecture being optimized over) and (4) regression tree training over all regression trees of fixed depth (where disagreement is driven to 0 with the depth $d$ of the tree architecture). For clarity, we work out our initial bounds in the setting of one-dimensional regression with squared error loss -- but then show that all of our results generalize to multi-dimensional regression with any strongly convex loss.

View on arXiv PDF

Similar