LG MLJun 15, 2020

Neural Ensemble Search for Uncertainty Estimation and Dataset Shift

Sheheryar Zaidi, Arber Zela, Thomas Elsken, Chris Holmes, Frank Hutter, Yee Whye Teh

arXiv:2006.08573v326.289 citationsHas Code

Originality Incremental advance

AI Analysis

This work addresses uncertainty estimation and robustness for machine learning practitioners, offering an incremental improvement over existing ensemble methods.

The paper tackled the problem of improving neural network ensembles for uncertainty estimation and dataset shift by proposing methods to automatically construct ensembles with varying architectures, showing that these ensembles outperform deep ensembles in accuracy, uncertainty calibration, and robustness, with evidence of higher diversity.

Ensembles of neural networks achieve superior performance compared to stand-alone networks in terms of accuracy, uncertainty calibration and robustness to dataset shift. \emph{Deep ensembles}, a state-of-the-art method for uncertainty estimation, only ensemble random initializations of a \emph{fixed} architecture. Instead, we propose two methods for automatically constructing ensembles with \emph{varying} architectures, which implicitly trade-off individual architectures' strengths against the ensemble's diversity and exploit architectural variation as a source of diversity. On a variety of classification tasks and modern architecture search spaces, we show that the resulting ensembles outperform deep ensembles not only in terms of accuracy but also uncertainty calibration and robustness to dataset shift. Our further analysis and ablation studies provide evidence of higher ensemble diversity due to architectural variation, resulting in ensembles that can outperform deep ensembles, even when having weaker average base learners. To foster reproducibility, our code is available: \url{https://github.com/automl/nes}

View on arXiv PDF Code

Similar