LG MLJul 9, 2021

Multi-headed Neural Ensemble Search

Ashwin Raaghav Narayanan, Arber Zela, Tonmoy Saikia, Thomas Brox, Frank Hutter

arXiv:2107.04369v19.94 citations

Originality Incremental advance

AI Analysis

This work addresses the problem of high computational resources in ensemble search for machine learning practitioners, offering a more efficient incremental improvement.

The paper tackled the computational cost of Neural Ensemble Search (NES) by introducing multi-headed ensembles with a shared backbone and multiple prediction heads, enabling end-to-end training and one-shot NAS optimization, which resulted in finding robust ensembles 3 times faster while maintaining comparable performance to other methods.

Ensembles of CNN models trained with different seeds (also known as Deep Ensembles) are known to achieve superior performance over a single copy of the CNN. Neural Ensemble Search (NES) can further boost performance by adding architectural diversity. However, the scope of NES remains prohibitive under limited computational resources. In this work, we extend NES to multi-headed ensembles, which consist of a shared backbone attached to multiple prediction heads. Unlike Deep Ensembles, these multi-headed ensembles can be trained end to end, which enables us to leverage one-shot NAS methods to optimize an ensemble objective. With extensive empirical evaluations, we demonstrate that multi-headed ensemble search finds robust ensembles 3 times faster, while having comparable performance to other ensemble search methods, in both predictive performance and uncertainty calibration.

View on arXiv PDF

Similar