LGMLJul 9, 2021

Multi-headed Neural Ensemble Search

arXiv:2107.04369v14 citations
Originality Incremental advance
AI Analysis

This work addresses the problem of high computational resources in ensemble search for machine learning practitioners, offering a more efficient incremental improvement.

The paper tackled the computational cost of Neural Ensemble Search (NES) by introducing multi-headed ensembles with a shared backbone and multiple prediction heads, enabling end-to-end training and one-shot NAS optimization, which resulted in finding robust ensembles 3 times faster while maintaining comparable performance to other methods.

Ensembles of CNN models trained with different seeds (also known as Deep Ensembles) are known to achieve superior performance over a single copy of the CNN. Neural Ensemble Search (NES) can further boost performance by adding architectural diversity. However, the scope of NES remains prohibitive under limited computational resources. In this work, we extend NES to multi-headed ensembles, which consist of a shared backbone attached to multiple prediction heads. Unlike Deep Ensembles, these multi-headed ensembles can be trained end to end, which enables us to leverage one-shot NAS methods to optimize an ensemble objective. With extensive empirical evaluations, we demonstrate that multi-headed ensemble search finds robust ensembles 3 times faster, while having comparable performance to other ensemble search methods, in both predictive performance and uncertainty calibration.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes