LG CVJul 18, 2023

An Evaluation of Zero-Cost Proxies -- from Neural Architecture Performance to Model Robustness

Jovita Lukasik, Michael Moeller, Margret Keuper

arXiv:2307.09365v19.88 citationsh-index: 17

Originality Incremental advance

AI Analysis

This work addresses the problem of efficiently searching for robust neural architectures in neural architecture search, but it is incremental as it builds on existing zero-cost proxy methods.

The paper evaluated zero-cost proxies for predicting neural architecture performance, finding that predicting model robustness is more challenging than clean accuracy and requires combining multiple proxies, while clean accuracy can be predicted from a single proxy.

Zero-cost proxies are nowadays frequently studied and used to search for neural architectures. They show an impressive ability to predict the performance of architectures by making use of their untrained weights. These techniques allow for immense search speed-ups. So far the joint search for well-performing and robust architectures has received much less attention in the field of NAS. Therefore, the main focus of zero-cost proxies is the clean accuracy of architectures, whereas the model robustness should play an evenly important part. In this paper, we analyze the ability of common zero-cost proxies to serve as performance predictors for robustness in the popular NAS-Bench-201 search space. We are interested in the single prediction task for robustness and the joint multi-objective of clean and robust accuracy. We further analyze the feature importance of the proxies and show that predicting the robustness makes the prediction task from existing zero-cost proxies more challenging. As a result, the joint consideration of several proxies becomes necessary to predict a model's robustness while the clean accuracy can be regressed from a single such feature.

View on arXiv PDF

Similar