CVAILGFeb 11, 2023

Operation-level Progressive Differentiable Architecture Search

arXiv:2302.05632v16 citationsh-index: 14Has Code
Originality Incremental advance
AI Analysis

This work addresses a critical bottleneck in neural architecture search for machine learning researchers, offering an incremental improvement over existing DARTS methods.

The paper tackles the instability and performance collapse in Differentiable Neural Architecture Search (DARTS) due to skip connections aggregation, proposing operation-level progressive DARTS (OPP-DARTS) to alleviate this issue and explore better architectures, achieving superior performance on CIFAR-10 and improved robustness compared to standard DARTS.

Differentiable Neural Architecture Search (DARTS) is becoming more and more popular among Neural Architecture Search (NAS) methods because of its high search efficiency and low compute cost. However, the stability of DARTS is very inferior, especially skip connections aggregation that leads to performance collapse. Though existing methods leverage Hessian eigenvalues to alleviate skip connections aggregation, they make DARTS unable to explore architectures with better performance. In the paper, we propose operation-level progressive differentiable neural architecture search (OPP-DARTS) to avoid skip connections aggregation and explore better architectures simultaneously. We first divide the search process into several stages during the search phase and increase candidate operations into the search space progressively at the beginning of each stage. It can effectively alleviate the unfair competition between operations during the search phase of DARTS by offsetting the inherent unfair advantage of the skip connection over other operations. Besides, to keep the competition between operations relatively fair and select the operation from the candidate operations set that makes training loss of the supernet largest. The experiment results indicate that our method is effective and efficient. Our method's performance on CIFAR-10 is superior to the architecture found by standard DARTS, and the transferability of our method also surpasses standard DARTS. We further demonstrate the robustness of our method on three simple search spaces, i.e., S2, S3, S4, and the results show us that our method is more robust than standard DARTS. Our code is available at https://github.com/zxunyu/OPP-DARTS.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes