LGPFNov 21, 2022

HARL: Hierarchical Adaptive Reinforcement Learning Based Auto Scheduler for Neural Networks

arXiv:2211.11172v16 citationsh-index: 11
Originality Incremental advance
AI Analysis

This addresses the need for faster and more efficient auto-scheduling in neural network deployment, particularly for applications like natural language processing and auto-driving, though it is an incremental improvement over existing auto-schedulers.

The paper tackles the problem of slow auto-scheduling for neural network tensor programs by proposing HARL, a hierarchical adaptive reinforcement learning-based auto-scheduler, which improves tensor operator performance by 22% and search speed by 4.3x compared to state-of-the-art methods.

To efficiently perform inference with neural networks, the underlying tensor programs require sufficient tuning efforts before being deployed into production environments. Usually, enormous tensor program candidates need to be sufficiently explored to find the one with the best performance. This is necessary to make the neural network products meet the high demand of real-world applications such as natural language processing, auto-driving, etc. Auto-schedulers are being developed to avoid the need for human intervention. However, due to the gigantic search space and lack of intelligent search guidance, current auto-schedulers require hours to days of tuning time to find the best-performing tensor program for the entire neural network. In this paper, we propose HARL, a reinforcement learning (RL) based auto-scheduler specifically designed for efficient tensor program exploration. HARL uses a hierarchical RL architecture in which learning-based decisions are made at all different levels of search granularity. It also automatically adjusts exploration configurations in real-time for faster performance convergence. As a result, HARL improves the tensor operator performance by 22% and the search speed by 4.3x compared to the state-of-the-art auto-scheduler. Inference performance and search speed are also significantly improved on end-to-end neural networks.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes