AI IMFeb 16, 2025

Solving Online Resource-Constrained Scheduling for Follow-Up Observation in Astronomy: a Reinforcement Learning Approach

Yajie Zhang, Ce Yu, Chao Sun, Jizeng Wei, Junhan Ju, Shanjiang Tang

arXiv:2502.11134v13.3h-index: 6Future generations computer systems

Originality Incremental advance

AI Analysis

This addresses the computationally challenging scheduling problem for astronomers managing telescope arrays, but it is incremental as it builds on existing reinforcement learning methods for a specific domain.

The paper tackles the problem of online resource-constrained scheduling for follow-up astronomical observations by proposing ROARS, a reinforcement learning approach that uses directed acyclic graphs and iterative local rewriting, and it shows that ROARS surpasses 5 popular heuristics in simulations based on real-world scenarios.

In the astronomical observation field, determining the allocation of observation resources of the telescope array and planning follow-up observations for targets of opportunity (ToOs) are indispensable components of astronomical scientific discovery. This problem is computationally challenging, given the online observation setting and the abundance of time-varying factors that can affect whether an observation can be conducted. This paper presents ROARS, a reinforcement learning approach for online astronomical resource-constrained scheduling. To capture the structure of the astronomical observation scheduling, we depict every schedule using a directed acyclic graph (DAG), illustrating the dependency of timing between different observation tasks within the schedule. Deep reinforcement learning is used to learn a policy that can improve the feasible solution by iteratively local rewriting until convergence. It can solve the challenge of obtaining a complete solution directly from scratch in astronomical observation scenarios, due to the high computational complexity resulting from numerous spatial and temporal constraints. A simulation environment is developed based on real-world scenarios for experiments, to evaluate the effectiveness of our proposed scheduling approach. The experimental results show that ROARS surpasses 5 popular heuristics, adapts to various observation scenarios and learns effective strategies with hindsight.

View on arXiv PDF

Similar