Sasinee Pruekprasert

h-index6

5papers

43citations

Novelty39%

AI Score41

Ranked #69,744 of 194,257 authors (top 36%)#411 in SY (top 25%)

5 Papers

11.8LGJul 27, 2022

Dynamic Shielding for Reinforcement Learning in Black-Box Environments

Masaki Waga, Ezequiel Castellano, Sasinee Pruekprasert et al.

It is challenging to use reinforcement learning (RL) in cyber-physical systems due to the lack of safety guarantees during learning. Although there have been various proposals to reduce undesired behaviors during learning, most of these techniques require prior system knowledge, and their applicability is limited. This paper aims to reduce undesired behaviors during learning without requiring any prior system knowledge. We propose dynamic shielding: an extension of a model-based safe RL technique called shielding using automata learning. The dynamic shielding technique constructs an approximate system model in parallel with RL using a variant of the RPNI algorithm and suppresses undesired explorations due to the shield constructed from the learned model. Through this combination, potentially unsafe actions can be foreseen before the agent experiences them. Experiments show that our dynamic shield significantly decreases the number of undesired events during training.

1.2SYOct 3, 2019

Decision Making for Autonomous Vehicles at Unsignalized Intersection in Presence of Malicious Vehicles

Sasinee Pruekprasert, Xiaoyi Zhang, Jérémy Dubut et al.

In this paper, we investigate the decision making of autonomous vehicles in an unsignalized intersection in presence of malicious vehicles, which are vehicles that do not respect the law by not using the proper rules of the right of way. Each vehicle computes its control input as a Nash equilibrium of a game determined by the priority order based on its own belief: each of non-malicious vehicle bases its order on the law, while a malicious one considers itself as having priority. To illustrate our method, we provide numerical simulations, with different scenarios given by different cases of malicious vehicles.

8.2SYMay 22

From Visual to Digital: Coordination Scheduling and Its Effect on Safety and Efficiency in UAM Corridors

Akihiro Fujita, Sasinee Pruekprasert, Katsuhiro Nishinari et al.

This paper explores scalable coordination strategies for urban air mobility (UAM) corridors by comparing two representative approaches. The first, inspired by visual flight rules (VFR), is a local coordination strategy relying on spatial information available to each vehicle. The second, conceptually aligned with digital flight rules (DFR), is a global coordination strategy based on shared estimated times of arrival (ETAs) at constrained waypoints (CWPs). To support this comparison, we introduce a lightweight disturbance-avoidance mechanism that enables vehicles to adjust their ETAs in response to forecasted disruptions using shared information. We evaluate these approaches through numerical simulations under varying disturbance levels, comparing the locally reactive VFR-style scheme with the globally coordinated DFR-style scheme. Results show that VFR achieves high throughput in low-traffic scenarios but becomes increasingly prone to collisions at higher traffic densities unless conservative separation is enforced, which reduces traffic efficiency. In contrast, DFR maintains more consistent safety performance and traffic efficiency, even under moderate ETA update propagation delays. These findings highlight the advantages of DFR-style global coordination in managing high-density air traffic control (ATC) operations within UAM corridors.

8.7SYMay 22

Safety-Assured Arrival Scheduling in Sequential UAM Corridor Sections under Speed and Separation Constraints

Sasinee Pruekprasert, Shinji Nakadai, Katsuhiro Nishinari

This paper presents a safety-assured arrival-scheduling framework for Urban Air Mobility (UAM) corridor operations. We propose an analytical method to compute a sufficient ETA gap at Constrained Waypoints (CWPs) that guarantees longitudinal separation along sequential corridor sections with heterogeneous speed limits. The resulting ETA-gap condition depends on section-specific speed bounds and the required separation distance, providing an efficiently computable rule suitable for integration into future digital ETA-scheduling and air traffic management systems. We show that the computed ETA gap ensures safe separation across all corridor sections under prescribed section travel times and speed limits. Numerical simulations for a decreasing-speed corridor confirm that vehicles coordinated with the proposed mechanism adjust their speeds to maintain the required spacing, avoid potential collisions, and support improved traffic flow compared with unscheduled operations.

2.3SYMay 21, 2020

A Game-Theoretic Approach to Decision Making for Multiple Vehicles at Roundabout

Sasinee Pruekprasert, Jérémy Dubut, Xiaoyi Zhang et al.

In this paper, we study the decision making of multiple autonomous vehicles at a roundabout. The behaviours of the vehicles depend on their aggressiveness, which indicates how much they value speed over safety. We propose a distributed decision-making process that balances safety and speed of the vehicles. In the proposed process, each vehicle estimates other vehicles' aggressiveness and formulates the interactions among the vehicles as a finite sequential game. Based on the Nash equilibrium of this game, the vehicle predicts other vehicles' behaviours and makes decisions. We perform numerical simulations to illustrate the effectiveness of the proposed process, both for safety (absence of collisions), and speed (time spent within the roundabout).