Islam Guven

IT
3papers
1citation
Novelty32%
AI Score37

3 Papers

1.1LGMar 11
UAV-MARL: Multi-Agent Reinforcement Learning for Time-Critical and Dynamic Medical Supply Delivery

Islam Guven, Mehmet Parlak

Unmanned aerial vehicles (UAVs) are increasingly used to support time-critical medical supply delivery, providing rapid and flexible logistics during emergencies and resource shortages. However, effective deployment of UAV fleets requires coordination mechanisms capable of prioritizing medical requests, allocating limited aerial resources, and adapting delivery schedules under uncertain operational conditions. This paper presents a multi-agent reinforcement learning (MARL) framework for coordinating UAV fleets in stochastic medical delivery scenarios where requests vary in urgency, location, and delivery deadlines. The problem is formulated as a partially observable Markov decision process (POMDP) in which UAV agents maintain awareness of medical delivery demands while having limited visibility of other agents due to communication and localization constraints. The proposed framework employs Proximal Policy Optimization (PPO) as the primary learning algorithm and evaluates several variants, including asynchronous extensions, classical actor--critic methods, and architectural modifications to analyze scalability and performance trade-offs. The model is evaluated using real-world geographic data from selected clinics and hospitals extracted from the OpenStreetMap dataset. The framework provides a decision-support layer that prioritizes medical tasks, reallocates UAV resources in real time, and assists healthcare personnel in managing urgent logistics. Experimental results show that classical PPO achieves superior coordination performance compared to asynchronous and sequential learning strategies, highlighting the potential of reinforcement learning for adaptive and scalable UAV-assisted healthcare logistics.

49.1SPApr 13
From Equations to Algorithms and Data: Transforming Microwave Engineering and Education with Machine Learning

Mehmet Parlak, Islam Guven

Conventional microwave engineering education relies heavily on analytical methods, canonical circuit topologies, and intuition-driven design, which have proven effective at microwave frequencies. However, as systems increasingly operate in the millimeter-wave and terahertz regimes, parasitic effects, process-dependent electromagnetic interactions, and ultra-wideband performance requirements challenge both topology/layout-constrained traditional design methodologies and existing teaching paradigms. This paper proposes a pedagogical shift in microwave and RFIC (Radio Frequency Integrated Circuit) engineering and education by introducing machine-learning (ML) and data-driven electromagnetic synthesis as a complementary design framework for microwave circuits such as power dividers and combiners, couplers, and baluns. Rather than emphasizing predefined topologies, the proposed approach enables topology-agnostic, performance-oriented exploration of the design space, allowing students to directly engage with electromagnetic behavior through specification-driven synthesis. By integrating machine-learning-based inverse design and multi-objective optimization into the curriculum, the framework enhances physical intuition, encourages design creativity, and better aligns microwave education with emerging industrial practices in high-frequency and ultra-wideband system design.

4.0ITMar 13
JCAS-MARL: Joint Communication and Sensing UAV Networks via Resource-Constrained Multi-Agent Reinforcement Learning

Islam Guven, Mehmet Parlak

Multi-UAV networks are increasingly deployed for large-scale inspection and monitoring missions, where operational performance depends on the coordination of sensing reliability, communication quality, and energy constraints. In particular, the rapid increase in overflowing waste bins and illegal dumping sites has created a need for efficient detection of waste hotspots. In this work, we introduce JCAS-MARL, a resource-aware multi-agent reinforcement learning (MARL) framework for joint communication and sensing (JCAS)-enabled UAV networks. Within this framework, multiple UAVs operate in a shared environment where each agent jointly controls its trajectory and the resource allocation of an OFDM waveform used simultaneously for sensing and communication. Battery consumption, charging behavior, and associated CO$_2$ emissions are incorporated into the system state to model realistic operational constraints. Information sharing occurs over a dynamic communication graph determined by UAV positions and wireless channel conditions. Waste hotspot detection requires consensus among multiple UAVs to improve reliability. Using this environment, we investigate how MARL policies exploit the sensing-communication-energy trade-off in JCAS-enabled UAV networks. Simulation results demonstrate that adaptive pilot-density control learned by the agents can outperform static configurations, particularly in scenarios where sensing accuracy and communication connectivity vary across the environment.