LGSPMLJan 6, 2020

Experimental Analysis of Reinforcement Learning Techniques for Spectrum Sharing Radar

arXiv:2001.01799v213 citations
Originality Synthesis-oriented
AI Analysis

This addresses spectrum sharing for radar systems in congested environments, but it is incremental as it compares existing RL methods without introducing new algorithms.

The paper tackles the problem of enabling radar systems to share congested spectrum with uncooperative communications by applying reinforcement learning techniques, showing that these methods outperform a sense-and-avoid scheme in terms of convergence and detection performance while sharing 100MHz spectrum.

In this work, we first describe a framework for the application of Reinforcement Learning (RL) control to a radar system that operates in a congested spectral setting. We then compare the utility of several RL algorithms through a discussion of experiments performed on Commercial off-the-shelf (COTS) hardware. Each RL technique is evaluated in terms of convergence, radar detection performance achieved in a congested spectral environment, and the ability to share 100MHz spectrum with an uncooperative communications system. We examine policy iteration, which solves an environment posed as a Markov Decision Process (MDP) by directly solving for a stochastic mapping between environmental states and radar waveforms, as well as Deep RL techniques, which utilize a form of Q-Learning to approximate a parameterized function that is used by the radar to select optimal actions. We show that RL techniques are beneficial over a Sense-and-Avoid (SAA) scheme and discuss the conditions under which each approach is most effective.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes