Vijay K. Shah

NI
h-index9
17papers
167citations
Novelty45%
AI Score53

17 Papers

91.0NIMay 23Code
OpenTwin: Digital Twin Driven Closed Loop KPM Inference and Control for Open RAN

Md Sharif Hossen, Zifan Zhang, Dara Ron et al.

The open radio access network (O-RAN) RAN intelligent controller (RIC) hosts data-driven xApps and rApps to optimize network performance. However, two challenges hinder ML-driven xApp/rApp development: (i) key performance metric (KPM) data scarcity caused by interface latency, and (ii) network disruption risks when testing and validating AI models directly on live networks. We develop OpenTwin, a digital twin framework built on an open-source O-RAN simulator (ns-O-RAN-flexRIC) and KPM streaming via the O1 interface, deployed within the non-RT RIC. OpenTwin uses a two-step ML approach: an XGBoost model that learns time-varying network behavior to generate simulator configuration parameters, followed by a time-aware recursive least squares (RLS) tuner that continuously corrects KPM deviations between the twin and real-world measurements. A deviation-aware scoring mechanism monitors twin fidelity and automatically triggers resynchronization upon detecting network drift. We demonstrate OpenTwin with an energy-saving xApp that validates control policies in the virtual space before applying reconfigurations to the physical network. Experimental results show that OpenTwin mirrors real-world KPMs with up to 96% accuracy and enables the xApp to significantly reduce energy consumption without disrupting live operations.

SPMar 6, 2023
Keep It Simple: CNN Model Complexity Studies for Interference Classification Tasks

Taiwo Oyedare, Vijay K. Shah, Daniel J. Jakubisin et al.

The growing number of devices using the wireless spectrum makes it important to find ways to minimize interference and optimize the use of the spectrum. Deep learning models, such as convolutional neural networks (CNNs), have been widely utilized to identify, classify, or mitigate interference due to their ability to learn from the data directly. However, there have been limited research on the complexity of such deep learning models. The major focus of deep learning-based wireless classification literature has been on improving classification accuracy, often at the expense of model complexity. This may not be practical for many wireless devices, such as, internet of things (IoT) devices, which usually have very limited computational resources and cannot handle very complex models. Thus, it becomes important to account for model complexity when designing deep learning-based models for interference classification. To address this, we conduct an analysis of CNN based wireless classification that explores the trade-off amongst dataset size, CNN model complexity, and classification accuracy under various levels of classification difficulty: namely, interference classification, heterogeneous transmitter classification, and homogeneous transmitter classification. Our study, based on three wireless datasets, shows that a simpler CNN model with fewer parameters can perform just as well as a more complex model, providing important insights into the use of CNNs in computationally constrained applications.

47.0NIMay 23
Analysis of Altitude-Dependent Electronic Conspicuity in Cellular-Connected UAVs

Md Sharif Hossen, Vijay K. Shah, Ismail Guvenc

Unmanned aerial vehicles (UAVs) are increasingly integrated into cellular networks to support emerging Internet of Things (IoT) applications. In such settings, reliable communication is critical for electronic conspicuity (EC), enabling UAV detection and tracking in shared airspace. However, UAVs operate at elevated altitudes where enhanced line-of-sight (LOS) visibility leads to simultaneous exposure to multiple base stations, resulting in strong inter-cell interference. This article presents a system-level analysis of how UAV altitude influences the radio environment and affects EC reliability. Using spatial and network-level metrics, including serving distance, association behavior, and aggregate received power, we show that increasing altitude leads to stronger multi-cell interaction, reduced dominance of nearby sectors, and interference-dominated connectivity. These effects result in fragmented association regions and increased variability in link performance. The analysis is supported by measurement data from a helikite-based spectrum monitoring campaign and corresponding simulation results. Despite differences in experimental conditions, both approaches exhibit consistent altitude-dependent trends. These findings provide practical insights for designing altitude-aware and interference-aware cellular systems to support reliable UAV operation.

55.7NIMay 23
Altitude-Dependent RSRP and RSRQ Trade-offs in 5G NR UAV Networks

Md Sharif Hossen, Vijay K. Shah, Ismail Guvenc

Cellular-connected unmanned aerial vehicles (UAVs) in 5G NR networks experience propagation and interference conditions that vary significantly with altitude and differ substantially from those experienced by terrestrial users. This is primarily caused by the down-tilted antenna sectors in 5G NR networks, which cause UAVs to be served (and interfered with) by the sidelobes. In this paper, we develop a 3GPP-compliant system-level framework for the consistent characterization of key performance indicators (KPIs) such as reference signal received power (RSRP), reference signal received quality (RSRQ), and signal-to-interference-and-noise ratio (SINR) in a multi-site tri-sector deployment with realistic antenna patterns and probabilistic models for line-of-sight (LOS) and non-LOS (NLOS) conditions. Simulation results demonstrate that a critical transition for aerial users is experienced when going from coverage-limited to interference-limited conditions at higher altitudes. Although RSRP is affected by large-scale propagation characteristics and degrades gradually with increasing altitude and inter-site distance (ISD), SINR degrades much faster due to increased interference caused by LOS conditions. On the contrary, increasing ISD improves SINR and RSRQ due to lower interference, even as received power is reduced.

ITMar 8, 2022
A Practical AoI Scheduler in IoT Networks with Relays

Biplav Choudhury, Prasenjit Karmakar, Vijay K. Shah et al.

Internet of Things (IoT) networks have become ubiquitous as autonomous computing, communication and collaboration among devices become popular for accomplishing various tasks. The use of relays in IoT networks further makes it convenient to deploy IoT networks as relays provide a host of benefits, like increasing the communication range and minimizing power consumption. Existing literature on traditional AoI schedulers for such two-hop relayed IoT networks are limited because they are designed assuming constant/non-changing channel conditions and known (usually, generate-at-will) packet generation patterns. Deep reinforcement learning (DRL) algorithms have been investigated for AoI scheduling in two-hop IoT networks with relays, however, they are only applicable for small-scale IoT networks due to exponential rise in action space as the networks become large. These limitations discourage the practical utilization of AoI schedulers for IoT network deployments. This paper presents a practical AoI scheduler for two-hop IoT networks with relays that addresses the above limitations. The proposed scheduler utilizes a novel voting mechanism based proximal policy optimization (v-PPO) algorithm that maintains a linear action space, enabling it be scale well with larger IoT networks. The proposed v-PPO based AoI scheduler adapts well to changing network conditions and accounts for unknown traffic generation patterns, making it practical for real-world IoT deployments. Simulation results show that the proposed v-PPO based AoI scheduler outperforms both ML and traditional (non-ML) AoI schedulers, such as, Deep Q Network (DQN)-based AoI Scheduler, Maximal Age First-Maximal Age Difference (MAF-MAD), MAF (Maximal Age First) , and round-robin in all considered practical scenarios.

78.7SYMay 22
Advanced AI Service Provisioning in O-RAN through LLM Engine Integration

Seyed Bagher Hashemi Natanzi, Pranshav Gajja, Bo Tang et al.

The Open Radio Access Network (O-RAN) architecture allows AI to be embedded directly into the RAN through modular xApps and rApps, yet creating these applications collecting data, training models, writing code, and deploying them safely remains slow and largely manual. Large Language Models (LLMs) offer strong reasoning and code-generation capabilities but are unsuited for the fast, deterministic inference required in real-time RAN control. We present a proof-of-concept Dual-Brain architecture that combines both strengths: an LLM-based orchestrator translates operator intents into data-collection policies and deployment code, while an automated ML engine, NeuralSmith, trains lightweight classifiers on demand via an API. We describe the architecture and provisioning workflow, share practical insights from a containerized O-RAN 5G~SA testbed, and discuss open research directions.

NIJul 8, 2024
ORAN-Bench-13K: An Open Source Benchmark for Assessing LLMs in Open Radio Access Networks

Pranshav Gajjar, Vijay K. Shah

Large Language Models (LLMs) can revolutionize how we deploy and operate Open Radio Access Networks (O-RAN) by enhancing network analytics, anomaly detection, and code generation and significantly increasing the efficiency and reliability of a plethora of O-RAN tasks. In this paper, we present ORAN-Bench-13K, the first comprehensive benchmark designed to evaluate the performance of Large Language Models (LLMs) within the context of O-RAN. Our benchmark consists of 13,952 meticulously curated multiple-choice questions generated from 116 O-RAN specification documents. We leverage a novel three-stage LLM framework, and the questions are categorized into three distinct difficulties to cover a wide spectrum of ORAN-related knowledge. We thoroughly evaluate the performance of several state-of-the-art LLMs, including Gemini, Chat-GPT, and Mistral. Additionally, we propose ORANSight, a Retrieval-Augmented Generation (RAG)-based pipeline that demonstrates superior performance on ORAN-Bench-13K compared to other tested closed-source models. Our findings indicate that current popular LLM models are not proficient in O-RAN, highlighting the need for specialized models. We observed a noticeable performance improvement when incorporating the RAG-based ORANSight pipeline, with a Macro Accuracy of 0.784 and a Weighted Accuracy of 0.776, which was on average 21.55% and 22.59% better than the other tested LLMs.

51.1LGApr 20
LLM-AUG: Robust Wireless Data Augmentation with In-Context Learning in Large Language Models

Pranshav Gajjar, Manan Tiwari, Sayanta Seth et al.

Data scarcity remains a fundamental bottleneck in applying deep learning to wireless communication problems, particularly in scenarios where collecting labeled Radio Frequency (RF) data is expensive, time-consuming, or operationally constrained. This paper proposes LLM-AUG, a data augmentation framework that leverages in-context learning in large language models (LLMs) to generate synthetic training samples directly in a learned embedding space. Unlike conventional generative approaches that require training task-specific models, LLM-AUG performs data generation through structured prompting, enabling rapid adaptation in low-shot regimes. We evaluate LLM-AUG on two representative tasks: modulation classification and interference classification using the RadioML 2016.10A dataset, and the Interference Classification (IC) dataset respectively. Results show that LLM-AUG consistently outperforms traditional augmentation and deep generative baselines across low-shot settings and reaches near oracle performance using only 15% labeled data. LLM-AUG further demonstrates improved robustness under distribution shifts, yielding a 29.4% relative gain over diffusion-based augmentation at a lower SNR value. On the RadioML and IC datasets, LLM-AUG yields a relative gain of 67.6% and 35.7% over the diffusion-based baseline. The t-SNE visualizations further validate that synthetic samples generated by better preserve class structure in the embedding space, leading to more consistent and informative augmentations. These results demonstrate that LLMs can serve as effective and practical data augmenters for wireless machine learning, enabling robust and data-efficient learning in evolving wireless environments.

79.1LGApr 14
Enhancing Confidence Estimation in Telco LLMs via Twin-Pass CoT-Ensembling

Anton Saenko, Pranshav Gajjar, Abiodun Ganiyu et al.

Large Language Models (LLMs) are increasingly applied to complex telecommunications tasks, including 3GPP specification analysis and O-RAN network troubleshooting. However, a critical limitation remains: LLM-generated confidence scores are often biased and unreliable, frequently exhibiting systematic overconfidence. This lack of trustworthy self-assessment makes it difficult to verify model outputs and safely rely on them in practice. In this paper, we study confidence calibration in telecom-domain LLMs using the representative Gemma-3 model family (4B, 12B, and 27B parameters), evaluated on TeleQnA, ORANBench, and srsRANBench. We show that standard single-pass, verbalized confidence estimates fail to reflect true correctness, often assigning high confidence to incorrect predictions. To address this, we propose a novel Twin-Pass Chain of Thought (CoT)-Ensembling methodology for improving confidence estimation by leveraging multiple independent reasoning evaluations and aggregating their assessments into a calibrated confidence score. Our approach reduces Expected Calibration Error (ECE) by up to 88% across benchmarks, significantly improving the reliability of model self-assessment. These results highlight the limitations of current confidence estimation practices and demonstrate a practical path toward more trustworthy evaluation of LLM outputs in telecommunications.

CLMar 7, 2025Code
ORANSight-2.0: Foundational LLMs for O-RAN

Pranshav Gajjar, Vijay K. Shah

Despite the transformative impact of Large Language Models (LLMs) across critical domains such as healthcare, customer service, and business marketing, their integration into Open Radio Access Networks (O-RAN) remains limited. This gap is primarily due to the absence of domain-specific foundational models, with existing solutions often relying on general-purpose LLMs that fail to address the unique challenges and technical intricacies of O-RAN. To bridge this gap, we introduce ORANSight-2.0 (O-RAN Insights), a pioneering initiative to develop specialized foundational LLMs tailored for O-RAN. Built on 18 models spanning five open-source LLM frameworks -- Mistral, Qwen, Llama, Phi, and Gemma -- ORANSight-2.0 fine-tunes models ranging from 1B to 70B parameters, significantly reducing reliance on proprietary, closed-source models while enhancing performance in O-RAN-specific tasks. At the core of ORANSight-2.0 is RANSTRUCT, a novel Retrieval-Augmented Generation (RAG)-based instruction-tuning framework that employs two LLM agents -- a Mistral-based Question Generator and a Qwen-based Answer Generator -- to create high-quality instruction-tuning datasets. The generated dataset is then used to fine-tune the 18 pre-trained open-source LLMs via QLoRA. To evaluate ORANSight-2.0, we introduce srsRANBench, a novel benchmark designed for code generation and codebase understanding in the context of srsRAN, a widely used 5G O-RAN stack.

60.6NIApr 8
Aerial Booster-Cell Enabled Inter-Cell Interference Coordination for 5G NR Networks

Md Sharif Hossen, Vijay K. Shah, Ismail Guvenc

Cellular-connected unmanned aerial vehicles (UAVs) operating in 5G New Radio (NR) macro networks experience severe and spatially non-uniform downlink interference. This is primarily caused by the interference from the sidelobes of downtilted base station (BS) antennas serving terrestrial users, which limits the ability of the network to provide uniform and high-quality coverage to aerial users. Supporting aerial users requires boosting the coverage of certain cells or sectors, which can further exacerbate inter-cell interference in dense macro deployments. This motivates the need for inter-cell interference coordination (ICIC) in multi-cell 5G NR networks serving both aerial and terrestrial users. In this work, we propose an ICIC framework that jointly optimizes antenna-domain coordination through BS uptilt angle optimization and time-domain interference coordination (TDIC) through NR-compliant scheduling. The framework is formulated as a multi-cell NR macro deployment problem that maximizes the minimum UAV signal-to-interference ratio (SIR) over a spatial grid of UAV locations while maintaining acceptable performance for ground user equipment (GUEs). The resulting optimization problem is non-convex and is solved using bio-inspired optimization techniques, including particle swarm optimization (PSO) and genetic algorithm (GA). Simulation results demonstrate that coordinated uptilt optimization with the booster-cell architecture significantly improves worst-case UAV SIR and downlink reliability in multi-cell 5G NR networks. booster-cell architecture significantly improves worst-case UAV SIR and downlink reliability in multi-cell 5G NR networks.

CRFeb 15, 2024
Preserving Data Privacy for ML-driven Applications in Open Radio Access Networks

Pranshav Gajjar, Azuka Chiejina, Vijay K. Shah

Deep learning offers a promising solution to improve spectrum access techniques by utilizing data-driven approaches to manage and share limited spectrum resources for emerging applications. For several of these applications, the sensitive wireless data (such as spectrograms) are stored in a shared database or multistakeholder cloud environment and are therefore prone to privacy leaks. This paper aims to address such privacy concerns by examining the representative case study of shared database scenarios in 5G Open Radio Access Network (O-RAN) networks where we have a shared database within the near-real-time (near-RT) RAN intelligent controller. We focus on securing the data that can be used by machine learning (ML) models for spectrum sharing and interference mitigation applications without compromising the model and network performances. The underlying idea is to leverage a (i) Shuffling-based learnable encryption technique to encrypt the data, following which, (ii) employ a custom Vision transformer (ViT) as the trained ML model that is capable of performing accurate inferences on such encrypted data. The paper offers a thorough analysis and comparisons with analogous convolutional neural networks (CNN) as well as deeper architectures (such as ResNet-50) as baselines. Our experiments showcase that the proposed approach significantly outperforms the baseline CNN with an improvement of 24.5% and 23.9% for the percent accuracy and F1-Score respectively when operated on encrypted data. Though deeper ResNet-50 architecture is obtained as a slightly more accurate model, with an increase of 4.4%, the proposed approach boasts a reduction of parameters by 99.32%, and thus, offers a much-improved prediction time by nearly 60%.

CROct 10, 2025
A Demonstration of Self-Adaptive Jamming Attack Detection in AI/ML Integrated O-RAN

Md Habibur Rahman, Md Sharif Hossen, Nathan H. Stephenson et al.

The open radio access network (O-RAN) enables modular, intelligent, and programmable 5G network architectures through the adoption of software-defined networking, network function virtualization, and implementation of standardized open interfaces. However, one of the security concerns for O-RAN, which can severely undermine network performance, is jamming attacks. This paper presents SAJD- a self-adaptive jammer detection framework that autonomously detects jamming attacks in AI/ML framework-integrated ORAN environments without human intervention. The SAJD framework forms a closed-loop system that includes near-realtime inference of radio signal jamming via our developed ML-based xApp, as well as continuous monitoring and retraining pipelines through rApps. In this demonstration, we will show how SAJD outperforms state-of-the-art jamming detection xApp (offline trained with manual labels) in terms of accuracy and adaptability under various dynamic and previously unseen interference scenarios in the O-RAN-compliant testbed.

NIJul 12, 2021
AoI-minimizing Scheduling in UAV-relayed IoT Networks

Biplav Choudhury, Vijay K. Shah, Aidin Ferdowsi et al.

Due to flexibility, autonomy and low operational cost, unmanned aerial vehicles (UAVs), as fixed aerial base stations, are increasingly being used as \textit{relays} to collect time-sensitive information (i.e., status updates) from IoT devices and deliver it to the nearby terrestrial base station (TBS), where the information gets processed. In order to ensure timely delivery of information to the TBS (from all IoT devices), optimal scheduling of time-sensitive information over two hop UAV-relayed IoT networks (i.e., IoT device to the UAV [hop 1], and UAV to the TBS [hop 2]) becomes a critical challenge. To address this, we propose scheduling policies for Age of Information (AoI) minimization in such two-hop UAV-relayed IoT networks. To this end, we present a low-complexity MAF-MAD scheduler, that employs Maximum AoI First (MAF) policy for sampling of IoT devices at UAV (hop 1) and Maximum AoI Difference (MAD) policy for updating sampled packets from UAV to the TBS (hop 2). We show that MAF-MAD is the optimal scheduler under ideal conditions, i.e., error-free channels and generate-at-will traffic generation at IoT devices. On the contrary, for realistic conditions, we propose a Deep-Q-Networks (DQN) based scheduler. Our simulation results show that DQN-based scheduler outperforms MAF-MAD scheduler and three other baseline schedulers, i.e., Maximal AoI First (MAF), Round Robin (RR) and Random, employed at both hops under general conditions when the network is small (with 10's of IoT devices). However, it does not scale well with network size whereas MAF-MAD outperforms all other schedulers under all considered scenarios for larger networks.

NIJan 6, 2021
Deep Learning for Fast and Reliable Initial Access in AI-Driven 6G mmWave Networks

Tarun S. Cousik, Vijay K. Shah, Tugba Erpek et al.

We present DeepIA, a deep neural network (DNN) framework for enabling fast and reliable initial access for AI-driven beyond 5G and 6G millimeter (mmWave) networks. DeepIA reduces the beam sweep time compared to a conventional exhaustive search-based IA process by utilizing only a subset of the available beams. DeepIA maps received signal strengths (RSSs) obtained from a subset of beams to the beam that is best oriented to the receiver. In both line of sight (LoS) and non-line of sight (NLoS) conditions, DeepIA reduces the IA time and outperforms the conventional IA's beam prediction accuracy. We show that the beam prediction accuracy of DeepIA saturates with the number of beams used for IA and depends on the particular selection of the beams. In LoS conditions, the selection of the beams is consequential and improves the accuracy by up to 70%. In NLoS situations, it improves accuracy by up to 35%. We find that, averaging multiple RSS snapshots further reduces the number of beams needed and achieves more than 95% accuracy in both LoS and NLoS conditions. Finally, we evaluate the beam prediction time of DeepIA through embedded hardware implementation and show the improvement over the conventional beam sweeping.

NISep 8, 2020
Cross-layer Band Selection and Routing Design for Diverse Band-aware DSA Networks

Pratheek S. Upadhyaya, Vijay K. Shah, Jeffrey H. Reed

As several new spectrum bands are opening up for shared use, a new paradigm of \textit{Diverse Band-aware Dynamic Spectrum Access} (d-DSA) has emerged. d-DSA equips a secondary device with software defined radios (SDRs) and utilize whitespaces (or idle channels) in \textit{multiple bands}, including but not limited to TV, LTE, Citizen Broadband Radio Service (CBRS), unlicensed ISM. In this paper, we propose a decentralized, online multi-agent reinforcement learning based cross-layer BAnd selection and Routing Design (BARD) for such d-DSA networks. BARD not only harnesses whitespaces in multiple spectrum bands, but also accounts for unique electro-magnetic characteristics of those bands to maximize the desired quality of service (QoS) requirements of heterogeneous message packets; while also ensuring no harmful interference to the primary users in the utilized band. Our extensive experiments demonstrate that BARD outperforms the baseline dDSAaR algorithm in terms of message delivery ratio, however, at a relatively higher network latency, for varying number of primary and secondary users. Furthermore, BARD greatly outperforms its single-band DSA variants in terms of both the metrics in all considered scenarios.

SPJun 22, 2020
Fast Initial Access with Deep Learning for Beam Prediction in 5G mmWave Networks

Tarun S. Cousik, Vijay K. Shah, Jeffrey H. Reed et al.

This paper presents DeepIA, a deep learning solution for faster and more accurate initial access (IA) in 5G millimeter wave (mmWave) networks when compared to conventional IA. By utilizing a subset of beams in the IA process, DeepIA removes the need for an exhaustive beam search thereby reducing the beam sweep time in IA. A deep neural network (DNN) is trained to learn the complex mapping from the received signal strengths (RSSs) collected with a reduced number of beams to the optimal spatial beam of the receiver (among a larger set of beams). In test time, DeepIA measures RSSs only from a small number of beams and runs the DNN to predict the best beam for IA. We show that DeepIA reduces the IA time by sweeping fewer beams and significantly outperforms the conventional IA's beam prediction accuracy in both line of sight (LoS) and non-line of sight (NLoS) mmWave channel conditions.