LGOct 24, 2022
Federated Learning and Meta Learning: Approaches, Applications, and DirectionsXiaonan Liu, Yansha Deng, Arumugam Nallanathan et al.
Over the past few years, significant advancements have been made in the field of machine learning (ML) to address resource management, interference management, autonomy, and decision-making in wireless networks. Traditional ML approaches rely on centralized methods, where data is collected at a central server for training. However, this approach poses a challenge in terms of preserving the data privacy of devices. To address this issue, federated learning (FL) has emerged as an effective solution that allows edge devices to collaboratively train ML models without compromising data privacy. In FL, local datasets are not shared, and the focus is on learning a global model for a specific task involving all devices. However, FL has limitations when it comes to adapting the model to devices with different data distributions. In such cases, meta learning is considered, as it enables the adaptation of learning models to different data distributions using only a few data samples. In this tutorial, we present a comprehensive review of FL, meta learning, and federated meta learning (FedMeta). Unlike other tutorial papers, our objective is to explore how FL, meta learning, and FedMeta methodologies can be designed, optimized, and evolved, and their applications over wireless networks. We also analyze the relationships among these learning algorithms and examine their advantages and disadvantages in real-world applications.
LGApr 20, 2022
Efficient Wireless Federated Learning with Partial Model AggregationZhixiong Chen, Wenqiang Yi, Arumugam Nallanathan et al.
The data heterogeneity across devices and the limited communication resources, e.g., bandwidth and energy, are two of the main bottlenecks for wireless federated learning (FL). To tackle these challenges, we first devise a novel FL framework with partial model aggregation (PMA). This approach aggregates the lower layers of neural networks, responsible for feature extraction, at the parameter server while keeping the upper layers, responsible for complex pattern recognition, at devices for personalization. The proposed PMA-FL is able to address the data heterogeneity and reduce the transmitted information in wireless channels. Then, we derive a convergence bound of the framework under a non-convex loss function setting to reveal the role of unbalanced data size in the learning performance. On this basis, we maximize the scheduled data size to minimize the global loss function through jointly optimize the device scheduling, bandwidth allocation, computation and communication time division policies with the assistance of Lyapunov optimization. Our analysis reveals that the optimal time division is achieved when the communication and computation parts of PMA-FL have the same power. We also develop a bisection method to solve the optimal bandwidth allocation policy and use the set expansion algorithm to address the device scheduling policy. Compared with the benchmark schemes, the proposed PMA-FL improves 3.13\% and 11.8\% accuracy on two typical datasets with heterogeneous data distribution settings, i.e., MINIST and CIFAR-10, respectively. In addition, the proposed joint dynamic device scheduling and resource management approach achieve slightly higher accuracy than the considered benchmarks, but they provide a satisfactory energy and time reduction: 29\% energy or 20\% time reduction on the MNIST; and 25\% energy or 12.5\% time reduction on the CIFAR-10.
LGAug 10, 2022
A Novel Resource Allocation for Anti-jamming in Cognitive-UAVs: an Active Inference ApproachAli Krayani, Atm S. Alam, Lucio Marcenaro et al.
This work proposes a novel resource allocation strategy for anti-jamming in Cognitive Radio using Active Inference ($\textit{AIn}$), and a cognitive-UAV is employed as a case study. An Active Generalized Dynamic Bayesian Network (Active-GDBN) is proposed to represent the external environment that jointly encodes the physical signal dynamics and the dynamic interaction between UAV and jammer in the spectrum. We cast the action and planning as a Bayesian inference problem that can be solved by avoiding surprising states (minimizing abnormality) during online learning. Simulation results verify the effectiveness of the proposed $\textit{AIn}$ approach in minimizing abnormalities (maximizing rewards) and has a high convergence speed by comparing it with the conventional Frequency Hopping and Q-learning.
AIApr 5, 2022
Towards Explainable Meta-Learning for DDoS DetectionQianru Zhou, Rongzhen Li, Lei Xu et al.
The Internet is the most complex machine humankind has ever built, and how to defense it from intrusions is even more complex. With the ever increasing of new intrusions, intrusion detection task rely on Artificial Intelligence more and more. Interpretability and transparency of the machine learning model is the foundation of trust in AI-driven intrusion detection results. Current interpretation Artificial Intelligence technologies in intrusion detection are heuristic, which is neither accurate nor sufficient. This paper proposed a rigorous interpretable Artificial Intelligence driven intrusion detection approach, based on artificial immune system. Details of rigorous interpretation calculation process for a decision tree model is presented. Prime implicant explanation for benign traffic flow are given in detail as rule for negative selection of the cyber immune system. Experiments are carried out in real-life traffic.
97.3ITMar 31
Enhanced Channel Estimation for Flexible Intelligent Metasurface-Aided Communication SystemsJinyue Jiang, Jiancheng An, Lu Gan et al.
Flexible intelligent metasurface (FIM) has recently received considerable interest due to its advantage in realizing a better channel condition by dynamically morphing its surface shape. An FIM consists of multiple elements deposited on a flexible substrate. These elements can not only transmit signals, but also adapt their displacements in a direction perpendicular to the FIM surface via an attached controller. In this paper, we consider the channel estimation problem for the uplink of an FIM-enhanced communication system via customizing the orthogonal matching pursuit (OMP) method. Specifically, we formulate an optimization problem of minimizing the column coherence of the measurement matrix by optimizing the FIM's surface shape, subject to the morphing range constraint. Based on the estimated direction of arrival (DOA) and channel gain, we further investigate the signal-to-noise ratio (SNR) improvement in the FIM-enhanced downlink multiple-input single-output (MISO) system. Numerical results demonstrate that an FIM significantly outperforms a conventional rigid uniform planar array (UPA), thereby showing that FIM can substantially improve channel estimation accuracy and achieve SNR improvement, even when using estimated channel parameters.
91.6ITMay 28
Rate Maximization for Multi-Waveguide PASS: A Hierarchical User Scheduling and Joint Optimization FrameworkGuangyu Li, Xin Sun, Tianwei Hou et al.
Pinching-antenna systems (PASS) have emerged as a promising flexible-antenna architecture capable of dynamically reconfiguring wireless channels by activating dielectric particles along waveguides. The sum rate maximization problem in multi-waveguide PASS is investigated in this study. Both in-waveguide propagation loss and coupling effects are explicitly modeled. To tackle the optimization problem, a hierarchical user scheduling (HUS) algorithm is proposed. The HUS algorithm minimizes the sum of squared distances between users and their associated waveguides to mitigate path loss. Additionally, spatially separated users are assigned within each time slot to reduce inter-user interference. Furthermore, a joint optimization framework integrating power allocation and pinching-antenna (PA) positioning is developed to further improve system sum rate. Specifically, PAs' positions are optimized via one-dimensional search, while the power allocation problem is solved by using the Lagrangian duality and fractional programming. Numerical results show that the HUS algorithm clearly outperforms random pairing, and the proposed power allocation algorithm shows a marked performance improvement over the maximum ratio transmission algorithm. Moreover, the results explicitly demonstrate the considerable impact of in-waveguide propagation loss and coupling effects on the performance of PASS.
88.5ITMay 26
Satellite Navigation: A Transmitting Intelligent Surface (TIS)-aided Indoor SystemDa Guan, Xin Sun, Tianwei Hou et al.
A transmitting intelligent surfaces (TISs) aided satellite indoor navigation system is investigated. By leveraging the unique features of TIS, we address the limitations of conventional global navigation satellite systems (GNSS) in providing reliable positioning services within indoor environments. To facilitate the extension of GNSS indoor signals, we establish an extended line-of-sight link using TIS which has the capability to change signal direction. A three-stage TIS-aided satellite indoor positioning algorithm (TSIPA), which utilizes the positions of TIS arrays and the angle of arrival, is proposed to locate indoor users. To evaluate the distribution of TIS arrays, we propose TIS position dilution of precision (TPDoP) to evaluate centroid deviation and utilize the root mean square error (RMSE) to represent compactness.
97.0SPMay 2
Spectral- and Energy-efficient Multi-BS Multi-RIS Pinching-antenna Systems: A GNN-based ApproachChangpeng He, Yang Lu, Wei Chen et al.
This paper investigates coordinated downlink transmission in a multi-base station (multi-BS) multi-reconfigurable intelligent surface (multi-RIS)-assisted pinching-antenna (PA) system, where each user equipment (UE) is associated with a single BS and each BS is equipped with movable PAs deployed on parallel waveguides. We formulate sum rate (SR) and energy efficiency (EE) maximization problems by jointly optimizing PA placement, RIS phase shifts, transmit beamforming, and BS-UE association under constraints of inter-PA spacing, power budget, and unit-modulus phase shift. To address the resulting highly coupled mixed-variable problem, we propose a three-stage graph neural network (GNN) that integrates heterogeneous and homogeneous graph representations and is trained end-to-end in an unsupervised manner. Extensive numerical results demonstrate that the proposed three-stage GNN consistently outperforms representative system and learning baselines, generalizes well to unseen numbers of UEs, RISs, and BSs, and maintains millisecond-level inference time. Besides, the results validate the effectiveness of the proposed design from both system and architectural perspectives. Moreover, PAs are shown to enhance SR and EE, and the performance gain is enlarged with increasing number of PAs.
74.1ITMay 25
On the Performance of Single/Dual Fluid Antenna SystemsJiangsheng Huangfu, Zhengyu Song, Tianwei Hou et al.
The emerging technology of fluid antenna systems (FASs) represents a promising next-generation reconfigurable antenna solution, capable of exploiting the full spatial diversity within a predefined space by finely reconfiguring the positions of radiating elements. In this paper, the performance of FAS over spatially correlated Rayleigh fading channels is investigated for two distinct scenarios: a multiple-input single-output (MISO) configuration, where a receiver with a single-antenna FAS is served by a multi-antenna transmitter (MISO-FAS), and a single-input single-output setup where single-antenna FASs are equipped at both the transmitter and receiver (Dual-FAS). Exact expressions and closed-form approximations for the outage probability (OP) of both the MISO-FAS and Dual-FAS models are derived as the core contributions of this work. To provide deeper insights into system performance, the diversity orders for each model are also derived and analyzed. Analytical results demonstrate that increasing the number of ports significantly enhances system performance. The theoretical analysis is corroborated by key findings from our simulations, demonstrating that: $i$) Both the MISO-FAS and Dual-FAS models achieve considerable performance gains as the number of ports is increased; $ii$) System performance for both configurations is inversely related to the level of port correlation; lower correlation leads to better performance; $iii$) In the high signal-to-noise ratio regime, the Dual-FAS model surpasses the performance of the MISO-FAS model.
6.2ITApr 20
Channel Estimation for Rydberg Atomic Quantum Receivers: Unrolled Phase Retrieval from Holographic SnapshotsJian Xiao, Ji Wang, Ming Zeng et al.
A model-driven deep learning framework is proposed for channel estimation in Rydberg atomic quantum receivers (RAQRs) based on the measurement of holographic snapshots. Specifically, we develop a Transformer-based unrolling architecture, termed URformer, to solve the non-linear biased phase retrieval problem, which is derived by unrolling a stabilized variant of the expectation-maximization Gerchberg-Saxton (EM-GS) algorithm. Each layer of the proposed URformer incorporates three trainable modules: 1) a learnable filter network that replaces the fixed Bessel kernel in the classic EM-GS algorithm; 2) a trainable gating mechanism that adaptively combines classic updates to ensure training stability; and 3) an efficient channel Transformer module that learns to correct residual errors by capturing non-local channel dependencies. Numerical results demonstrate that the proposed URformer significantly outperforms classic iterative algorithms and conventional black-box neural networks with less pilot overhead.
ITJul 2, 2025
Reconfigurable Intelligent Surface aided Integrated-Navigation-and-Communication in Urban Canyons: A Satellite Selection ApproachTianwei Hou, Da Guan, Xin Sun et al.
This study investigates the application of a simultaneous transmitting and reflecting reconfigurable intelligent surface (STAR-RIS)-aided medium-Earth-orbit (MEO) satellite network for providing both global positioning services and communication services in the urban canyons, where the direct satellite-user links are obstructed. Superposition coding (SC) and successive interference cancellation (SIC) techniques are utilized for the integrated navigation and communication (INAC) networks, and the composed navigation and communication signals are reflected or transmitted to ground users or indoor users located in urban canyons. To meet diverse application needs, navigation-oriented (NO)-INAC and communication-oriented (CO)-INAC have been developed, each tailored according to distinct power allocation factors. We then proposed two algorithms, namely navigation-prioritized-algorithm (NPA) and communication-prioritized-algorithm (CPA), to improve the navigation or communication performance by selecting the satellite with the optimized position dilution of precision (PDoP) or with the best channel gain. The effectiveness of the proposed STAR-RIS-aided INAC network is quantified by analyzing the positioning error for navigation services and by evaluating communication performance through achievable ergodic rate metrics. Our satellite selection approach indicates that: the positioning services at the urban canyon users can be completed with the aid of STAR-RIS. 2) Additionally, it is observed that while a single STAR-RIS array can extend the navigational link, it fails to serve users in indoor scenarios, highlighting a limitation in the current system design.
77.8SYMay 23
Rethinking Satellite Networks: When Navigation Meets CommunicationsTianwei Hou, Da Guan, Yu Zhang et al.
This paper investigates satellite navigation and communication systems in both low-Earth-orbit (LEO) and medium-Earth-orbit (MEO) satellites, which systematically outlines the fundamental principles of satellite navigation systems (SNS), satellite communication systems (SCS), and integrated navigation and communication (INAC) systems. By exploring the enhanced capabilities of satellite systems, the article emphasizes how INAC systems improve overall functionality by enabling efficient signal multiplexing and multiple access, positioning multi-functional satellites as promising alternatives to traditional architectures. Moreover, it introduces emerging frontiers for LEO-based SNS and MEO-based SCS through the integration of advanced sixth-generation (6G) wireless technologies, which cannot be realized through mere extensions of existing communication or navigation techniques. Motivated by these insights, the article further discusses various conceptual transitions required to unlock the full potential of INAC systems, with particular focus on channel capacity, positioning accuracy, and artificial intelligence-enabled waveform design.
SYNov 10, 2025
GNN-Enabled Robust Hybrid Beamforming with Score-Based CSI Generation and DenoisingYuhang Li, Yang Lu, Bo Ai et al.
Accurate Channel State Information (CSI) is critical for Hybrid Beamforming (HBF) tasks. However, obtaining high-resolution CSI remains challenging in practical wireless communication systems. To address this issue, we propose to utilize Graph Neural Networks (GNNs) and score-based generative models to enable robust HBF under imperfect CSI conditions. Firstly, we develop the Hybrid Message Graph Attention Network (HMGAT) which updates both node and edge features through node-level and edge-level message passing. Secondly, we design a Bidirectional Encoder Representations from Transformers (BERT)-based Noise Conditional Score Network (NCSN) to learn the distribution of high-resolution CSI, facilitating CSI generation and data augmentation to further improve HMGAT's performance. Finally, we present a Denoising Score Network (DSN) framework and its instantiation, termed DeBERT, which can denoise imperfect CSI under arbitrary channel error levels, thereby facilitating robust HBF. Experiments on DeepMIMO urban datasets demonstrate the proposed models' superior generalization, scalability, and robustness across various HBF tasks with perfect and imperfect CSI.
LGJan 21, 2021Code
Learning based signal detection for MIMO systems with unknown noise statisticsKe He, Le He, Lisheng Fan et al.
This paper aims to devise a generalized maximum likelihood (ML) estimator to robustly detect signals with unknown noise statistics in multiple-input multiple-output (MIMO) systems. In practice, there is little or even no statistical knowledge on the system noise, which in many cases is non-Gaussian, impulsive and not analyzable. Existing detection methods have mainly focused on specific noise models, which are not robust enough with unknown noise statistics. To tackle this issue, we propose a novel ML detection framework to effectively recover the desired signal. Our framework is a fully probabilistic one that can efficiently approximate the unknown noise distribution through a normalizing flow. Importantly, this framework is driven by an unsupervised learning approach, where only the noise samples are required. To reduce the computational complexity, we further present a low-complexity version of the framework, by utilizing an initial estimation to reduce the search space. Simulation results show that our framework outperforms other existing algorithms in terms of bit error rate (BER) in non-analytical noise environments, while it can reach the ML performance bound in analytical noise environments. The code of this paper is available at https://github.com/skypitcher/manfe.
LGJan 7, 2021Code
Towards Optimally Efficient Search with Deep Learning for Large-Scale MIMO SystemsLe He, Ke He, Lisheng Fan et al.
This paper investigates the optimal signal detection problem with a particular interest in large-scale multiple-input multiple-output (MIMO) systems. The problem is NP-hard and can be solved optimally by searching the shortest path on the decision tree. Unfortunately, the existing optimal search algorithms often involve prohibitively high complexities, which indicates that they are infeasible in large-scale MIMO systems. To address this issue, we propose a general heuristic search algorithm, namely, hyper-accelerated tree search (HATS) algorithm. The proposed algorithm employs a deep neural network (DNN) to estimate the optimal heuristic, and then use the estimated heuristic to speed up the underlying memory-bounded search algorithm. This idea is inspired by the fact that the underlying heuristic search algorithm reaches the optimal efficiency with the optimal heuristic function. Simulation results show that the proposed algorithm reaches almost the optimal bit error rate (BER) performance in large-scale systems, while the memory size can be bounded. In the meanwhile, it visits nearly the fewest tree nodes. This indicates that the proposed algorithm reaches almost the optimal efficiency in practical scenarios, and thereby it is applicable for large-scale systems. Besides, the code for this paper is available at \url{https://github.com/skypitcher/hats}.
72.7ITMay 9
Fluid Antennas Assisted RIS-NOMA Communication NetworksXinwei Yue, He Geng, Jingjing Zhao et al.
This paper introduces a fluid antenna system (FAS) into reconfigurable intelligent surface (RIS) assisted non-orthogonal multiple access (NOMA) communication networks, where the non-orthogonal users are equipped with planar fluid antennas. Specifically, we formulate a sum rate maximization problem for FAS-RIS-NOMA networks, which jointly optimizes the fluid ports, the RIS deployment, and the phase shift matrix. To solve the resulting non-convex optimization problem involving highly coupled variables, an iterative algorithm based on alternating optimization is employed to decompose the original problem into three subproblems. Exhaustive search is employed for optimizing the fluid ports, particle swarm optimization is used for the RIS deployment, and semidefinite relaxation with successive convex approximation is adopted for optimizing the phase shift matrix. Finally, the simulation results show that: 1) compared with traditional antenna systems and orthogonal multiple access, the FAS-RIS-NOMA networks achieve higher system throughput under high signal-to-noise ratio conditions; and 2) by increasing the number of RIS elements and enlarging the FAS size, the sum rate of FAS-RIS-NOMA networks can be significantly enhanced.
73.1ITMay 8
Movable Subarray-Aided Hybrid Beamforming for Near-Field Multiuser CommunicationsXiangqian Xu, Songjie Yang, Arumugam Nallanathan
Movable antenna (MA)-enabled near-field (NF) communications offer significant potential for 6G, yet existing designs often neglect the practical constraints of hybrid beamforming (HBF) for extremely large-scale MIMO (XL-MIMO). Conversely, MA-aided HBF frequently overlooks the rich NF degrees of freedom (DoFs). This paper proposes a movable subarray (MSA)-aided HBF architecture for NF multiuser systems, which strikes a strategic balance between hardware practicality and spatial flexibility. By coupling MSA movement with HBF, the proposed design simultaneously exploits NF distance-dependent and MSA position-dependent DoFs, enabling highly precise beamfocusing and superior interference mitigation. To alleviate the computational burden, a hybrid planar-spherical wave model is introduced for efficient channel approximation. Furthermore, an alternating optimization (AO) algorithm is developed by integrating fractional programming, the alternating direction method of multipliers (ADMM), and projected gradient ascent. Simulation results validate substantial sum-rate gains over fixedposition antenna (FPA) benchmarks.
SPFeb 12, 2025
Joint Transmit and Pinching Beamforming for Pinching Antenna Systems (PASS): Optimization-Based or Learning-Based?Xiaoxia Xu, Xidong Mu, Yuanwei Liu et al.
A novel pinching antenna system (PASS)-enabled downlink multi-user multiple-input single-output (MISO) framework is proposed. PASS consists of multiple waveguides spanning over thousands of wavelength, which equip numerous low-cost dielectric particles, named pinching antennas (PAs), to radiate signals into free space. The positions of PAs can be reconfigured to change both the large-scale path losses and phases of signals, thus facilitating the novel pinching beamforming design. A sum rate maximization problem is formulated, which jointly optimizes the transmit and pinching beamforming to adaptively achieve constructive signal enhancement and destructive interference mitigation. To solve this highly coupled and nonconvex problem, both optimization-based and learning-based methods are proposed. 1) For the optimization-based method, a majorization-minimization and penalty dual decomposition (MM-PDD) algorithm is developed, which handles the nonconvex complex exponential component using a Lipschitz surrogate function and then invokes PDD for problem decoupling. 2) For the learning-based method, a novel Karush-Kuhn-Tucker (KKT)-guided dual learning (KDL) approach is proposed, which enables KKT solutions to be reconstructed in a data-driven manner by learning dual variables. Following this idea, a KDL-Tranformer algorithm is developed, which captures both inter-PA/inter-user dependencies and channel-state-information (CSI)-beamforming dependencies by attention mechanisms. Simulation results demonstrate that: i) The proposed PASS framework significantly outperforms conventional massive multiple input multiple output (MIMO) system even with a few PAs. ii) The proposed KDL-Transformer can improve over 30% system performance than MM-PDD algorithm, while achieving a millisecond-level response on modern GPUs.
62.1DCApr 24
Network Edge Inference for Large Language Models: Principles, Techniques, and OpportunitiesZhixiong Chen, Bingjie Zhu, Jiangzhou Wang et al.
Large language models (LLMs) have advanced rapidly, emerging as versatile tools across fields thanks to their exceptional language understanding, generation, and reasoning capabilities. However, performing LLM inference at the network edge remains challenging due to their large memory and compute demands. This survey outlines the challenges specific to LLM edge inference and provides a comprehensive overview of recent progress, covering system architectures, model optimization and deployment, and resource management and scheduling. By synthesizing state-of-the-art techniques and mapping future directions, this survey aims to unlock the potential of LLMs in resource-constrained edge environments.
NINov 25, 2025
RIS-Assisted Downlink Pinching-Antenna Systems: GNN-Enabled Optimization ApproachesChangpeng He, Yang Lu, Yanqing Xu et al.
This paper investigates a reconfigurable intelligent surface (RIS)-assisted multi-waveguide pinching-antenna (PA) system (PASS) for multi-user downlink information transmission, motivated by the unknown impact of the integration of emerging PASS and RIS on wireless communications. First, we formulate sum rate (SR) and energy efficiency (EE) maximization problems in a unified framework, subject to constraints on the movable region of PAs, total power budget, and tunable phase of RIS elements. Then, by leveraging a graph-structured topology of the RIS-assisted PASS, a novel three-stage graph neural network (GNN) is proposed, which learns PA positions based on user locations, and RIS phase shifts according to composite channel conditions at the first two stages, respectively, and finally determines beamforming vectors. Specifically, the proposed GNN is achieved through unsupervised training, together with three implementation strategies for its integration with convex optimization, thus offering trade-offs between inference time and solution optimality. Extensive numerical results are provided to validate the effectiveness of the proposed GNN, and to support its unique attributes of viable generalization capability, good performance reliability, and real-time applicability. Moreover, the impact of key parameters on RIS-assisted PASS is illustrated and analyzed.
SYSep 17, 2025
Large Language Model-Empowered Decision Transformer for UAV-Enabled Data CollectionZhixion Chen, Jiangzhou Wang, Hyundong Shin et al.
The deployment of unmanned aerial vehicles (UAVs) for reliable and energy-efficient data collection from spatially distributed devices holds great promise in supporting diverse Internet of Things (IoT) applications. Nevertheless, the limited endurance and communication range of UAVs necessitate intelligent trajectory planning. While reinforcement learning (RL) has been extensively explored for UAV trajectory optimization, its interactive nature entails high costs and risks in real-world environments. Offline RL mitigates these issues but remains susceptible to unstable training and heavily rely on expert-quality datasets. To address these challenges, we formulate a joint UAV trajectory planning and resource allocation problem to maximize energy efficiency of data collection. The resource allocation subproblem is first transformed into an equivalent linear programming formulation and solved optimally with polynomial-time complexity. Then, we propose a large language model (LLM)-empowered critic-regularized decision transformer (DT) framework, termed LLM-CRDT, to learn effective UAV control policies. In LLM-CRDT, we incorporate critic networks to regularize the DT model training, thereby integrating the sequence modeling capabilities of DT with critic-based value guidance to enable learning effective policies from suboptimal datasets. Furthermore, to mitigate the data-hungry nature of transformer models, we employ a pre-trained LLM as the transformer backbone of the DT model and adopt a parameter-efficient fine-tuning strategy, i.e., LoRA, enabling rapid adaptation to UAV control tasks with small-scale dataset and low computational overhead. Extensive simulations demonstrate that LLM-CRDT outperforms benchmark online and offline RL methods, achieving up to 36.7\% higher energy efficiency than the current state-of-the-art DT approaches.
SPJun 13, 2024
Federated Contrastive Learning for Personalized Semantic CommunicationYining Wang, Wanli Ni, Wenqiang Yi et al.
In this letter, we design a federated contrastive learning (FedCL) framework aimed at supporting personalized semantic communication. Our FedCL enables collaborative training of local semantic encoders across multiple clients and a global semantic decoder owned by the base station. This framework supports heterogeneous semantic encoders since it does not require client-side model aggregation. Furthermore, to tackle the semantic imbalance issue arising from heterogeneous datasets across distributed clients, we employ contrastive learning to train a semantic centroid generator (SCG). This generator obtains representative global semantic centroids that exhibit intra-semantic compactness and inter-semantic separability. Consequently, it provides superior supervision for learning discriminative local semantic features. Additionally, we conduct theoretical analysis to quantify the convergence performance of FedCL. Simulation results verify the superiority of the proposed FedCL framework compared to other distributed learning benchmarks in terms of task performance and robustness under different numbers of clients and channel conditions, especially in low signal-to-noise ratio and highly heterogeneous data scenarios.
LGMay 15, 2023
Adaptive Federated Pruning in Hierarchical Wireless NetworksXiaonan Liu, Shiqiang Wang, Yansha Deng et al.
Federated Learning (FL) is a promising privacy-preserving distributed learning framework where a server aggregates models updated by multiple devices without accessing their private datasets. Hierarchical FL (HFL), as a device-edge-cloud aggregation hierarchy, can enjoy both the cloud server's access to more datasets and the edge servers' efficient communications with devices. However, the learning latency increases with the HFL network scale due to the increasing number of edge servers and devices with limited local computation capability and communication bandwidth. To address this issue, in this paper, we introduce model pruning for HFL in wireless networks to reduce the neural network scale. We present the convergence analysis of an upper on the l2 norm of gradients for HFL with model pruning, analyze the computation and communication latency of the proposed model pruning scheme, and formulate an optimization problem to maximize the convergence rate under a given latency threshold by jointly optimizing the pruning ratio and wireless resource allocation. By decoupling the optimization problem and using Karush Kuhn Tucker (KKT) conditions, closed-form solutions of pruning ratio and wireless resource allocation are derived. Simulation results show that our proposed HFL with model pruning achieves similar learning accuracy compared with the HFL without model pruning and reduces about 50 percent communication cost.
LGOct 28, 2021
Computational Intelligence and Deep Learning for Next-Generation Edge-Enabled Industrial IoTShunpu Tang, Lunyuan Chen, Ke HeJunjuan Xia et al.
In this paper, we investigate how to deploy computational intelligence and deep learning (DL) in edge-enabled industrial IoT networks. In this system, the IoT devices can collaboratively train a shared model without compromising data privacy. However, due to limited resources in the industrial IoT networks, including computational power, bandwidth, and channel state, it is challenging for many devices to accomplish local training and upload weights to the edge server in time. To address this issue, we propose a novel multi-exit-based federated edge learning (ME-FEEL) framework, where the deep model can be divided into several sub-models with different depths and output prediction from the exit in the corresponding sub-model. In this way, the devices with insufficient computational power can choose the earlier exits and avoid training the complete model, which can help reduce computational latency and enable devices to participate into aggregation as much as possible within a latency threshold. Moreover, we propose a greedy approach-based exit selection and bandwidth allocation algorithm to maximize the total number of exits in each communication round. Simulation experiments are conducted on the classical Fashion-MNIST dataset under a non-independent and identically distributed (non-IID) setting, and it shows that the proposed strategy outperforms the conventional FL. In particular, the proposed ME-FEEL can achieve an accuracy gain up to 32.7% in the industrial IoT networks with the severely limited resources.
LGMar 10, 2021
Machine Learning for Massive Industrial Internet of ThingsHui Zhou, Changyang She, Yansha Deng et al.
Industrial Internet of Things (IIoT) revolutionizes the future manufacturing facilities by integrating the Internet of Things technologies into industrial settings. With the deployment of massive IIoT devices, it is difficult for the wireless network to support the ubiquitous connections with diverse quality-of-service (QoS) requirements. Although machine learning is regarded as a powerful data-driven tool to optimize wireless network, how to apply machine learning to deal with the massive IIoT problems with unique characteristics remains unsolved. In this paper, we first summarize the QoS requirements of the typical massive non-critical and critical IIoT use cases. We then identify unique characteristics in the massive IIoT scenario, and the corresponding machine learning solutions with its limitations and potential research directions. We further present the existing machine learning solutions for individual layer and cross-layer problems in massive IIoT. Last but not the least, we present a case study of massive access problem based on deep neural network and deep reinforcement learning techniques, respectively, to validate the effectiveness of machine learning in massive IIoT scenario.
SPAug 12, 2020
Caching Placement and Resource Allocation for Cache-Enabling UAV NOMA NetworksTiankui Zhang, Ziduan Wang, Yuanwei Liu et al.
This article investigates the cache-enabling unmanned aerial vehicle (UAV) cellular networks with massive access capability supported by non-orthogonal multiple access (NOMA). The delivery of a large volume of multimedia contents for ground users is assisted by a mobile UAV base station, which caches some popular contents for wireless backhaul link traffic offloading. In cache-enabling UAV NOMA networks, the caching placement of content caching phase and radio resource allocation of content delivery phase are crucial for network performance. To cope with the dynamic UAV locations and content requests in practical scenarios, we formulate the long-term caching placement and resource allocation optimization problem for content delivery delay minimization as a Markov decision process (MDP). The UAV acts as an agent to take actions for caching placement and resource allocation, which includes the user scheduling of content requests and the power allocation of NOMA users. In order to tackle the MDP, we propose a Q-learning based caching placement and resource allocation algorithm, where the UAV learns and selects action with \emph{soft ${\varepsilon}$-greedy} strategy to search for the optimal match between actions and states. Since the action-state table size of Q-learning grows with the number of states in the dynamic networks, we propose a function approximation based algorithm with combination of stochastic gradient descent and deep neural networks, which is suitable for large-scale networks. Finally, the numerical results show that the proposed algorithms provide considerable performance compared to benchmark algorithms, and obtain a trade-off between network performance and calculation complexity.
SPJul 22, 2020
Cache-enabling UAV Communications: Network Deployment and Resource AllocationTiankui Zhang, Yi Wang, Yuanwei Liu et al.
In this article, we investigate the content distribution in the hotspot area, whose traffic is offloaded by the combination of the unmanned aerial vehicle (UAV) communication and edge caching. In cache-enabling UAV-assisted cellular networks, the network deployment and resource allocation are vital for quality of experience (QoE) of users with content distribution applications. We formulate a joint optimization problem of UAV deployment, caching placement and user association for maximizing QoE of users, which is evaluated by mean opinion score (MOS). To solve this challenging problem, we decompose the optimization problem into three sub-problems. Specifically, we propose a swap matching based UAV deployment algorithm, then obtain the near-optimal caching placement and user association by greedy algorithm and Lagrange dual, respectively. Finally, we propose a low complexity iterative algorithm for the joint UAV deployment, caching placement and user association optimization, which achieves good computational complexity-optimality tradeoff. Simulation results reveal that: i) the MOS of the proposed algorithm approaches that of the exhaustive search method and converges within several iterations; and ii) compared with the benchmark algorithms, the proposed algorithm achieves better performance in terms of MOS, content access delay and backhaul traffic offloading.
SPNov 10, 2019
Deep Reinforcement Learning Based Dynamic Trajectory Control for UAV-assisted Mobile Edge ComputingLiang Wang, Kezhi Wang, Cunhua Pan et al.
In this paper, we consider a platform of flying mobile edge computing (F-MEC), where unmanned aerial vehicles (UAVs) serve as equipment providing computation resource, and they enable task offloading from user equipment (UE). We aim to minimize energy consumption of all the UEs via optimizing the user association, resource allocation and the trajectory of UAVs. To this end, we first propose a Convex optimizAtion based Trajectory control algorithm (CAT), which solves the problem in an iterative way by using block coordinate descent (BCD) method. Then, to make the real-time decision while taking into account the dynamics of the environment (i.e., UAV may take off from different locations), we propose a deep Reinforcement leArning based Trajectory control algorithm (RAT). In RAT, we apply the Prioritized Experience Replay (PER) to improve the convergence of the training procedure. Different from the convex optimization based algorithm which may be susceptible to the initial points and requires iterations, RAT can be adapted to any taking off points of the UAVs and can obtain the solution more rapidly than CAT once training process has been completed. Simulation results show that the proposed CAT and RAT achieve the similar performance and both outperform traditional algorithms.
LGOct 11, 2019
Green Deep Reinforcement Learning for Radio Resource Management: Architecture, Algorithm Compression and ChallengeZhiyong Du, Yansha Deng, Weisi Guo et al.
AI heralds a step-change in the performance and capability of wireless networks and other critical infrastructures. However, it may also cause irreversible environmental damage due to their high energy consumption. Here, we address this challenge in the context of 5G and beyond, where there is a complexity explosion in radio resource management (RRM). On the one hand, deep reinforcement learning (DRL) provides a powerful tool for scalable optimization for high dimensional RRM problems in a dynamic environment. On the other hand, DRL algorithms consume a high amount of energy over time and risk compromising progress made in green radio research. This paper reviews and analyzes how to achieve green DRL for RRM via both architecture and algorithm innovations. Architecturally, a cloud based training and distributed decision-making DRL scheme is proposed, where RRM entities can make lightweight deep local decisions whilst assisted by on-cloud training and updating. On the algorithm level, compression approaches are introduced for both deep neural networks and the underlying Markov Decision Processes, enabling accurate low-dimensional representations of challenges. To scale learning across geographic areas, a spatial transfer learning scheme is proposed to further promote the learning efficiency of distributed DRL entities by exploiting the traffic demand correlations. Together, our proposed architecture and algorithms provide a vision for green and on-demand DRL capability.