Robert Schober

h-index33

20papers

557citations

Novelty41%

AI Score54

Ranked #30,036 of 201,326 authors (top 15%)#69 in IT (top 8%)

20 Papers

97.7ITJun 4

Rotatable Antenna-Enhanced Cell-Free Communication

Kecheng Pan, Beixiong Zheng, Yanhua Tan et al.

Rotatable antenna (RA) is a promising technology that can exploit new spatial degrees-of-freedom (DoFs) by flexibly adjusting the three-dimensional (3D) boresight direction of antennas. In this letter, we investigate an RA-enhanced cell-free system for downlink transmission, where multiple RA-equipped access points (APs) cooperatively serve multiple single-antenna users over the same time-frequency resource. Specifically, we aim to maximize the sum rate of all users by jointly optimizing the AP-user associations and the RA boresight directions. Accordingly, we propose a two-stage strategy to solve the AP-user association problem, and then employ fractional programming (FP) and successive convex approximation (SCA) techniques to optimize the RA boresight directions. Numerical results demonstrate that the proposed RA-enhanced cell-free system significantly outperforms various benchmark schemes.

6.9SPMay 31Code

Communicating Smartly in Molecular Communication Environments: Neural Networks in the Internet of Bio-Nano Things

Jorge Torres Gómez, Pit Hofmann, Lisa Y. Debus et al.

Recent developments in the Internet of Bio-Nano-Things (IoBNT) are laying the foundation for innovative healthcare applications that envision a network of remotely coordinated nanodevices within the human body to monitor and actuate over potential diseases. However, interconnecting such nanodevices requires communication strategies that can cope with molecular communication (MC) channels, whose complex, stochastic, and dynamic behavior often makes accurate physical modeling infeasible. To explore the limits of nanodevice interconnectivity under these conditions, this survey focuses on data-driven communication strategies for MC systems, with particular emphasis on machine learning (ML) methods and neural network (NN) architectures for a robust and adaptive communication scheme at the nanoscale. Research on NN-enabled MC spans several aspects covered in this survey, including NNs for communication in IoBNT networks, the feasibility of biocompatible NN realization, explainable approaches, and the generation of training datasets. We also include open-source code examples to support reproducible research across key MC scenarios. Finally, we identify emerging challenges, including the need for robust NN architectures, biologically integrated NN modules, and scalable training strategies.

AIAug 29, 2023

LAMBO: Large AI Model Empowered Edge Intelligence

Li Dong, Feibo Jiang, Yubo Peng et al.

Next-generation edge intelligence is anticipated to benefit various applications via offloading techniques. However, traditional offloading architectures face several issues, including heterogeneous constraints, partial perception, uncertain generalization, and lack of tractability. In this paper, we propose a Large AI Model-Based Offloading (LAMBO) framework with over one billion parameters for solving these problems. We first use input embedding (IE) to achieve normalized feature representation with heterogeneous constraints and task prompts. Then, we introduce a novel asymmetric encoder-decoder (AED) as the decision-making model, which is an improved transformer architecture consisting of a deep encoder and a shallow decoder for global perception and decision. Next, actor-critic learning (ACL) is used to pre-train the AED for different optimization tasks under corresponding prompts, enhancing the AED's generalization in multi-task scenarios. Finally, we propose an active learning from expert feedback (ALEF) method to fine-tune the decoder of the AED for tracking changes in dynamic environments. Our simulation results validate the advantages of the proposed LAMBO framework.

69.3ITJun 2

On the Impact of Pinching Antennas on Traffic Offloading

Zhiguo Ding, Robert Schober, H. Vincent Poor

Pinching antennas are characterized by their capability to create strong line-of-sight connections and realize multi-antenna systems in a flexible manner. Existing works have demonstrated the significant potential of pinching antennas for physical layer design. The aim of this paper is to investigate how pinching antennas can be used to reshape the architecture of future networks. In particular, this paper is motivated by the key advantage of pinching antennas, which is to reconfigure the physical boundaries of wireless cells, and focuses on the impact of pinching antennas on traffic offloading. The models for traffic offloading and pinching antenna transmission are presented first. Then, two traffic offloading strategies are developed based on whether an offloading user releases its bandwidth in its original cell. An overall transmit power minimization problem is formulated, where the optimal solutions for the transmit powers and antenna locations are obtained. The presented simulation results demonstrate that the use of pinching antennas can efficiently support traffic offloading, yield low energy consumption, and achieve balanced cell resource utilization.

91.9ITJun 1

Rotatable Antenna-Enabled Satellite Communication: Joint Design of Boresight Alignment and Beam Tracking

Tiantian Ma, Beixiong Zheng, Changsheng You et al.

Low Earth orbit (LEO) satellite links experience rapid angular variation due to high orbital velocities, which causes severe beam misalignment and array gain degradation under conventional fixed-antenna architectures. In this letter, we propose a rotatable antenna (RA)-enabled LEO communication framework, where RA arrays are deployed at both the satellite and the ground node (GN) to exploit antenna boresight reconfiguration as an additional spatial degree-of-freedom (DoF) for maintaining directional alignment under high mobility. By leveraging the rank-one line-of-sight (LoS) channel structure inherent to satellite links, we derive closed-form solutions for the joint design of the transmit/receive beamforming and antenna boresight directions, revealing that optimal performance can be achieved via decoupled alignment across antennas with low computational complexity. To enable practical operation under dynamic conditions, we further develop a channel estimation and beam tracking protocol that exploits the predictable satellite orbit to continuously update boresight directions with low training overhead. Simulation results demonstrate that the proposed RA-enabled design significantly outperforms fixed and random boresight baselines in terms of achievable rate and robustness to angular variations, highlighting the effectiveness of rotational spatial reconfiguration in high-mobility satellite communications.

92.6ITMar 26

Rotatable Antenna-Empowered Wireless Networks: A Tutorial

Beixiong Zheng, Qingjie Wu, Xue Xiong et al.

Non-fixed flexible antenna architectures, such as fluid antenna system (FAS), movable antenna (MA), and pinching antenna, have garnered significant interest in recent years. Among them, rotatable antenna (RA) has emerged as a promising technology for enhancing wireless communication and sensing performance through flexible antenna orientation/boresight rotation. By enabling mechanical or electronic boresight adjustment without altering physical antenna positions, RA introduces additional spatial degrees of freedom (DoFs) beyond conventional beamforming. In this paper, we provide a comprehensive tutorial on the fundamentals, architectures, and applications of RA-empowered wireless networks. Specifically, we begin by reviewing the historical evolution of RA-related technologies and clarifying the distinctive role of RA among flexible antenna architectures. Then, we establish a unified mathematical framework for RA-enabled systems, including general antenna/array rotation models, as well as channel models that cover near- and far-field propagation characteristics, wideband frequency selectivity, and polarization effects. Building upon this foundation, we investigate antenna/array rotation optimization in representative communication and sensing scenarios. Furthermore, we examine RA channel estimation/acquisition strategies encompassing orientation scheduling mechanisms and signal processing methods that exploit multi-view channel observations. Beyond theoretical modeling and algorithmic design, we discuss practical RA configurations and deployment strategies. We also present recent RA prototypes and experimental results that validate the practical performance gains enabled by antenna rotation. Finally, we highlight promising extensions of RA to emerging wireless paradigms and outline open challenges to inspire future research.

23.8ETApr 16

Source Distance Estimation in Turbulent Airflow: Exploiting Molecule Degradation Diversity

Bastian Heinlein, Timo Jakumeit, Robert Schober et al.

In nature, estimating the location of a molecule source in turbulent airflow is a central, and yet highly challenging problem for mate search and foraging. Recently, it has also received increasing attention in synthetic molecular communication (SMC), e.g., for leakage detection. One important aspect of source localization is to estimate the distance to the molecule source, e.g., to determine whether it is worth to travel to a potential mating partner or food source, or to decide whether a leak is close enough for inspection. In this study, based on realistic simulations, we show that the diversity induced by molecule mixtures can aid source localization. In particular, when different molecule types in a mixture are subject to atmospheric degradation with different degradation rates, the relative abundance of the different species observed at the receiver enables low-complexity estimation of the source distance. Furthermore, this feature can be combined with already established concentration-based and temporal features of observed molecular signals to further increase estimation accuracy. Thereby, we show that molecule degradation diversity of molecule mixtures can help to realize one of the important envisioned SMC applications, namely source localization, even in turbulent airflow, opening new opportunities for the exploitation of SMC to solve real-world problems.

14.5ITMar 24

Autoencoder-based Optimization of Multi-user Molecule Mixture Communication Systems

Bastian Heinlein, Nuria Zurita Jiménez, Kaikai Zhu et al.

In this paper, we introduce an autoencoder (AE)-based scheme for end-to-end optimization of a multi-user molecule mixture communication system. In the proposed scheme, each transmitter leverages an encoder network that maps the user symbol to a molecule mixture. The mixtures then propagate through the channel to the receiver, which samples the channel using a non-linear, cross-reactive sensor array. A decoder network then estimates the symbol transmitted by each user based on the sensor observations. The proposed scheme achieves, for a given signal-to-noise ratio, lower symbol error rates than a baseline scheme from the literature in a single-user setting with full channel state information. We additionally demonstrate that the proposed AE-based scheme allows reliable communication when the channel is unknown or changing. Finally, we show that for multiple access the system can account for different user priorities. In summary, the proposed AE-based scheme enables end-to-end system optimization in complex scenarios unsuitable for analytical treatment and thereby brings molecular communication systems closer to real-world deployment.

31.5SYMar 16

Matched Filter-Based Molecule Source Localization in Advection-Diffusion-Driven Pipe Networks with Known Topology

Timo Jakumeit, Bastian Heinlein, Vukašin Spasojević et al.

Synthetic molecular communication (MC) has emerged as a powerful framework for modeling, analyzing, and designing communication systems where information is encoded into properties of molecules. Among the envisioned applications of MC is the localization of molecule sources in pipe networks (PNs) like the human cardiovascular system (CVS), sewage networks (SNs), and industrial plants. While existing algorithms mostly focus on simplified scenarios, in this paper, we propose the first framework for source localization in complex PNs with known topology, by leveraging the mixture of inverse Gaussians for hemodynamic transport (MIGHT) model as a closed-form representation for advection-diffusion-driven MC in PNs. We propose a matched filter (MF)-based approach to identify molecule sources under realistic conditions such as unknown release times, random numbers of released molecules, sensor noise, and limited sensor sampling rate. We apply the algorithm to localize a source of viral markers in a real-world SN and show that the proposed scheme outperforms randomly guessing sources even at low signal-to-noise ratios (SNRs) at the sensor and achieves error-free localization under favorable conditions, i.e., high SNRs and sampling rates. Furthermore, by identifying clusters of frequently confused sources, reliable cluster-level localization is possible at substantially lower SNRs and sampling rates.

11.9ITApr 11

Deep Reinforcement Learning for Cognitive Time-Division Joint SAR and Secure Communications

Mohamed-Amine Lahmeri, Ata Khalili, Yujiao Liu et al.

Synthetic aperture radar (SAR) imaging can be exploited to enhance wireless communication performance through high-precision environmental awareness. However, integrating sensing and communication functionalities in such wideband systems remains challenging, motivating the development of a joint SAR and communication (JSARC) framework. We propose a dynamic time-division JSARC (TD-JSARC) framework for secure aerial communications that is relevant for critical scenarios, such as surveillance or post-disaster communication, where conventional localization of mobile adversaries often fails. In particular, we consider a secure downlink communication scenario where an aerial base station (ABS) serves a ground user (UE) in the presence of a ground-moving eavesdropper. To detect and track the eavesdropper, the ABS uses cognitive SAR along-track interferometry (ATI) to estimate its position and velocity. Based on these estimates, the ABS applies adaptive beamforming and artificial-noise jamming to enhance secrecy. To this end, we jointly optimize the time and power allocation to maximize the worst-case secrecy rate, while satisfying both SAR and communication constraints. Using the estimated eavesdropper trajectory, we formulate the problem as a Markov decision process (MDP) and solve it via deep reinforcement learning (DRL). Simulation results show that the proposed learning-based approach outperforms both learning and non-learning baseline schemes employing equal-aperture and random time allocation. The proposed method also generalizes well to previously unseen eavesdropper motion patterns.

64.5ITApr 1

Multipath Channel Metrics and Detection in Vascular Molecular Communication: A Wireless-Inspired Perspective

Timo Jakumeit, Lukas Brand, Josep M. Jornet et al.

Motivated by classical communications engineering, early works in molecular communication (MC) largely adopted established modeling and signal processing concepts from wireless electromagnetic communication systems. In the context of the human cardiovascular system (CVS), MC channel models evolved from simple unbounded and single-duct environments mimicking individual blood vessels to complex vessel network (VN) topologies, generally at the expense of analytical tractability. Up until now, this has largely prohibited rigorous communication-theoretic analysis of large-scale VNs. In this work, we leverage a recently established closed-form analytical channel model for VNs, named mixture of inverse Gaussians for hemodynamic transport (MIGHT), to conduct the first systematic communication-theoretic study of MC in complex, large-scale VNs. Based on MIGHT, we derive a Poisson channel noise model and unveil structural analogies between multipath wireless communications (MWC) and advective-diffusive MC in VNs. In particular, we establish classical MWC metrics, namely the root mean squared (RMS) delay spread, the mean excess delay, and the coherence bandwidth, for MC in VNs and derive closed-form expressions for the channel frequency response and power delay profile (PDP). Building on this characterization, we propose a VN-adapted, coherent decision-feedback (DF) detector and show how the derived multipath metrics can inform the choice of critical system parameters like the symbol duration, the sampling time, and the memory length. Additionally, we evaluate the detector's performance in different VNs exhibiting inter-symbol interference (ISI). Together, these contributions open the door to a systematic, MWC-inspired MC system design for large-scale VNs.

AIJan 27

ComAgent: Multi-LLM based Agentic AI Empowered Intelligent Wireless Networks

Haoyun Li, Ming Xiao, Kezhi Wang et al.

Emerging 6G networks rely on complex cross-layer optimization, yet manually translating high-level intents into mathematical formulations remains a bottleneck. While Large Language Models (LLMs) offer promise, monolithic approaches often lack sufficient domain grounding, constraint awareness, and verification capabilities. To address this, we present ComAgent, a multi-LLM agentic AI framework. ComAgent employs a closed-loop Perception-Planning-Action-Reflection cycle, coordinating specialized agents for literature search, coding, and scoring to autonomously generate solver-ready formulations and reproducible simulations. By iteratively decomposing problems and self-correcting errors, the framework effectively bridges the gap between user intent and execution. Evaluations demonstrate that ComAgent achieves expert-comparable performance in complex beamforming optimization and outperforms monolithic LLMs across diverse wireless tasks, highlighting its potential for automating design in emerging wireless networks.

71.8ITMar 25

Robust and Secure Near-Field Communication via Curved Caustic Beams

Shicong Liu, Xianghao Yu, Robert Schober

Near-field beamfocusing with extremely large aperture arrays can effectively enhance physical layer security. Nevertheless, even small estimation errors of the eavesdropper's location may cause a pronounced focal shift, resulting in a severe degradation of the secrecy rate. In this letter, we propose a physics-informed robust beamforming strategy that leverages the electromagnetic (EM) caustic effect for near-field physical layer security provisioning, which can be implemented via phase shifts only. Specifically, we partition the transmit array into caustic and focusing subarrays to simultaneously bypass the potential eavesdropping region and illuminate the legitimate user, thereby significantly improving the robustness against the localization error of eavesdroppers. Moreover, by leveraging the connection between the phase gradient and the EM wave departing angle, we derive the corresponding piece-wise closed-form array phase profile for the subarrays. Simulation results demonstrate that the proposed scheme achieves up to an 80% reduction of the worst-case eavesdropping rate for a localization error of 0.25 m, highlighting its superiority for providing robust and secure communication.

LGJan 8, 2025

Federated Fine-Tuning of LLMs: Framework Comparison and Research Directions

Na Yan, Yang Su, Yansha Deng et al.

Federated learning (FL) provides a privacy-preserving solution for fine-tuning pre-trained large language models (LLMs) using distributed private datasets, enabling task-specific adaptation while preserving data privacy. However, fine-tuning the extensive parameters in LLMs is particularly challenging in resource-constrained federated scenarios due to the significant communication and computational costs. To gain a deeper understanding of how these challenges can be addressed, this article conducts a comparative analysis three advanced federated LLM (FedLLM) frameworks that integrate knowledge distillation (KD) and split learning (SL) to mitigate these issues: 1) FedLLMs, where clients upload model parameters or gradients to enable straightforward and effective fine-tuning; 2) KD-FedLLMs, which leverage KD for efficient knowledge sharing via logits; and 3) Split-FedLLMs, which split the LLMs into two parts, with one part executed on the client and the other one on the server, to balance the computational load. Each framework is evaluated based on key performance metrics, including model accuracy, communication overhead, and client-side computational load, offering insights into their effectiveness for various federated fine-tuning scenarios. Through this analysis, we identify framework-specific optimization opportunities to enhance the efficiency of FedLLMs and discuss broader research directions, highlighting open opportunities to better adapt FedLLMs for real-world applications. A use case is presented to demonstrate the performance comparison of these three frameworks under varying configurations and settings.

60.8ETApr 9

Analytical Modeling of Dispersive Closed-loop MC Channels with Pulsatile Flow

Theofilos Symeonidis, Fardad Vakilipoor, Robert Schober et al.

Molecular communication (MC) is a communication paradigm in which information is conveyed through the controlled release, propagation, and reception of molecules. Many envisioned healthcare applications of MC are expected to operate inside the human body. In this environment, the cardiovascular system ( CVS) acts as the physical channel, which forms a closed-loop network where particle transport is mainly governed by the combined effects of diffusion and flow. Despite the fact that physiological flows in many parts of the human body are inherently pulsatile due to the cardiac cycle, most existing models for dispersive closed-loop MC channels assume a constant flow velocity. In this paper, we present a time-variant one-dimensional (1D ) channel model for dispersive closed-loop MC systems with pulsatile flow. We derive an analytical expression for the channel impulse response (CIR ), which follows a wrapped Normal distribution with time-variant mean and variance. The obtained model reveals the cyclostationary nature of the channel and quantifies the influence of pulsation on the temporal concentration profile compared to steady-flow systems. Finally, the model is validated by three-dimensional ( 3D ) particle-based simulations (PBS s), showing excellent agreement and enabling an efficient analytical characterization of the channel.

SPOct 21, 2025

DiffPace: Diffusion-based Plug-and-play Augmented Channel Estimation in mmWave and Terahertz Ultra-Massive MIMO Systems

Zhengdong Hu, Chong Han, Wolfgang Gerstacker et al.

Millimeter-wave (mmWave) and Terahertz (THz)-band communications hold great promise in meeting the growing data-rate demands of next-generation wireless networks, offering abundant bandwidth. To mitigate the severe path loss inherent to these high frequencies and reduce hardware costs, ultra-massive multiple-input multiple-output (UM-MIMO) systems with hybrid beamforming architectures can deliver substantial beamforming gains and enhanced spectral efficiency. However, accurate channel estimation (CE) in mmWave and THz UM-MIMO systems is challenging due to high channel dimensionality and compressed observations from a limited number of RF chains, while the hybrid near- and far-field radiation patterns, arising from large array apertures and high carrier frequencies, further complicate CE. Conventional compressive sensing based frameworks rely on predefined sparsifying matrices, which cannot faithfully capture the hybrid near-field and far-field channel structures, leading to degraded estimation performance. This paper introduces DiffPace, a diffusion-based plug-and-play method for channel estimation. DiffPace uses a diffusion model (DM) to capture the channel distribution based on the hybrid spherical and planar-wave (HPSM) model. By applying the plug-and-play approach, it leverages the DM as prior knowledge, improving CE accuracy. Moreover, DM performs inference by solving an ordinary differential equation, minimizing the number of required inference steps compared with stochastic sampling method. Experimental results show that DiffPace achieves competitive CE performance, attaining -15 dB normalized mean square error (NMSE) at a signal-to-noise ratio (SNR) of 10 dB, with 90\% fewer inference steps compared to state-of-the-art schemes, simultaneously providing high estimation precision and enhanced computational efficiency.

LGMay 13, 2025

PWC-MoE: Privacy-Aware Wireless Collaborative Mixture of Experts

Yang Su, Na Yan, Yansha Deng et al.

Large language models (LLMs) hosted on cloud servers alleviate the computational and storage burdens on local devices but raise privacy concerns due to sensitive data transmission and require substantial communication bandwidth, which is challenging in constrained environments. In contrast, small language models (SLMs) running locally enhance privacy but suffer from limited performance on complex tasks. To balance computational cost, performance, and privacy protection under bandwidth constraints, we propose a privacy-aware wireless collaborative mixture of experts (PWC-MoE) framework. Specifically, PWC-MoE employs a sparse privacy-aware gating network to dynamically route sensitive tokens to privacy experts located on local clients, while non-sensitive tokens are routed to non-privacy experts located at the remote base station. To achieve computational efficiency, the gating network ensures that each token is dynamically routed to and processed by only one expert. To enhance scalability and prevent overloading of specific experts, we introduce a group-wise load-balancing mechanism for the gating network that evenly distributes sensitive tokens among privacy experts and non-sensitive tokens among non-privacy experts. To adapt to bandwidth constraints while preserving model performance, we propose a bandwidth-adaptive and importance-aware token offloading scheme. This scheme incorporates an importance predictor to evaluate the importance scores of non-sensitive tokens, prioritizing the most important tokens for transmission to the base station based on their predicted importance and the available bandwidth. Experiments demonstrate that the PWC-MoE framework effectively preserves privacy and maintains high performance even in bandwidth-constrained environments, offering a practical solution for deploying LLMs in privacy-sensitive and bandwidth-limited scenarios.

LGNov 10, 2024

HAFLQ: Heterogeneous Adaptive Federated LoRA Fine-tuned LLM with Quantization

Yang Su, Na Yan, Yansha Deng et al.

Federated fine-tuning of pre-trained Large Language Models (LLMs) enables task-specific adaptation across diverse datasets while preserving privacy. However, challenges such as high computational and memory demands, heterogeneous client resources, bandwidth constraints, and ineffective global aggregation hinder its efficiency. To address these issues, we propose HAFLQ (Heterogeneous Adaptive Federated Low-Rank Adaptation Fine-tuned LLM with Quantization), a novel framework for efficient and scalable federated fine-tuning of LLMs in heterogeneous environments. To reduce memory and computation demands, we propose a salience-driven adaptive LLM quantization framework that evaluates the importance of transformer blocks using a salience metric and applies adaptive block-wise quantization accordingly. To handle heterogeneous computational capabilities, we propose an importance-based parameter truncation and freezing scheme. To address communication bottlenecks, we propose an importance-aware bandwidth-adaptive quantization method, which dynamically adjusts parameter precision based on importance and bandwidth constraints. To improve global model aggregation, we propose an adaptive rank-1 matrix-level aggregation strategy, which prevents information dilution and accelerates convergence by aggregating only updated rank-1 matrices from clients. Experimental results on the text classification task demonstrate that HAFLQ reduces memory usage by 31%, lowers communication cost by 49%, improves accuracy by 50%, and achieves faster convergence compared to the baseline method.

ITNov 25, 2020

Deep Learning-based Resource Allocation For Device-to-Device Communication

Woongsup Lee, Robert Schober

In this paper, a deep learning (DL) framework for the optimization of the resource allocation in multi-channel cellular systems with device-to-device (D2D) communication is proposed. Thereby, the channel assignment and discrete transmit power levels of the D2D users, which are both integer variables, are optimized to maximize the overall spectral efficiency whilst maintaining the quality-of-service (QoS) of the cellular users. Depending on the availability of channel state information (CSI), two different configurations are considered, namely 1) centralized operation with full CSI and 2) distributed operation with partial CSI, where in the latter case, the CSI is encoded according to the capacity of the feedback channel. Instead of solving the resulting resource allocation problem for each channel realization, a DL framework is proposed, where the optimal resource allocation strategy for arbitrary channel conditions is approximated by deep neural network (DNN) models. Furthermore, we propose a new training strategy that combines supervised and unsupervised learning methods and a local CSI sharing strategy to achieve near-optimal performance while enforcing the QoS constraints of the cellular users and efficiently handling the integer optimization variables based on a few ground-truth labels. Our simulation results confirm that near-optimal performance can be attained with low computation time, which underlines the real-time capability of the proposed scheme. Moreover, our results show that not only the resource allocation strategy but also the CSI encoding strategy can be efficiently determined using a DNN. Furthermore, we show that the proposed DL framework can be easily extended to communications systems with different design objectives.

ITOct 19, 2020

A Comprehensive Overview on 5G-and-Beyond Networks with UAVs: From Communications to Sensing and Intelligence

Qingqing Wu, Jie Xu, Yong Zeng et al.

Due to the advancements in cellular technologies and the dense deployment of cellular infrastructure, integrating unmanned aerial vehicles (UAVs) into the fifth-generation (5G) and beyond cellular networks is a promising solution to achieve safe UAV operation as well as enabling diversified applications with mission-specific payload data delivery. In particular, 5G networks need to support three typical usage scenarios, namely, enhanced mobile broadband (eMBB), ultra-reliable low-latency communications (URLLC), and massive machine-type communications (mMTC). On the one hand, UAVs can be leveraged as cost-effective aerial platforms to provide ground users with enhanced communication services by exploiting their high cruising altitude and controllable maneuverability in three-dimensional (3D) space. On the other hand, providing such communication services simultaneously for both UAV and ground users poses new challenges due to the need for ubiquitous 3D signal coverage as well as the strong air-ground network interference. Besides the requirement of high-performance wireless communications, the ability to support effective and efficient sensing as well as network intelligence is also essential for 5G-and-beyond 3D heterogeneous wireless networks with coexisting aerial and ground users. In this paper, we provide a comprehensive overview of the latest research efforts on integrating UAVs into cellular networks, with an emphasis on how to exploit advanced techniques (e.g., intelligent reflecting surface, short packet transmission, energy harvesting, joint communication and radar sensing, and edge intelligence) to meet the diversified service requirements of next-generation wireless systems. Moreover, we highlight important directions for further investigation in future work.