Li Qiao

IT
h-index116
9papers
196citations
Novelty56%
AI Score32

9 Papers

SYNov 25, 2016
Computationally Efficient Unscented Kalman Filtering Techniques for Launch Vehicle Navigation using a Space-borne GPS Receiver

Sanat Biswas, Li Qiao, Andrew Dempster

The Extended Kalman Filter (EKF) is a well established technique for position and velocity estimation. However, the performance of the EKF degrades considerably in highly non-linear system applications as it requires local linearisation in its prediction stage. The Unscented Kalman Filter (UKF) was developed to address the non-linearity in the system by deterministic sampling. The UKF provides better estimation accuracy than the EKF for highly non-linear systems. However, the UKF requires multiple propagations of sampled state vectors in the measurement interval, which results in higher processing time than for the EKF. This paper proposes an application of two newly developed UKF variants in launch vehicle navigation. These two algorithms called the Single Propagation Unscented Kalman Filter (SPUKF) and the Extrapolated Single Propagation Unscented Kalman Filter (ESPUKF), reduce the processing time of the original UKF significantly and provide estimation accuracies comparable to the UKF. The estimation performance of the SPUKF and the ESPUKF is demonstrated using Falcon 9 V1.1 launch vehicle in CRS-5 mission scenario. The launch vehicle trajectory for the mission is generated using publicly available mission parameters. A SPIRENT GNSS simulator is used to generate the received GPS signal on the trajectory. Pseudo-range observations are used in the EKF, UKF, SPUKF and the ESPUKF separately and the estimation accuracies are compared. The results show that the estimation errors of the SPUKF and the ESPUKF are 15.44% and 10.52% higher than the UKF respectively. The processing time reduces by 83% for the SPUKF and 69.14% for the ESPUKF compared to the UKF.

SYNov 25, 2016
Position and Velocity estimation of Re-entry Vehicles using Fast Unscented Kalman Filters

Sanat Biswas, Li Qiao, Andrew Dempster

Application of two new UKF based estimation techniques with reduced processing time in re-entry vehicle position and velocity estimation problem using ground-based range and elevation measurements is presented. The first method is called the Single Propagation Unscented Kalman Filter (SPUKF) where, the a postiriori state is propagated only once and then the sampled sigma points at the next time state are approximated by the first-order Taylor Series terms. In the second method called the Extrapolated Single Propagation Unscented Kalman Filter (ESPUKF), the sigma points are approximated to the second-order Taylor Series terms using the Richardson Extrapolation. The EKF, SPUKF, ESPUKF and the UKF are utilized in a re-entry vehicle navigation scenario using range and elevation measurements. The estimation accuracies and the processing times for different algorithms are compared for the scenario. The result demonstrates that the UKF provides better accuracy than the EKF but requires more processing time. The SPUKF accuracy is better than the EKF and the processing time is significantly less than the UKF. However, the accuracy of the SPUKF is less than the UKF. The ESPUKF provides estimation accuracy comparable to the UKF and the processing time is also significantly reduced.

LGJul 30, 2024
Private Collaborative Edge Inference via Over-the-Air Computation

Selim F. Yilmaz, Burak Hasircioglu, Li Qiao et al.

We consider collaborative inference at the wireless edge, where each client's model is trained independently on its local dataset. Clients are queried in parallel to make an accurate decision collaboratively. In addition to maximizing the inference accuracy, we also want to ensure the privacy of local models. To this end, we leverage the superposition property of the multiple access channel to implement bandwidth-efficient multi-user inference methods. We propose different methods for ensemble and multi-view classification that exploit over-the-air computation (OAC). We show that these schemes perform better than their orthogonal counterparts with statistically significant differences while using fewer resources and providing privacy guarantees. We also provide experimental results verifying the benefits of the proposed OAC approach to multi-user inference, and perform an ablation study to demonstrate the effectiveness of our design choices. We share the source code of the framework publicly on Github to facilitate further research and reproducibility.

ITJun 5, 2024Code
CSI-GPT: Integrating Generative Pre-Trained Transformer with Federated-Tuning to Acquire Downlink Massive MIMO Channels

Ye Zeng, Li Qiao, Zhen Gao et al.

In massive multiple-input multiple-output (MIMO) systems, how to reliably acquire downlink channel state information (CSI) with low overhead is challenging. In this work, by integrating the generative pre-trained Transformer (GPT) with federated-tuning, we propose a CSI-GPT approach to realize efficient downlink CSI acquisition. Specifically, we first propose a Swin Transformer-based channel acquisition network (SWTCAN) to acquire downlink CSI, where pilot signals, downlink channel estimation, and uplink CSI feedback are jointly designed. Furthermore, to solve the problem of insufficient training data, we propose a variational auto-encoder-based channel sample generator (VAE-CSG), which can generate sufficient CSI samples based on a limited number of high-quality CSI data obtained from the current cell. The CSI dataset generated from VAE-CSG will be used for pre-training SWTCAN. To fine-tune the pre-trained SWTCAN for improved performance, we propose an online federated-tuning method, where only a small amount of SWTCAN parameters are unfrozen and updated using over-the-air computation, avoiding the high communication overhead caused by aggregating the complete CSI samples from user equipment (UEs) to the BS for centralized fine-tuning. Simulation results verify the advantages of the proposed SWTCAN and the communication efficiency of the proposed federated-tuning method. Our code is publicly available at https://github.com/BIT-ZY/CSI-GPT

ITMar 25, 2024
Latency-Aware Generative Semantic Communications with Pre-Trained Diffusion Models

Li Qiao, Mahdi Boloursaz Mashhadi, Zhen Gao et al.

Generative foundation AI models have recently shown great success in synthesizing natural signals with high perceptual quality using only textual prompts and conditioning signals to guide the generation process. This enables semantic communications at extremely low data rates in future wireless networks. In this paper, we develop a latency-aware semantic communications framework with pre-trained generative models. The transmitter performs multi-modal semantic decomposition on the input signal and transmits each semantic stream with the appropriate coding and communication schemes based on the intent. For the prompt, we adopt a re-transmission-based scheme to ensure reliable transmission, and for the other semantic modalities we use an adaptive modulation/coding scheme to achieve robustness to the changing wireless channel. Furthermore, we design a semantic and latency-aware scheme to allocate transmission power to different semantic modalities based on their importance subjected to semantic quality constraints. At the receiver, a pre-trained generative model synthesizes a high fidelity signal using the received multi-stream semantics. Simulation results demonstrate ultra-low-rate, low-latency, and channel-adaptive semantic communications.

MMFeb 17, 2025
Token Communications: A Large Model-Driven Framework for Cross-modal Context-aware Semantic Communications

Li Qiao, Mahdi Boloursaz Mashhadi, Zhen Gao et al.

In this paper, we introduce token communications (TokCom), a large model-driven framework to leverage cross-modal context information in generative semantic communications (GenSC). TokCom is a new paradigm, motivated by the recent success of generative foundation models and multimodal large language models (GFM/MLLMs), where the communication units are tokens, enabling efficient transformer-based token processing at the transmitter and receiver. In this paper, we introduce the potential opportunities and challenges of leveraging context in GenSC, explore how to integrate GFM/MLLMs-based token processing into semantic communication systems to leverage cross-modal context effectively at affordable complexity, present the key principles for efficient TokCom at various layers in future wireless networks. In a typical image semantic communication setup, we demonstrate a significant improvement of the bandwidth efficiency, achieved by TokCom by leveraging the context information among tokens. Finally, the potential research directions are identified to facilitate adoption of TokCom in future wireless networks.

SPFeb 19, 2025
Generative Video Semantic Communication via Multimodal Semantic Fusion with Large Model

Hang Yin, Li Qiao, Yu Ma et al.

Despite significant advancements in traditional syntactic communications based on Shannon's theory, these methods struggle to meet the requirements of 6G immersive communications, especially under challenging transmission conditions. With the development of generative artificial intelligence (GenAI), progress has been made in reconstructing videos using high-level semantic information. In this paper, we propose a scalable generative video semantic communication framework that extracts and transmits semantic information to achieve high-quality video reconstruction. Specifically, at the transmitter, description and other condition signals (e.g., first frame, sketches, etc.) are extracted from the source video, functioning as text and structural semantics, respectively. At the receiver, the diffusion-based GenAI large models are utilized to fuse the semantics of the multiple modalities for reconstructing the video. Simulation results demonstrate that, at an ultra-low channel bandwidth ratio (CBR), our scheme effectively captures semantic information to reconstruct videos aligned with human perception under different signal-to-noise ratios. Notably, the proposed ``First Frame+Desc." scheme consistently achieves CLIP score exceeding 0.92 at CBR = 0.0057 for SNR > 0 dB. This demonstrates its robust performance even under low SNR conditions.

ITNov 4, 2024
Communicate Less, Synthesize the Rest: Latency-aware Intent-based Generative Semantic Multicasting with Diffusion Models

Xinkai Liu, Mahdi Boloursaz Mashhadi, Li Qiao et al.

Generative diffusion models (GDMs) have recently shown great success in synthesizing multimedia signals with high perceptual quality, enabling highly efficient semantic communications in future wireless networks. In this paper, we develop an intent-aware generative semantic multicasting framework utilizing pre-trained diffusion models. In the proposed framework, the transmitter decomposes the source signal into multiple semantic classes based on the multi-user intent, i.e. each user is assumed to be interested in details of only a subset of the semantic classes. To better utilize the wireless resources, the transmitter sends to each user only its intended classes, and multicasts a highly compressed semantic map to all users over shared wireless resources that allows them to locally synthesize the other classes, namely non-intended classes, utilizing pre-trained diffusion models. The signal retrieved at each user is thereby partially reconstructed and partially synthesized utilizing the received semantic map. We design a communication/computation-aware scheme for per-class adaptation of the communication parameters, such as the transmission power and compression rate, to minimize the total latency of retrieving signals at multiple receivers, tailored to the prevailing channel conditions as well as the users' reconstruction/synthesis distortion/perception requirements. The simulation results demonstrate significantly reduced per-user latency compared with non-generative and intent-unaware multicasting benchmarks while maintaining high perceptual quality of the signals retrieved at the users.

LGMay 26, 2025
Residual Cross-Attention Transformer-Based Multi-User CSI Feedback with Deep Joint Source-Channel Coding

Hengwei Zhang, Minghui Wu, Li Qiao et al.

This letter proposes a deep-learning (DL)-based multi-user channel state information (CSI) feedback framework for massive multiple-input multiple-output systems, where the deep joint source-channel coding (DJSCC) is utilized to improve the CSI reconstruction accuracy. Specifically, we design a multi-user joint CSI feedback framework, whereby the CSI correlation of nearby users is utilized to reduce the feedback overhead. Under the framework, we propose a new residual cross-attention transformer architecture, which is deployed at the base station to further improve the CSI feedback performance. Moreover, to tackle the "cliff-effect" of conventional bit-level CSI feedback approaches, we integrated DJSCC into the multi-user CSI feedback, together with utilizing a two-stage training scheme to adapt to varying uplink noise levels. Experimental results demonstrate the superiority of our methods in CSI feedback performance, with low network complexity and better scalability.