Robert Caiming Qiu

9papers

56citations

Novelty53%

AI Score51

Ranked #39,775 of 205,806 authors (top 19%)#109 in IT (top 12%)

9 Papers

96.4ITJun 2

Encoded Jamming Secure Communication for RIS-Assisted Systems

Hao Yang, Hao Xu, Kai Wan et al.

This paper investigates a cooperative jamming (CJ)-aided secure wireless communication system. Conventional CJ schemes transmit Gaussian noise (GN) to improve security, which inherently degrades the legitimate receiver's performance. While encoded jamming (EJ) mitigates this interference, its superiority over GN is highly channel-dependent. To overcome this limitation, we introduce a joint optimization framework integrating a reconfigurable intelligent surface (RIS) with EJ to maximize the secrecy rate. \RED{We first establish the information-theoretic relationship between the EJ and GN schemes, identifying the spatial channel conditions that limit EJ performance. For the multiple-input single-output (MISO) scenario, we analytically derive the ergodic secrecy gap as the number of RIS elements grows large and obtain a positive EJ-over-GN gap under explicit power and channel conditions.} Furthermore, for the general multiple-input multiple-output (MIMO) setup, we develop a low-complexity algorithm based on the weighted minimum mean-square-error (WMMSE) framework to handle the resulting non-smooth max-min structure through a WMMSE-based mode-selection framework. By introducing a parameterized function abstraction, the transmit precoding matrices and the RIS phase shift matrix are jointly optimized via block coordinate descent (BCD). Simulation results support the analysis and show that, under the evaluated settings, RIS-assisted EJ can overcome the identified spatial bottlenecks and outperform the optimized GN baseline.

82.3ITMay 26

Reliability-Constrained Blind Beam Alignment for Backscatter-MIMO mounted Target in Cluttered Multipath Channels

Xuehui Dong, Kai Wan, Gui Zhou et al.

Practical ISAC is constrained by static clutter and NLoS multipath, which obscure target-coupled echoes and induce spurious peaks for beam alignment. Existing receiver-side methods largely model targets as passive scatterers, limiting the structural separability of target echoes from the environment. This paper establishes a structural correspondence between these limitations and target-side Backscatter-MIMO responses: reflection modulation enables waveform-domain separation from unmodulated clutter, while retro-directional passive beamforming concentrates the tagged echo toward the BS-facing direction and suppresses NLoS-induced false-peak locking. To operationalize this correspondence, dual-end spatial locking is required to overcome cascaded backscatter loss and provide beam-domain angular information. We propose a downlink-triggered blind dual-end alignment protocol that jointly selects the BS and Backscatter-MIMO codeword indices from the tagged echo observed at the BS, without pilots, CSI feedback, or target synchronization. We further derive a clutter-aware remodulation waveform robust to fractional timing offsets and construct adjustable-width BS/Backscatter-MIMO codebooks via quadratic phase spoiling. For reliability characterization, we derive closed-form expressions for the coherence-averaged end-to-end success probability. The analysis shows that beam narrowing is not universally beneficial: in NLoS-dominated regimes, enlarging the array aperture may degrade alignment reliability. The optimal beamwidth is instead governed by cross-phase competition between discovery and alignment, yielding a nontrivial feasible region with an analytically characterized boundary. Simulations validate the analysis and demonstrate improved reliability-gated locked-link performance under strong clutter, severe NLoS multipath, and finite coherence time.

CVAug 7, 2024

PoseMamba: Monocular 3D Human Pose Estimation with Bidirectional Global-Local Spatio-Temporal State Space Model

Yunlong Huang, Junshuo Liu, Ke Xian et al.

Transformers have significantly advanced the field of 3D human pose estimation (HPE). However, existing transformer-based methods primarily use self-attention mechanisms for spatio-temporal modeling, leading to a quadratic complexity, unidirectional modeling of spatio-temporal relationships, and insufficient learning of spatial-temporal correlations. Recently, the Mamba architecture, utilizing the state space model (SSM), has exhibited superior long-range modeling capabilities in a variety of vision tasks with linear complexity. In this paper, we propose PoseMamba, a novel purely SSM-based approach with linear complexity for 3D human pose estimation in monocular video. Specifically, we propose a bidirectional global-local spatio-temporal SSM block that comprehensively models human joint relations within individual frames as well as temporal correlations across frames. Within this bidirectional global-local spatio-temporal SSM block, we introduce a reordering strategy to enhance the local modeling capability of the SSM. This strategy provides a more logical geometric scanning order and integrates it with the global SSM, resulting in a combined global-local spatial scan. We have quantitatively and qualitatively evaluated our approach using two benchmark datasets: Human3.6M and MPI-INF-3DHP. Extensive experiments demonstrate that PoseMamba achieves state-of-the-art performance on both datasets while maintaining a smaller model size and reducing computational costs. The code and models will be released.

32.0ITApr 14

On Secure Gradient Coding with Uncoded Groupwise Keys

Xudong You, Kai Wan, Xiang Zhang et al.

This paper considers a new secure gradient coding problem with uncoded groupwise keys, formalized as a (K, N, N_r, M, S) secure gradient coding model, where a user aims to compute the sum of the gradients from K datasets with the assistance of N distributed servers. We consider arbitrary heterogeneous data assignment, where each dataset is assigned to at least M servers. The user should recover the sum of gradients from the transmissions of any N_r servers. The security constraint guarantees that even if the user receives the transmitted messages from all servers, it cannot obtain any other information about the datasets except the sum of gradients. Compared to existing secure gradient coding works, we introduce a practical constraint on secret keys, namely uncoded groupwise keys, where the keys are mutually independent and each key is shared by precisely S servers. An achievable secure gradient coding scheme with uncoded groupwise keys is proposed, which is then proven to be optimal if S > M and to be order optimal within a factor of 2 otherwise.

70.6ITMay 7

A Low-Complexity Framework for Multi-access Coded Caching Systems with Arbitrary User-cache Access Topology

Ting Yang, Kai Wan, Minquan Cheng et al.

This paper studies the multi-access coded caching (MACC) problem with arbitrary user-cache access topology, which extends existing MACC models that rely on highly structured and combinatorially designed topologies. We consider a MACC system consisting of a single server, $Λ$ cache-nodes, and $K$ user-nodes. The server stores $N$ equal-size files, each cache-node has a storage capacity of $M$ files, and each user-node $k\in[K]$ can access an arbitrary subset of cache-nodes $\mathcal{A}_k\subseteq[Λ]$ and retrieve the cached content stored in cache-nodes $\mathcal{A}_k$. The objective is to design a universal framework for the MACC delivery problem. Decoding conflicts among the requested packets are captured by a conflict graph, and the design of the delivery is reduced to a graph coloring problem, where achieving a lower transmission load corresponds to coloring the graph using fewer colors. Under this formulation, the classical DSatur algorithm achieves a transmission load close to the index-coding (IC) converse bound, thereby providing a practical benchmark. However, its computational complexity becomes prohibitive for large-scale graphs. To overcome this limitation, we develop a learning-driven approach using graph neural networks (GNNs) that efficiently constructs coded multicast transmissions with performance close to the theoretical bounds and generalizes across different user-cache access topologies and numbers of users. In addition, we extend the IC converse bound to MACC systems with arbitrary access topology and propose a low-complexity greedy approximation that closely matches the IC converse bound. Numerical results demonstrate that the proposed approach achieves performance close to the DSatur algorithm and the IC converse bound, while significantly reducing computational complexity, making it well-suited for large-scale MACC systems.

AIJul 31, 2024

TRGR: Transmissive RIS-aided Gait Recognition Through Walls

Yunlong Huang, Junshuo Liu, Jianan Zhang et al.

Gait recognition with radio frequency (RF) signals enables many potential applications requiring accurate identification. However, current systems require individuals to be within a line-of-sight (LOS) environment and struggle with low signal-to-noise ratio (SNR) when signals traverse concrete and thick walls. To address these challenges, we present TRGR, a novel transmissive reconfigurable intelligent surface (RIS)-aided gait recognition system. TRGR can recognize human identities through walls using only the magnitude measurements of channel state information (CSI) from a pair of transceivers. Specifically, by leveraging transmissive RIS alongside a configuration alternating optimization algorithm, TRGR enhances wall penetration and signal quality, enabling accurate gait recognition. Furthermore, a residual convolution network (RCNN) is proposed as the backbone network to learn robust human information. Experimental results confirm the efficacy of transmissive RIS, highlighting the significant potential of transmissive RIS in enhancing RF-based gait recognition systems. Extensive experiment results show that TRGR achieves an average accuracy of 97.88\% in identifying persons when signals traverse concrete walls, demonstrating the effectiveness and robustness of TRGR.

CVSep 15, 2020

A Robust and Reliable Point Cloud Recognition Network Under Rigid Transformation

Dongrui Liu, Chuanchuan Chen, Changqing Xu et al.

Point cloud recognition is an essential task in industrial robotics and autonomous driving. Recently, several point cloud processing models have achieved state-of-the-art performances. However, these methods lack rotation robustness, and their performances degrade severely under random rotations, failing to extend to real-world scenarios with varying orientations. To this end, we propose a method named Self Contour-based Transformation (SCT), which can be flexibly integrated into various existing point cloud recognition models against arbitrary rotations. SCT provides efficient rotation and translation invariance by introducing Contour-Aware Transformation (CAT), which linearly transforms Cartesian coordinates of points to translation and rotation-invariant representations. We prove that CAT is a rotation and translation-invariant transformation based on the theoretical analysis. Furthermore, the Frame Alignment module is proposed to enhance discriminative feature extraction by capturing contours and transforming self contour-based frames into intra-class frames. Extensive experimental results show that SCT outperforms the state-of-the-art approaches under arbitrary rotations in effectiveness and efficiency on synthetic and real-world benchmarks. Furthermore, the robustness and generality evaluations indicate that SCT is robust and is applicable to various point cloud processing models, which highlights the superiority of SCT in industrial applications.

SPMay 14, 2019

LEMO: Learn to Equalize for MIMO-OFDM Systems with Low-Resolution ADCs

Lei Chu, Ling Pei, Husheng Li et al.

This paper develops a new deep neural network optimized equalization framework for massive multiple input multiple output orthogonal frequency division multiplexing (MIMOOFDM) systems that employ low-resolution analog-to-digital converters (ADCs) at the base station (BS). The use of lowresolution ADCs could largely reduce hardware complexity and circuit power consumption, however, it makes the channel station information almost blind to the BS, hence causing difficulty in solving the equalization problem. In this paper, we consider a supervised learning architecture, where the goal is to learn a representative function that can predict the targets (constellation points) from the inputs (outputs of the low-resolution ADCs) based on the labeled training data (pilot signals). Especially, our main contributions are two-fold: 1) First, we design a new activation function, whose outputs are close to the constellation points when the parameters are finally optimized, to help us fully exploit the stochastic gradient descent method for the discrete optimization problem. 2) Second, an unsupervised loss is designed and then added to the optimization objective, aiming to enhance the representation ability (so-called generalization). Lastly, various experimental results confirm the superiority of the proposed equalizer over some existing ones, particularly when the statistics of the channel state information are unclear.

MEJan 31, 2015

A Random Matrix Theoretical Approach to Early Event Detection in Smart Grid

Xing He, Robert Caiming Qiu, Qian Ai et al.

Power systems are developing very fast nowadays, both in size and in complexity; this situation is a challenge for Early Event Detection (EED). This paper proposes a data- driven unsupervised learning method to handle this challenge. Specifically, the random matrix theories (RMTs) are introduced as the statistical foundations for random matrix models (RMMs); based on the RMMs, linear eigenvalue statistics (LESs) are defined via the test functions as the system indicators. By comparing the values of the LES between the experimental and the theoretical ones, the anomaly detection is conducted. Furthermore, we develop 3D power-map to visualize the LES; it provides a robust auxiliary decision-making mechanism to the operators. In this sense, the proposed method conducts EED with a pure statistical procedure, requiring no knowledge of system topologies, unit operation/control models, etc. The LES, as a key ingredient during this procedure, is a high dimensional indictor derived directly from raw data. As an unsupervised learning indicator, the LES is much more sensitive than the low dimensional indictors obtained from supervised learning. With the statistical procedure, the proposed method is universal and fast; moreover, it is robust against traditional EED challenges (such as error accumulations, spurious correlations, and even bad data in core area). Case studies, with both simulated data and real ones, validate the proposed method. To manage large-scale distributed systems, data fusion is mentioned as another data processing ingredient.