Ju Liu

IT
h-index25
16papers
2,166citations
Novelty42%
AI Score55

16 Papers

21.1ITMay 23
Two-Stage Coded-Sliding Beam Training and QoS-Constrained Sum-Rate Maximization for SIM-Assisted Wireless Communications

Qian Zhang, Ju Liu, Yao Ge et al.

Stacked intelligent metasurfaces (SIM) provide a cost-effective and scalable solution for large-scale antenna communications.However, efficient channel state information acquisition and phase shift optimization remain critical challenges. In this paper, we develop a unified framework of low-complexity algorithms for SIM-assisted communication systems to address these issues. Specifically, we propose a generalized two-step codebook construction (TSCC) method that leverages two-dimensional angular-domain decoupling to transform planar array beamformer design into two independent one-dimensional linear array beamformer design problems, efficiently solved via the Gerchberg-Saxton algorithm and our proposed majorization-minimization-based proximal distance (PDMM) algorithm. We further develop a two-stage coded-sliding beam training (TSCSBT) method for low-overhead and high-accuracy beam training, where error-correcting codes are embedded in the first-stage training to enhance robustness against noise, and sliding sampling is subsequently performed around the matched angular samples to improve angular resolution. The proposed framework is further extended to multi-path user channels. Finally, a variable decoupling-based block successive upper bound minimization (VD-BSUM) algorithm is proposed to directly solve the QoS-constrained sum-rate maximization problem through closed-form iterative updates with substantially reduced computational complexity. Simulation results demonstrate the effectiveness of the proposed methods in achieving precise beam pattern realization, improved beam training accuracy and angular resolution, and enhanced sum-rate performance.

NADec 28, 2018
An energy-stable mixed formulation for isogeometric analysis of incompressible hyper-elastodynamics

Ju Liu, Alison L. Marsden, Zhen Tao

We develop a mixed formulation for incompressible hyper-elastodynamics based on a continuum modeling framework recently developed and smooth generalizations of the Taylor-Hood element based on non-uniform rational B-splines (NURBS). This continuum formulation draws a link between computational fluid dynamics and computational solid dynamics. This link inspires an energy stability estimate for the spatial discretization, which favorably distinguishes the formulation from the conventional mixed formulations for finite elasticity. The inf-sup condition is utilized to provide a bound for the pressure field. The generalized-$α$ method is applied for temporal discretization, and a nested block preconditioner is invoked for the solution procedure. The inf-sup stability for different pairs of NURBS elements is elucidated through numerical assessment. The convergence rate of the proposed formulation with various combinations of mixed elements is examined by the manufactured solution method. The numerical scheme is also examined under compressive and tensile loads for isotropic and anisotropic hyperelastic materials. Finally, a suite of dynamic problems is numerically studied to corroborate the stability and conservation properties.

LGMar 1, 2022
Nonlinear Kernel Support Vector Machine with 0-1 Soft Margin Loss

Ju Liu, Ling-Wei Huang, Yuan-Hai Shao et al.

Recent advance on linear support vector machine with the 0-1 soft margin loss ($L_{0/1}$-SVM) shows that the 0-1 loss problem can be solved directly. However, its theoretical and algorithmic requirements restrict us extending the linear solving framework to its nonlinear kernel form directly, the absence of explicit expression of Lagrangian dual function of $L_{0/1}$-SVM is one big deficiency among of them. In this paper, by applying the nonparametric representation theorem, we propose a nonlinear model for support vector machine with 0-1 soft margin loss, called $L_{0/1}$-KSVM, which cunningly involves the kernel technique into it and more importantly, follows the success on systematically solving its linear task. Its optimal condition is explored theoretically and a working set selection alternating direction method of multipliers (ADMM) algorithm is introduced to acquire its numerical solution. Moreover, we firstly present a closed-form definition to the support vector (SV) of $L_{0/1}$-KSVM. Theoretically, we prove that all SVs of $L_{0/1}$-KSVM are only located on the parallel decision surfaces. The experiment part also shows that $L_{0/1}$-KSVM has much fewer SVs, simultaneously with a decent predicting accuracy, when comparing to its linear peer $L_{0/1}$-SVM and the other six nonlinear benchmark SVM classifiers.

86.7ITMar 19
Robust Beamforming for Practical RIS-Aided RSMA Systems with Imperfect SIC under Transceiver Hardware Impairments

Xuejun Cheng, Qian Zhang, Yunnuo Xu et al.

Reconfigurable intelligent surface (RIS)-aided rate-splitting multiple access (RSMA) systems have demonstrated remarkable potential in enhancing spectral efficiency. However, most existing works rely on ideal hardware, which is unrealistic.In practical deployments, RIS elements suffer from amplitude-phase coupling, where transceivers are subject to hardware impairments (HWI), and successive interference cancellation (SIC) in RSMA networks cannot achieve perfect interference elimination for decoded signals.To address these limitations, we investigate a robust beamforming design for RIS-aided RSMA systems under practical hardware imperfections. We first characterize the asymptotic signal-to-noise ratio (SNR) of practical RIS systems when the beamformer is designed based on ideal RIS model, thereby theoretically quantifying the resulting performance degradation. We then derive a closed-form expression for the distortion noise power induced by transceiver HWI, while also accounting for residual interference due to imperfect SIC. Building on these insights, we establish a comprehensive system model that jointly incorporates all hardware-induced impairments and formulate a multiuser sum rate maximization problem. To solve the resulting non-convex optimization problem, we develop an efficient block variable relaxation algorithm. Simulation results verify that the proposed scheme significantly outperforms conventional non-orthogonal multiple access (NOMA) approaches, and achieves superior robustness compared with benchmark schemes neglecting HWI, imperfect SIC, or amplitude-phase coupling.

90.6ITMar 10
Joint Precoding and Phase-Shift Optimization for Beyond-Diagonal RIS-Aided ISAC System

Xuejun Cheng, Qian Zhang, Yuhui Jiao et al.

Beyond diagonal reconfigurable intelligent surfaces (BD-RIS) can realize the interconnection between reflecting elements through the impedance network, thereby providing a new approach for the performance improvement of integrated sensing and communication (ISAC) systems. This paper investigates the optimization problem of BD-RIS-aided multiuser ISAC system, aiming to achieve the flexible design of trade-offs between communication and sensing performance. Specifically, we propose an optimization framework jointly combining the multiuser interference management and sensing beam gain approximation method. By jointly optimizing the precoding vector and RIS phase-shift matrix, improving the multiuser communication sum rate through the proposed interference management method, and enhancing the system sensing performance through the beam gain approximation method. For the resulting non-convex weighted optimization problem, we employ the alternating optimization (AO) algorithm to decouple it into two subproblems of precoding vector and phase-shift matrix optimization, with each step admitting closed-form solutions.Simulation results demonstrate that the proposed BD-RIS-aided ISAC system can achieve significant improvement in the trade-offs between communication and sensing performance than the traditional diagonal RIS, verifying the effectiveness of the proposed optimization framework.

CLDec 13, 2023
Extending Whisper with prompt tuning to target-speaker ASR

Hao Ma, Zhiyuan Peng, Mingjie Shao et al.

Target-speaker automatic speech recognition (ASR) aims to transcribe the desired speech of a target speaker from multi-talker overlapped utterances. Most of the existing target-speaker ASR (TS-ASR) methods involve either training from scratch or fully fine-tuning a pre-trained model, leading to significant training costs and becoming inapplicable to large foundation models. This work leverages prompt tuning, a parameter-efficient fine-tuning approach, to extend Whisper, a large-scale single-talker ASR model, to TS-ASR. Variants of prompt tuning approaches along with their configurations are explored and optimized for TS-ASR.Experimental results show that prompt tuning can achieve performance comparable to state-of-the-art full training approaches while only requiring about 1\% of task-specific model parameters. Notably, the original Whisper's features, such as inverse text normalization and timestamp tagging, are retained in target-speaker ASR, keeping the generated transcriptions natural and informative.

CRDec 29, 2023
MVPatch: More Vivid Patch for Adversarial Camouflaged Attacks on Object Detectors in the Physical World

Zheng Zhou, Hongbo Zhao, Ju Liu et al.

Recent studies have shown that Adversarial Patches (APs) can effectively manipulate object detection models. However, the conspicuous patterns often associated with these patches tend to attract human attention, posing a significant challenge. Existing research has primarily focused on enhancing attack efficacy in the physical domain while often neglecting the optimization of stealthiness and transferability. Furthermore, applying APs in real-world scenarios faces major challenges related to transferability, stealthiness, and practicality. To address these challenges, we introduce generalization theory into the context of APs, enabling our iterative process to simultaneously enhance transferability and refine visual correlation with realistic images. We propose a Dual-Perception-Based Framework (DPBF) to generate the More Vivid Patch (MVPatch), which enhances transferability, stealthiness, and practicality. The DPBF integrates two key components: the Model-Perception-Based Module (MPBM) and the Human-Perception-Based Module (HPBM), along with regularization terms. The MPBM employs ensemble strategy to reduce object confidence scores across multiple detectors, thereby improving AP transferability with robust theoretical support. Concurrently, the HPBM introduces a lightweight method for achieving visual similarity, creating natural and inconspicuous adversarial patches without relying on additional generative models. The regularization terms further enhance the practicality of the generated APs in the physical domain. Additionally, we introduce naturalness and transferability scores to provide an unbiased assessment of APs. Extensive experimental validation demonstrates that MVPatch achieves superior transferability and a natural appearance in both digital and physical domains, underscoring its effectiveness and stealthiness.

IVApr 15, 2025
Lightweight Medical Image Restoration via Integrating Reliable Lesion-Semantic Driven Prior

Pengcheng Zheng, Kecheng Chen, Jiaxin Huang et al.

Medical image restoration tasks aim to recover high-quality images from degraded observations, exhibiting emergent desires in many clinical scenarios, such as low-dose CT image denoising, MRI super-resolution, and MRI artifact removal. Despite the success achieved by existing deep learning-based restoration methods with sophisticated modules, they struggle with rendering computationally-efficient reconstruction results. Moreover, they usually ignore the reliability of the restoration results, which is much more urgent in medical systems. To alleviate these issues, we present LRformer, a Lightweight Transformer-based method via Reliability-guided learning in the frequency domain. Specifically, inspired by the uncertainty quantification in Bayesian neural networks (BNNs), we develop a Reliable Lesion-Semantic Prior Producer (RLPP). RLPP leverages Monte Carlo (MC) estimators with stochastic sampling operations to generate sufficiently-reliable priors by performing multiple inferences on the foundational medical image segmentation model, MedSAM. Additionally, instead of directly incorporating the priors in the spatial domain, we decompose the cross-attention (CA) mechanism into real symmetric and imaginary anti-symmetric parts via fast Fourier transform (FFT), resulting in the design of the Guided Frequency Cross-Attention (GFCA) solver. By leveraging the conjugated symmetric property of FFT, GFCA reduces the computational complexity of naive CA by nearly half. Extensive experimental results in various tasks demonstrate the superiority of the proposed LRformer in both effectiveness and efficiency.

MEDec 14, 2025
Robust Variational Bayes by Min-Max Median Aggregation

Jiawei Yan, Ju Liu, Weidong Liu et al.

We propose a robust and scalable variational Bayes (VB) framework designed to effectively handle contamination and outliers in dataset. Our approach partitions the data into $m$ disjoint subsets and formulates a joint optimization problem based on robust aggregation principles. A key insight is that the full posterior distribution is equivalent to the minimizer of the mean Kullback-Leibler (KL) divergence from the $m$-powered local posterior distributions. To enhance robustness, we replace the mean KL divergence with a min-max median formulation. The min-max formulation not only ensures consistency between the KL minimizer and the Evidence Lower Bound (ELBO) maximizer but also facilitates the establishment of improved statistical rates for the mean of variational posterior. We observe a notable discrepancy in the $m$-powered marginal log likelihood function contingent on the presence of local latent variables. To address this, we treat these two scenarios separately to guarantee the consistency of the aggregated variational posterior. Specifically, when local latent variables are present, we introduce an aggregate-and-rescale strategy. Theoretically, we provide a non-asymptotic analysis of our proposed posterior, incorporating a refined analysis of Bernstein-von Mises (BvM) theorem to accommodate a diverging number of subsets $m$. Our findings indicate that the two-stage approach yields a smaller approximation error compared to directly aggregating the $m$-powered local posteriors. Furthermore, we establish a nearly optimal statistical rate for the mean of the proposed posterior, advancing existing theories related to min-max median estimators. The efficacy of our method is demonstrated through extensive simulation studies.

ITMar 6
Beamforming Optimization for Extremely Large-Scale RIS-Aided Near-Field Secure Communications

Xiaotong Xu, Qian Zhang, Yunxiao Li et al.

This paper studies an extremely large-scale reconfigurable intelligent surface (XL-RIS)-aided near-field physical layer security (PLS) communication system, aiming to maximize the secrecy rate by jointly optimizing precoding vector at the BS and the reflection coefficient matrix at the XL-RIS. Artifi-cial jamming was introduced to further enhance communication security. To solve the non-convex secrecy rate problem, an alternate optimization-based algorithm is adopted to decompose it into two sub-problems. Specifically, when optimizing the transmit beamformer at the BS, the non-convex prob-lem is transformed into a convex one through the weighted minimum mean-square error and the successive convex approximation-based algorithms. For the optimization problem of the XL-RIS phase-shifting matrix, a low-complexity alternating direction method of multipliers-based algorithm is employed to enhance the flexibility of the design. The proposed algorithm is capable of accommodating discrete phase optimization for the XL-RIS, thus better aligning with practical system requirements. Simulation results demonstrate that when the eavesdropper reside in the same direction as the legitimate user and is located closer to the XL-RIS, the proposed scheme in this paper can still ensure the secure communication.

CVDec 27, 2019
Non-Cooperative Game Theory Based Rate Adaptation for Dynamic Video Streaming over HTTP

Hui Yuan, Huayong Fu, Ju Liu et al.

Dynamic Adaptive Streaming over HTTP (DASH) has demonstrated to be an emerging and promising multimedia streaming technique, owing to its capability of dealing with the variability of networks. Rate adaptation mechanism, a challenging and open issue, plays an important role in DASH based systems since it affects Quality of Experience (QoE) of users, network utilization, etc. In this paper, based on non-cooperative game theory, we propose a novel algorithm to optimally allocate the limited export bandwidth of the server to multi-users to maximize their QoE with fairness guaranteed. The proposed algorithm is proxy-free. Specifically, a novel user QoE model is derived by taking a variety of factors into account, like the received video quality, the reference buffer length, and user accumulated buffer lengths, etc. Then, the bandwidth competing problem is formulated as a non-cooperation game with the existence of Nash Equilibrium that is theoretically proven. Finally, a distributed iterative algorithm with stability analysis is proposed to find the Nash Equilibrium. Compared with state-of-the-art methods, extensive experimental results in terms of both simulated and realistic networking scenarios demonstrate that the proposed algorithm can produce higher QoE, and the actual buffer lengths of all users keep nearly optimal states, i.e., moving around the reference buffer all the time. Besides, the proposed algorithm produces no playback interruption.

IVDec 20, 2019
A Comprehensive Study and Comparison of Core Technologies for MPEG 3D Point Cloud Compression

Hao Liu, Hui Yuan, Qi Liu et al.

Point cloud based 3D visual representation is becoming popular due to its ability to exhibit the real world in a more comprehensive and immersive way. However, under a limited network bandwidth, it is very challenging to communicate this kind of media due to its huge data volume. Therefore, the MPEG have launched the standardization for point cloud compression (PCC), and proposed three model categories, i.e., TMC1, TMC2, and TMC3. Because the 3D geometry compression methods of TMC1 and TMC3 are similar, TMC1 and TMC3 are further merged into a new platform namely TMC13. In this paper, we first introduce some basic technologies that are usually used in 3D point cloud compression, then review the encoder architectures of these test models in detail, and finally analyze their rate distortion performance as well as complexity quantitatively for different cases (i.e., lossless geometry and lossless color, lossless geometry and lossy color, lossy geometry and lossy color) by using 16 benchmark 3D point clouds that are recommended by MPEG. Experimental results demonstrate that the coding efficiency of TMC2 is the best on average (especially for lossy geometry and lossy color compression) for dense point clouds while TMC13 achieves the optimal coding performance for sparse and noisy point clouds with lower time complexity.

CVNov 5, 2018
Identifying the Best Machine Learning Algorithms for Brain Tumor Segmentation, Progression Assessment, and Overall Survival Prediction in the BRATS Challenge

Spyridon Bakas, Mauricio Reyes, Andras Jakab et al.

Gliomas are the most common primary brain malignancies, with different degrees of aggressiveness, variable prognosis and various heterogeneous histologic sub-regions, i.e., peritumoral edematous/invaded tissue, necrotic core, active and non-enhancing core. This intrinsic heterogeneity is also portrayed in their radio-phenotype, as their sub-regions are depicted by varying intensity profiles disseminated across multi-parametric magnetic resonance imaging (mpMRI) scans, reflecting varying biological properties. Their heterogeneous shape, extent, and location are some of the factors that make these tumors difficult to resect, and in some cases inoperable. The amount of resected tumor is a factor also considered in longitudinal scans, when evaluating the apparent tumor for potential diagnosis of progression. Furthermore, there is mounting evidence that accurate segmentation of the various tumor sub-regions can offer the basis for quantitative image analysis towards prediction of patient overall survival. This study assesses the state-of-the-art machine learning (ML) methods used for brain tumor image analysis in mpMRI scans, during the last seven instances of the International Brain Tumor Segmentation (BraTS) challenge, i.e., 2012-2018. Specifically, we focus on i) evaluating segmentations of the various glioma sub-regions in pre-operative mpMRI scans, ii) assessing potential tumor progression by virtue of longitudinal growth of tumor sub-regions, beyond use of the RECIST/RANO criteria, and iii) predicting the overall survival from pre-operative mpMRI scans of patients that underwent gross total resection. Finally, we investigate the challenge of identifying the best ML algorithms for each of these tasks, considering that apart from being diverse on each instance of the challenge, the multi-institutional mpMRI BraTS dataset has also been a continuously evolving/growing dataset.

NASep 17, 2018
A robust and efficient iterative method for hyper-elastodynamics with nested block preconditioning

Ju Liu, Alison L. Marsden

We develop a robust and efficient iterative method for hyper-elastodynamics based on a novel continuum formulation recently developed. The numerical scheme is constructed based on the variational multiscale formulation and the generalized-$α$ method. Within the nonlinear solution procedure, a block factorization is performed for the consistent tangent matrix to decouple the kinematics from the balance laws. Within the linear solution procedure, another block factorization is performed to decouple the mass balance equation from the linear momentum balance equations. A nested block preconditioning technique is proposed to combine the Schur complement reduction approach with the fully coupled approach. This preconditioning technique, together with the Krylov subspace method, constitutes a novel iterative method for solving hyper-elastodynamics. We demonstrate the efficacy of the proposed preconditioning technique by comparing with the SIMPLE preconditioner and the one-level domain decomposition preconditioner. Two representative examples are studied: the compression of an isotropic hyperelastic cube and the tensile test of a fully-incompressible anisotropic hyperelastic arterial wall model. The robustness with respect to material properties and the parallel performance of the preconditioner are examined.

COMP-PHSep 4, 2018
Thermal convection in the van der Waals fluid

Ju Liu

In this work, the van der Waals fluid model, a diffuse-interface model for liquid-vapor two-phase flows, is numerically investigated. The thermodynamic properties of the van der Waals fluid are first studied. Dimensional analysis is performed to identify the control parameters for the system. An entropy-stable numerical scheme and isogeometric analysis are utilized to discretize the governing equations for numerical simulations. The steady state solution at low Rayleigh number is presented, demonstrating the capability of the model in describing liquid-vapor phase transitions. Next, two-dimensional nucleate and film boiling are simulated, showing the applicability of the model in different regimes of boiling. In the last, the heat transport property of the van der Waals model is numerically investigated. The scaling law for the Nusselt number with respect to the Rayleigh number in the van der Waals model is obtained by performing a suite of high-resolution simulations.

MMJan 18, 2016
Multiple Watermarking Algorithm Based on Spread Transform Dither Modulation

Xinchao Li, Ju Liu, Jiande Sun et al.

Multiple watermarking technique, embedding several watermarks in one carrier, has enabled many interesting applications. In this study, a novel multiple watermarking algorithm is proposed based on the spirit of spread transform dither modulation (STDM). It can embed multiple watermarks into the same region and the same transform domain of one image; meanwhile, the embedded watermarks can be extracted independently and blindly in the detector without any interference. Furthermore, to improve the fidelity of the watermarked image, the properties of the dither modulation quantizer and the proposed multiple watermarks embedding strategy are investigated, and two practical optimization methods are proposed. Finally, to enhance the application flexibility, an extension of the proposed algorithm is proposed which can sequentially embeds different watermarks into one image during each stage of its circulation. Compared with the pioneering multiple watermarking algorithms, the proposed one owns more flexibility in practical application and is more robust against distortion due to basic operations such as random noise, JPEG compression and volumetric scaling.