ITMay 15
PrismQuant: Rate-Distortion-Optimal Vector Quantization for Gaussian-Mixture SourcesBumsu Park, Chanho Park, Youngmok Park et al.
For a Gaussian source under mean-squared error (MSE), classical transform coding is rate--distortion (RD) optimal: the Karhunen--Loeve transform (KLT) diagonalizes the covariance, reverse waterfilling allocates the bits, and scalar quantization closes the loop. This elegant story breaks down for multimodal sources, where no single covariance can capture heterogeneous local geometries, and the RD function loses its closed form. We revisit this problem through Gaussian-mixture sources and develop a constructive RD theory for them. Our key finding is that the mixture structure incurs only a component label cost. Conditioned on the active mixture component, each branch is Gaussian; the challenge is allocating bits across heterogeneous branches. We prove that the genie-aided conditional RD function is governed by a single global reverse-waterfilling level shared across all components and eigenmodes. Building on this result, we introduce PrismQuant, which transmits the component label losslessly and encodes the residual using the component-matched KLT, followed by scalar quantization, achieving a rate of H(C)/n bits per source dimension of the converse, with a vanishing asymptotic gap. We further develop a practical implementation based on EM-driven Gaussian-mixture learning, component-adaptive KLTs, and entropy-constrained scalar quantization (ECSQ). Experiments on synthetic Gaussian mixtures show that PrismQuant closely approaches the theoretical RD bound, while experiments on real-world channel-state-information (CSI) data demonstrate competitive or superior performance compared with transformer-based learned codecs at more than one order of magnitude smaller model size.
ITApr 19
CSI Compression for Massive MIMO-OFDM: Mismatch-Aware Rate-Distortion Trade-offsBumsu Park, Youngmok Park, Chanho Park et al.
We study channel state information (CSI) compression for wideband frequency division duplex massive multiple-input multiple-output (MIMO) when the base station (BS) reconstructs CSI using an imperfect covariance model. Under matched second-order statistics, remote rate--distortion theory yields transform coding with reverse water-filling (RWF) over covariance eigenmodes. With decoder-side covariance mismatch, however, this allocation is no longer end-to-end optimal. We derive an achievable mismatched Gaussian rate--distortion characterization based on a Gaussian test channel and a mismatched minimum mean square error (MMSE) reconstruction rule. In a shared-eigenvector regime (common eigenbasis, mismatched eigenvalues), the problem decouples across modes and leads to a robust reverse water-filling (RRWF) allocation computable via bisection and per-mode root finding. Simulations using wideband massive MIMO covariance models show that RRWF consistently improves reconstruction distortion and end-to-end mean square error relative to conventional RWF under mismatch.
ITApr 22
CSI Feedback Under Basis Mismatch: Rate-Splitting Transform Coding for FDD Massive MIMOYoungmok Park, Bumsu Park, Namyoon Lee
In frequency division duplex massive multiple-input multiple-output systems, downlink channel state information must be fed back within a limited uplink budget. While transform coding with Karhunen-Loeve transform and reverse water-filling is rate-distortion optimal for Gaussian channels, its performance is limited by basis mismatch between the user and base station. We analyze this mismatch and propose a practical architecture separating long-term basis feedback from short-term coefficient quantization. Using a random vector quantization, we derive a closed-form end-to-end mean square error expression. This allows us to characterize the optimal rate split and identify a phase transition threshold for basis updates. Simulations on correlated Gaussian and COST2100 channels demonstrate near-optimal performance, robustness to update overhead, and significant complexity reduction compared to deep-learning-based autoencoders.
ITMar 15
Fundamental Limits of CSI Compression in FDD Massive MIMOBumsu Park, Youngmok Park, Chanho Park et al.
Channel state information (CSI) feedback in frequency-division duplex (FDD) massive multiple-input multiple-output (MIMO) systems is fundamentally limited by the high dimensionality of wideband channels. In this paper, we model the stacked wideband CSI vector as a Gaussian-mixture source with a latent geometry state that represents different propagation environments. Each component corresponds to a locally stationary regime characterized by a correlated proper complex Gaussian distribution with its own covariance matrix. This representation captures the multimodal nature of practical CSI datasets while preserving the analytical tractability of Gaussian models. Motivated by this structure, we propose Gaussian-mixture transform coding (GMTC), a practical CSI feedback architecture that combines state inference with state-adaptive TC. The mixture parameters are learned offline from channel samples and stored as a shared statistical dictionary at both the user equipment (UE) and the base station. For each CSI realization, the UE identifies the most likely geometry state, encodes the corresponding label using a lossless source code, and compresses the CSI using the Karhunen-Loeve transform matched to that state. We further characterize the fundamental limits of CSI compression under this model by deriving analytical converse and achievability bounds on the rate-distortion (RD) function. A key structural result is that the optimal bit allocation across all mixture components is governed by a single global reverse-waterfilling level. Simulations on the COST2100 dataset show that GMTC significantly improves the RD tradeoff relative to neural transform coding approaches while requiring substantially smaller model memory and lower inference complexity. These results indicate that near-optimal CSI compression can be achieved through state-adaptive TC without relying on large neural encoders.