Probabilistic Computers for MIMO Detection: From Sparsification to 2D Parallel Tempering

arXiv:2601.0903727.53 citationsh-index: 11
Predicted impact top 34% in ET · last 90 daysOriginality Highly original
AI Analysis

This work addresses the dense connectivity bottleneck in probabilistic computers for real-world combinatorial optimization, offering a tuning-free algorithmic framework that could enable scalable hardware for next-generation wireless MIMO detection.

The paper introduces graph sparsification with auxiliary copy variables and a two-dimensional parallel tempering (2D-PT) algorithm for probabilistic computers, applied to MIMO detection. On 64×64 BPSK MIMO, the sparsified system achieves bit error rates below conventional linear detectors with 3 ms per instance, while 2D-PT eliminates manual tuning and converges 250× faster than 1D-PT on spin glasses, reaching zero bit errors on 128×128 MIMO.

Probabilistic computers built from p-bits offer a promising path for combinatorial optimization, but the dense connectivity required by real-world problems scales poorly in hardware. Here, we address this through graph sparsification with auxiliary copy variables and demonstrate two fully on-chip parallel tempering solvers on an FPGA. Targeting MIMO detection, a dense, NP-hard problem central to wireless communications, we first fit 11 temperature replicas of a 128-node sparsified system (1,408 p-bits) on-chip and achieve bit error rates significantly below conventional linear detectors on $64 \times 64$ BPSK MIMO. We report complete end-to-end solution times of 3~ms per instance, including all loading, sampling, readout, and verification overheads. ASIC projections in 7~nm technology indicate 103~MHz operation at 285.8~mW, suggesting that massive parallelism across multiple chips could approach the throughput demands of next-generation wireless systems. Sparsification, however, introduces a sharp sensitivity to the copy-constraint strength $P$ that requires manual tuning. To eliminate this bottleneck, we utilize Two-Dimensional Parallel Tempering (2D-PT), which exchanges replicas across both temperature ($β$) and constraint ($P$) dimensions. On Sherrington--Kirkpatrick spin glasses, 2D-PT converges roughly $250\times$ faster than optimally tuned 1D-PT, and on $128 \times 128$ MIMO it reaches zero bit errors at high SNR where 1D-PT exhibits an error floor. We further validate 2D-PT entirely on-chip with 54 replicas (1,728 p-bits) on a $16 \times 16$ MIMO instance, where it tracks the maximum-likelihood bound in just 50 Monte Carlo steps -- $10\times$ fewer than 1D-PT -- at projected 111~MHz and 124~mW in 7~nm. Together, these results establish an on-chip p-bit architecture and a scalable, tuning-free algorithmic framework for dense combinatorial optimization.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes