GPU-accelerated simulated annealing based on p-bits with real-world device-variability modeling
This provides a scalable tool for researchers in probabilistic computing, though it is incremental in modeling real-world device behavior.
The study tackled the impact of device variability in probabilistic computing using p-bits, revealing that variability can enhance algorithm performance, and achieved a two-order magnitude speedup on GPU-accelerated simulated annealing for large-scale MAX-CUT problems.
Probabilistic computing using probabilistic bits (p-bits) presents an efficient alternative to traditional CMOS logic for complex problem-solving, including simulated annealing and machine learning. Realizing p-bits with emerging devices such as magnetic tunnel junctions (MTJs) introduces device variability, which was expected to negatively impact computational performance. However, this study reveals an unexpected finding: device variability can not only degrade but also enhance algorithm performance, particularly by leveraging timing variability. This paper introduces a GPU-accelerated, open-source simulated annealing framework based on p-bits that models key device variability factors -- timing, intensity, and offset -- to reflect real-world device behavior. Through CUDA-based simulations, our approach achieves a two-order magnitude speedup over CPU implementations on the MAX-CUT benchmark with problem sizes ranging from 800 to 20,000 nodes. By providing a scalable and accessible tool, this framework aims to advance research in probabilistic computing, enabling optimization applications in diverse fields.