INS-DETLGIVFeb 27, 2025

FPGA-Accelerated SpeckleNN with SNL for Real-time X-ray Single-Particle Imaging

arXiv:2502.19734v14 citationsh-index: 3Frontiers in High Performance Computing
Originality Incremental advance
AI Analysis

This enables real-time adaptive classification for high-throughput X-ray free-electron laser facilities, though it is incremental as it optimizes an existing model for hardware constraints.

They tackled real-time speckle pattern classification in X-ray Single-Particle Imaging by implementing a specialized SpeckleNN model on an FPGA, achieving an 8.9x speedup and 7.8x power reduction compared to a GPU with 90% accuracy.

We implement a specialized version of our SpeckleNN model for real-time speckle pattern classification in X-ray Single-Particle Imaging (SPI) using the SLAC Neural Network Library (SNL) on an FPGA. This hardware is optimized for inference near detectors in high-throughput X-ray free-electron laser (XFEL) facilities like the Linac Coherent Light Source (LCLS). To fit FPGA constraints, we optimized SpeckleNN, reducing parameters from 5.6M to 64.6K (98.8% reduction) with 90% accuracy. We also compressed the latent space from 128 to 50 dimensions. Deployed on a KCU1500 FPGA, the model used 71% of DSPs, 75% of LUTs, and 48% of FFs, with an average power consumption of 9.4W. The FPGA achieved 45.015us inference latency at 200 MHz. On an NVIDIA A100 GPU, the same inference consumed ~73W and had a 400us latency. Our FPGA version achieved an 8.9x speedup and 7.8x power reduction over the GPU. Key advancements include model specialization and dynamic weight loading through SNL, eliminating time-consuming FPGA re-synthesis for fast, continuous deployment of (re)trained models. These innovations enable real-time adaptive classification and efficient speckle pattern vetoing, making SpeckleNN ideal for XFEL facilities. This implementation accelerates SPI experiments and enhances adaptability to evolving conditions.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes