ARETLGFeb 1, 2025

Hardware-Efficient Photonic Tensor Core: Accelerating Deep Neural Networks with Structured Compression

arXiv:2502.01670v210 citationsh-index: 19Optica
Originality Synthesis-oriented
AI Analysis

This addresses scalability challenges for optical neural networks in AI hardware, representing a domain-specific incremental advance.

The paper tackles hardware limitations in optical neural networks by introducing a block-circulant photonic tensor core with structured compression, achieving up to 74.91% reduction in trainable parameters while maintaining competitive accuracy and a projected 3.56× improvement in power efficiency.

The rapid growth in computing demands, particularly driven by artificial intelligence applications, has begun to exceed the capabilities of traditional electronic hardware. Optical computing offers a promising alternative due to its parallelism, high computational speed, and low power consumption. However, existing photonic integrated circuits are constrained by large footprints, costly electro-optical interfaces, and complex control mechanisms, limiting the practical scalability of optical neural networks (ONNs). To address these limitations, we introduce a block-circulant photonic tensor core for a structure-compressed optical neural network (StrC-ONN) architecture. The structured compression technique substantially reduces both model complexity and hardware resources without sacrificing the versatility of neural networks, and achieves accuracy comparable to uncompressed models. Additionally, we propose a hardware-aware training framework to compensate for on-chip nonidealities to improve model robustness and accuracy. Experimental validation through image processing and classification tasks demonstrates that our StrC-ONN achieves a reduction in trainable parameters of up to 74.91%,while still maintaining competitive accuracy levels. Performance analyses further indicate that this hardware-software co-design approach is expected to yield a 3.56 times improvement in power efficiency. By reducing both hardware requirements and control complexity across multiple dimensions, this work explores a new pathway toward practical and scalable ONNs, highlighting a promising route to address future computational efficiency challenges.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes