ET LG OPTICSDec 20, 2022

Sophisticated deep learning with on-chip optical diffractive tensor processing

Yuyao Huang, Tingzhao Fu, Honghao Huang, Sigang Yang, Hongwei Chen

arXiv:2212.09975v11.226 citationsh-index: 30

Originality Incremental advance

AI Analysis

This provides a low-energy, high-parallelism solution for deep learning acceleration, though it appears incremental as it builds on existing photonic computing concepts.

The paper tackles the inefficiency of conventional computing for deep learning by proposing an optical computing architecture using on-chip diffraction to accelerate convolutions, achieving accuracies of 91.63% on Fashion-MNIST and 86.25% on CIFAR-4, and denoising with PSNR up to 31.70dB.

The ever-growing deep learning technologies are making revolutionary changes for modern life. However, conventional computing architectures are designed to process sequential and digital programs, being extremely burdened with performing massive parallel and adaptive deep learning applications. Photonic integrated circuits provide an efficient approach to mitigate bandwidth limitations and power-wall brought by its electronic counterparts, showing great potential in ultrafast and energy-free high-performance computing. Here, we propose an optical computing architecture enabled by on-chip diffraction to implement convolutional acceleration, termed optical convolution unit (OCU). We demonstrate that any real-valued convolution kernels can be exploited by OCU with a prominent computational throughput boosting via the concept of structral re-parameterization. With OCU as the fundamental unit, we build an optical convolutional neural network (oCNN) to implement two popular deep learning tasks: classification and regression. For classification, Fashion-MNIST and CIFAR-4 datasets are tested with accuracy of 91.63% and 86.25%, respectively. For regression, we build an optical denoising convolutional neural network (oDnCNN) to handle Gaussian noise in gray scale images with noise level σ = 10, 15, 20, resulting clean images with average PSNR of 31.70dB, 29.39dB and 27.72dB, respectively. The proposed OCU presents remarkable performance of low energy consumption and high information density due to its fully passive nature and compact footprint, providing a highly parallel while lightweight solution for future computing architecture to handle high dimensional tensors in deep learning.

View on arXiv PDF

Similar