Influence of Hyperparameters on Performance of Optical Neural Network Training Algorithms

被引：1

作者：

Cao Wen ^{[1
]}

Liu Meiyu ^{[1
]}

Lu Minghao ^{[1
]}

Shao Xiaofeng ^{[1
]}

Liu Qifa ^{[1
]}

Wang Jin ^{[1
]}

机构：

[1] Nanjing Univ Posts & Telecommun, Sch Commun & Informat Engn, Nanjing 210003, Jiangsu, Peoples R China

来源：

LASER & OPTOELECTRONICS PROGRESS | 2023年 / 60卷 / 22期

关键词：

optical neural network; Mach; Zehnder interferometer; training algorithm; momentum; learning rate; ARTIFICIAL-INTELLIGENCE; DESIGN;

D O I：

10.3788/LOP230535

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

An optical neural network (ONN) based on fast Fourier transform (FFT) is constructed for digital image recognition in optical devices. Herein, ONN uses Mach-Zehnder interferometer (MZI) as its linear optical processing unit. These MZIs are connected in a grid- like layout and modulate the passing optical signals to achieve multiplication and addition. Subsequently, MZIs achieve classification and recognition for images. In this study, the influence of main hyperparameters (e. g., momentum coefficient and learning rate of the training algorithm) on the performance of ONN in recognizing handwritten digital images is investigated. First, the performance of ONN with four training algorithms in recognizing handwritten digital images under different learning rates is compared. These algorithms connect with different nonlinear functions and different number of hidden layers, namely, stochastic gradient descent (SGD), root mean square prop (RMSprop), adaptive moment estimation (Adam), and adaptive gradient (Adagrad). Additionally, the accuracy, running memory, and training time of ONN with the SGD algorithm connected with different nonlinear functions and different number of hidden layers are analyzed under different momentum coefficients. The recognition performance of ONN with SGD and RMSprop training algorithms is also compared after the introduction of momentum, where the learning rate is 0. 05 and 0. 005. The experimental results show that when the learning rate changes from 0. 5 to 5 x 10(- 5), the FFT-typed ONN with the RMSprop training algorithm, two hidden layers, and the nonlinear function of Softplus has the highest recognition accuracy, reaching 97. 4%. Furthermore, for the momentum coefficient of 0, the ONN with two hidden layers and the nonlinear function of Softplus trained by the SGD algorithm exhibits the highest recognition accuracy of 96%, when the momentum coefficient increases to 0. 9, the accuracy of ONN is improved to 96. 9%. However, the RMSprop algorithm with momentum leads to nonconvergence or slow convergence of network recognition accuracy.

引用

页数：8

共 25 条

[1] On improving CNNs performance: The case of MNIST
Alvear-Sandoval, Ricardo F.
Sancho-Gomez, Jose L.
Figueiras-Vidal, Anibal R.
[J]. INFORMATION FUSION, 2019, 52 : 106 - 109
[2] Monolayer graphene as a saturable absorber in a mode-locked laser
Bao, Qiaoliang
Zhang, Han
Ni, Zhenhua
Wang, Yu
Polavarapu, Lakshminarayana
Shen, Zexiang
Xu, Qing-Hua
Tang, Dingyuan
Loh, Kian Ping
[J]. NANO RESEARCH, 2011, 4 (03) : 297 - 307
[3] Optimal design for universal multiport interferometers
Clements, William R.
Humphreys, Peter C.
Metcalf, Benjamin J.
Kolthammer, W. Steven
Walmsley, Ian A.
[J]. OPTICA, 2016, 3 (12): : 1460 - 1465
[4] Machine Learning With Neuromorphic Photonics
de Lima, Thomas Ferreira
Peng, Hsuan-Tung
Tait, Alexander N.
Nahmias, Mitchell A.
Miller, Heidi B.
Shastri, Bhavin J.
Prucnal, Paul R.
[J]. JOURNAL OF LIGHTWAVE TECHNOLOGY, 2019, 37 (05) : 1515 - 1534
[5] Design of optical neural networks with component imprecisions
Fang, Michael Y-S
Manipatruni, Sasikanth
Wierzynski, Casimir
Khosrowshahi, Amir
DeWeese, Michael R.
[J]. OPTICS EXPRESS, 2019, 27 (10): : 14009 - 14029
[6] Gu JQ, 2020, ASIA S PACIF DES AUT, P476, DOI 10.1109/ASP-DAC47756.2020.9045156
[7] Training of photonic neural networks through in situ backpropagation and gradient measurement
Hughes, Tyler W.
Minkov, Momchil
Shi, Yu
Fan, Shanhui
[J]. OPTICA, 2018, 5 (07): : 864 - 871
[8] Deep learning
LeCun, Yann
Bengio, Yoshua
Hinton, Geoffrey
[J]. NATURE, 2015, 521 (7553) : 436 - 444
[9] Photo-Realistic Single Image Super-Resolution Using a Generative Adversarial Network
Ledig, Christian
Theis, Lucas
Huszar, Ferenc
Caballero, Jose
Cunningham, Andrew
Acosta, Alejandro
Aitken, Andrew
Tejani, Alykhan
Totz, Johannes
Wang, Zehan
Shi, Wenzhe
[J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 105 - 114
[10] Precipitation Intensity Recognition Based on Convolution Neural Network with Fused Encoded and Decoded Features
Lin Mengxiang
Huang Xiuping
Lin Zhiwei
Hong Sidi
Liu Jinfu
[J]. LASER & OPTOELECTRONICS PROGRESS, 2023, 60 (02)

← 1 2 3 →