Influence of Hyperparameters on Performance of Optical Neural Network Training Algorithms

被引:1
|
作者
Cao Wen [1 ]
Liu Meiyu [1 ]
Lu Minghao [1 ]
Shao Xiaofeng [1 ]
Liu Qifa [1 ]
Wang Jin [1 ]
机构
[1] Nanjing Univ Posts & Telecommun, Sch Commun & Informat Engn, Nanjing 210003, Jiangsu, Peoples R China
关键词
optical neural network; Mach; Zehnder interferometer; training algorithm; momentum; learning rate; ARTIFICIAL-INTELLIGENCE; DESIGN;
D O I
10.3788/LOP230535
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
An optical neural network (ONN) based on fast Fourier transform (FFT) is constructed for digital image recognition in optical devices. Herein, ONN uses Mach-Zehnder interferometer (MZI) as its linear optical processing unit. These MZIs are connected in a grid- like layout and modulate the passing optical signals to achieve multiplication and addition. Subsequently, MZIs achieve classification and recognition for images. In this study, the influence of main hyperparameters (e. g., momentum coefficient and learning rate of the training algorithm) on the performance of ONN in recognizing handwritten digital images is investigated. First, the performance of ONN with four training algorithms in recognizing handwritten digital images under different learning rates is compared. These algorithms connect with different nonlinear functions and different number of hidden layers, namely, stochastic gradient descent (SGD), root mean square prop (RMSprop), adaptive moment estimation (Adam), and adaptive gradient (Adagrad). Additionally, the accuracy, running memory, and training time of ONN with the SGD algorithm connected with different nonlinear functions and different number of hidden layers are analyzed under different momentum coefficients. The recognition performance of ONN with SGD and RMSprop training algorithms is also compared after the introduction of momentum, where the learning rate is 0. 05 and 0. 005. The experimental results show that when the learning rate changes from 0. 5 to 5 x 10(- 5), the FFT-typed ONN with the RMSprop training algorithm, two hidden layers, and the nonlinear function of Softplus has the highest recognition accuracy, reaching 97. 4%. Furthermore, for the momentum coefficient of 0, the ONN with two hidden layers and the nonlinear function of Softplus trained by the SGD algorithm exhibits the highest recognition accuracy of 96%, when the momentum coefficient increases to 0. 9, the accuracy of ONN is improved to 96. 9%. However, the RMSprop algorithm with momentum leads to nonconvergence or slow convergence of network recognition accuracy.
引用
收藏
页数:8
相关论文
共 25 条
  • [1] On improving CNNs performance: The case of MNIST
    Alvear-Sandoval, Ricardo F.
    Sancho-Gomez, Jose L.
    Figueiras-Vidal, Anibal R.
    [J]. INFORMATION FUSION, 2019, 52 : 106 - 109
  • [2] Monolayer graphene as a saturable absorber in a mode-locked laser
    Bao, Qiaoliang
    Zhang, Han
    Ni, Zhenhua
    Wang, Yu
    Polavarapu, Lakshminarayana
    Shen, Zexiang
    Xu, Qing-Hua
    Tang, Dingyuan
    Loh, Kian Ping
    [J]. NANO RESEARCH, 2011, 4 (03) : 297 - 307
  • [3] Optimal design for universal multiport interferometers
    Clements, William R.
    Humphreys, Peter C.
    Metcalf, Benjamin J.
    Kolthammer, W. Steven
    Walmsley, Ian A.
    [J]. OPTICA, 2016, 3 (12): : 1460 - 1465
  • [4] Machine Learning With Neuromorphic Photonics
    de Lima, Thomas Ferreira
    Peng, Hsuan-Tung
    Tait, Alexander N.
    Nahmias, Mitchell A.
    Miller, Heidi B.
    Shastri, Bhavin J.
    Prucnal, Paul R.
    [J]. JOURNAL OF LIGHTWAVE TECHNOLOGY, 2019, 37 (05) : 1515 - 1534
  • [5] Design of optical neural networks with component imprecisions
    Fang, Michael Y-S
    Manipatruni, Sasikanth
    Wierzynski, Casimir
    Khosrowshahi, Amir
    DeWeese, Michael R.
    [J]. OPTICS EXPRESS, 2019, 27 (10): : 14009 - 14029
  • [6] Gu JQ, 2020, ASIA S PACIF DES AUT, P476, DOI 10.1109/ASP-DAC47756.2020.9045156
  • [7] Training of photonic neural networks through in situ backpropagation and gradient measurement
    Hughes, Tyler W.
    Minkov, Momchil
    Shi, Yu
    Fan, Shanhui
    [J]. OPTICA, 2018, 5 (07): : 864 - 871
  • [8] Deep learning
    LeCun, Yann
    Bengio, Yoshua
    Hinton, Geoffrey
    [J]. NATURE, 2015, 521 (7553) : 436 - 444
  • [9] Photo-Realistic Single Image Super-Resolution Using a Generative Adversarial Network
    Ledig, Christian
    Theis, Lucas
    Huszar, Ferenc
    Caballero, Jose
    Cunningham, Andrew
    Acosta, Alejandro
    Aitken, Andrew
    Tejani, Alykhan
    Totz, Johannes
    Wang, Zehan
    Shi, Wenzhe
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 105 - 114
  • [10] Precipitation Intensity Recognition Based on Convolution Neural Network with Fused Encoded and Decoded Features
    Lin Mengxiang
    Huang Xiuping
    Lin Zhiwei
    Hong Sidi
    Liu Jinfu
    [J]. LASER & OPTOELECTRONICS PROGRESS, 2023, 60 (02)