Alphabetic Image Recognition Based on Multi-channel Diffractive Deep Neural Network

被引:0
|
作者
Zhou, Yuanguo [1 ]
Shui, Shan [1 ]
Chen, Yu [1 ]
Liang, Bingyang [1 ]
Cai, Yijun [2 ]
机构
[1] Xian Univ Sci & Technol, Coll Commun & Informat Engn, Xian 710054, Peoples R China
[2] Xiamen Univ Technol, Fujian Prov Key Lab Optoelect Technol & Devices, Xiamen 361024, Peoples R China
关键词
D O I
10.1109/PIERS53385.2021.9694760
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Machine learning has made remarkable achievements in image classification in recent years. It has also inspired the emergence of optical machine learning frameworks as an effective combination of optics and artificial intelligence. Diffractive deep neural network (D2NN) is a recently proposed optical neural network framework, which is designed and trained by computers, and then fabricated by the usage of 3D printing technology. It can recognize handwritten digital dataset and fashion product dataset with only light source, and the recognition effect is comparable to that of electronic neural network. We propose a diffractive light neural network structure is improved on the basis of the traditional diffractive deep neural network (D2NN), providing a multi-channel design model, while the distribution of neuron nodes on the network layer is adjusted in such a way that different network layers correspond to incoming light waves of different frequencies. The neuron nodes on the network layer are arranged in a concentric circle pattern to reduce the parameters generated during network training, and in the output layer we get channels of different frequencies and then merge these channels to obtain the final results. It is experimentally verified that the diffractive optical neural network with multiple channels has a large improvement of 95.44% recognition accuracy compared to the 89.92% recognition accuracy of the traditional D2NN for recognition of 10 different alphabetic image datasets. The proposed network improves the recognition accuracy by increasing the number of channels while reducing the parameters, which provides a guidance to design optical networks with reduced cost and improved efficiency.
引用
收藏
页码:1384 / 1387
页数:4
相关论文
共 50 条
  • [1] FCNet: a deep neural network based on multi-channel feature cascading for image denoising
    Feng, Siling
    Qi, Zhisheng
    Zhang, Guirong
    Lin, Cong
    Huang, Mengxing
    [J]. JOURNAL OF SUPERCOMPUTING, 2024, 80 (12): : 17042 - 17067
  • [2] Fire Recognition Based On Multi-Channel Convolutional Neural Network
    Mao, Wentao
    Wang, Wenpeng
    Dou, Zhi
    Li, Yuan
    [J]. FIRE TECHNOLOGY, 2018, 54 (02) : 531 - 554
  • [3] Fire Recognition Based On Multi-Channel Convolutional Neural Network
    Wentao Mao
    Wenpeng Wang
    Zhi Dou
    Yuan Li
    [J]. Fire Technology, 2018, 54 : 531 - 554
  • [4] Video fire recognition based on multi-channel convolutional neural network
    Zhong, Chen
    Shao, Yu
    Ding, Hongjun
    Wang, Ke
    [J]. 2020 3RD INTERNATIONAL CONFERENCE ON COMPUTER INFORMATION SCIENCE AND APPLICATION TECHNOLOGY (CISAT) 2020, 2020, 1634
  • [5] Correction to: Fire Recognition Based On Multi-Channel Convolutional Neural Network
    Wentao Mao
    Wenpeng Wang
    Zhi Dou
    Yuan Li
    [J]. Fire Technology, 2018, 54 : 809 - 809
  • [6] Automatic Modulation Recognition Based on Multi-Channel Neural Network Model
    Zhang, Xianchao
    Ma, Shengyu
    Shi, Jian
    Li, Panpan
    Yue, Guangxue
    [J]. 2022 14TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS AND SIGNAL PROCESSING, WCSP, 2022, : 326 - 330
  • [7] A Multi-channel Neural Network for Imbalanced Emotion Recognition
    Li, Ran
    Si, Qingyi
    Fu, Peng
    Lin, Zheng
    Wang, Weiping
    Shi, Gang
    [J]. 2019 IEEE 31ST INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2019), 2019, : 353 - 360
  • [8] Deep Neural Network-Based Generalized Sidelobe Canceller for Robust Multi-channel Speech Recognition
    Li, Guanjun
    Liang, Shan
    Nie, Shuai
    Liu, Wenju
    Yang, Zhanlei
    Xiao, Longshuai
    [J]. INTERSPEECH 2020, 2020, : 51 - 55
  • [9] A Multi-Channel Deep Neural Network for Relation Extraction
    Chen, Yanping
    Wang, Kai
    Yang, Weizhe
    Qin, Yongbin
    Huang, Ruizhang
    Chen, Ping
    [J]. IEEE ACCESS, 2020, 8 : 13195 - 13203
  • [10] Based on Probabilistic Neural Network of Human Multi-channel Semg Pattern Recognition
    Dai, Changming
    Du, Chunmei
    Qi, Aiha
    Liu, Jiwen
    [J]. PROCEEDINGS OF 2012 2ND INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND NETWORK TECHNOLOGY (ICCSNT 2012), 2012, : 1354 - 1356