Mutual Information-Based Neural Network Distillation for Improving Photonic Neural Network Training

被引:1
|
作者
Chariton, Alexandros [1 ]
Passalis, Nikolaos [1 ]
Pleros, Nikos [2 ]
Tefas, Anastasios [1 ]
机构
[1] Aristotle Univ Thessaloniki, Dept Informat, AIIA Lab, Computat Intelligence & Deep Learning Grp, Thessaloniki 54124, Greece
[2] Aristotle Univ Thessaloniki, Dept Informat, Wireless & Photon Syst & Networks Grp, Thessaloniki 54124, Greece
关键词
Photonic neural networks; Knowledge distillation; Mutual information; Photonic neuromorphics;
D O I
10.1007/s11063-023-11170-y
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Photonic neural networks are among the most promising recently proposed neuromorphic solutions for providing fast and energy efficient Deep Learning (DL) implementations. However, photonic networks need to be first appropriately trained to support the underlying hardware implementation (e.g., activation functions), since it is not possible to directly deploy an already trained DL model into a neuromorphic platform that uses different transfer functions compared to regular networks. The main hypothesis examined in this paper is whether we can employ a neural network distillation approach to transfer the knowledge from a regular DL model into a photonic neural network reducing in this way the accuracy gap between regular and photonic models. Furthermore, we propose a novel fully differentiable formulation of mutual information that can be used for efficiently transferring the knowledge between layers of a regular neural networks and photonic neural networks. In this way, we further increase the effectiveness of distillation by also taking into account the behavior of intermediate layers of DL models. Experiments conducted on two image datasets demonstrate the effectiveness of the proposed approach, further closing the accuracy gap between photonic and regular DL models.
引用
收藏
页码:8589 / 8604
页数:16
相关论文
共 50 条
  • [1] Mutual Information-Based Neural Network Distillation for Improving Photonic Neural Network Training
    Alexandros Chariton
    Nikolaos Passalis
    Nikos Pleros
    Anastasios Tefas
    [J]. Neural Processing Letters, 2023, 55 : 8589 - 8604
  • [2] Mutual Information-based RBM Neural Networks
    Peng, Kang-Hao
    Zhang, Heng
    [J]. 2016 23RD INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2016, : 2458 - 2463
  • [3] A Priori Information-Based Lightweight Convolutional Neural Network for Video Denoising
    Shentu, Min-Jian
    Zhu, Qiang
    Zhu, Shu-Yuan
    Meng, Xian-Dong
    [J]. Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2023, 51 (06): : 1510 - 1517
  • [4] Mutual information-based LPI optimisation for radar network
    Shi, Chenguang
    Zhou, Jianjiang
    Wang, Fei
    Chen, Jun
    [J]. INTERNATIONAL JOURNAL OF ELECTRONICS, 2015, 102 (07) : 1114 - 1131
  • [5] Environment Information-Based Channel Prediction Method Assisted by Graph Neural Network
    Sun, Yutong
    Zhang, Jianhua
    Zhang, Yuxiang
    Yu, Li
    Wang, Qixing
    Liu, Guangyi
    [J]. CHINA COMMUNICATIONS, 2022, 19 (11) : 1 - 15
  • [6] Environment Information-Based Channel Prediction Method Assisted by Graph Neural Network
    Yutong Sun
    Jianhua Zhang
    Yuxiang Zhang
    Li Yu
    Qixing Wang
    Guangyi Liu
    [J]. China Communications, 2022, 19 (11) : 1 - 15
  • [7] Multi-stream Information-Based Neural Network for Mammogram Mass Segmentation
    Li, Zhilin
    Deng, Zijian
    Chen, Li
    Gui, Yu
    Cai, Zhigang
    Liao, Jianwei
    [J]. ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2022, PT I, 2022, 13529 : 267 - 278
  • [8] Research on Mutual Information-Based Brain Network and Lie Detection
    Peng, Si-Yu
    Zhou, Dao
    Zhang, Jia-Qi
    Wang, Yu
    Gao, Jun-Feng
    [J]. Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2019, 47 (07): : 1551 - 1556
  • [9] Mutual information in a dilute, asymmetric neural network model
    Greenfeld, E
    Lecar, H
    [J]. PHYSICAL REVIEW E, 2001, 63 (04):
  • [10] Correlation analysis and prediction of power network loss based on mutual information and artificial neural network
    Bai, Jianghong
    Jiang, Mu
    Liu, Liping
    Sun, Yunchao
    Wang, Yuxing
    Zhang, Jiaan
    [J]. THIRD INTERNATIONAL CONFERENCE ON ENERGY ENGINEERING AND ENVIRONMENTAL PROTECTION, 2019, 227