Forward layer-wise learning of convolutional neural networks through separation index maximizing

被引:0
|
作者
Karimi, Ali [1 ]
Kalhor, Ahmad [1 ]
Tabrizi, Melika Sadeghi [1 ]
机构
[1] Univ Tehran, Coll Engn, Sch Elect & Comp Engn, Tehran, Iran
关键词
D O I
10.1038/s41598-024-59176-3
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
This paper proposes a forward layer-wise learning algorithm for CNNs in classification problems. The algorithm utilizes the Separation Index (SI) as a supervised complexity measure to evaluate and train each layer in a forward manner. The proposed method explains that gradually increasing the SI through layers reduces the input data's uncertainties and disturbances, achieving a better feature space representation. Hence, by approximating the SI with a variant of local triplet loss at each layer, a gradient-based learning algorithm is suggested to maximize it. Inspired by the NGRAD (Neural Gradient Representation by Activity Differences) hypothesis, the proposed algorithm operates in a forward manner without explicit error information from the last layer. The algorithm's performance is evaluated on image classification tasks using VGG16, VGG19, AlexNet, and LeNet architectures with CIFAR-10, CIFAR-100, Raabin-WBC, and Fashion-MNIST datasets. Additionally, the experiments are applied to text classification tasks using the DBPedia and AG's News datasets. The results demonstrate that the proposed layer-wise learning algorithm outperforms state-of-the-art methods in accuracy and time complexity.
引用
收藏
页数:12
相关论文
共 50 条
  • [31] Layer-Wise Training Convolutional Neural Networks With Smaller Filters for Human Activity Recognition Using Wearable Sensors
    Tang, Yin
    Teng, Qi
    Zhang, Lei
    Min, Fuhong
    He, Jun
    [J]. IEEE SENSORS JOURNAL, 2021, 21 (01) : 581 - 592
  • [32] LAYER-WISE INTERPRETATION OF DEEP NEURAL NETWORKS USING IDENTITY INITIALIZATION
    Kubota, Shohei
    Hayashi, Hideaki
    Hayase, Tomohiro
    Uchida, Seiichi
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 3945 - 3949
  • [33] Network with Sub-networks: Layer-wise Detachable Neural Network
    Fuengfusin, Ninnart
    Tamukoh, Hakaru
    [J]. JOURNAL OF ROBOTICS NETWORKING AND ARTIFICIAL LIFE, 2021, 7 (04): : 240 - 244
  • [34] Layer-Wise Relevance Propagation for Neural Networks with Local Renormalization Layers
    Binder, Alexander
    Montavon, Gregoire
    Lapuschkin, Sebastian
    Mueller, Klaus-Robert
    Samek, Wojciech
    [J]. ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2016, PT II, 2016, 9887 : 63 - 71
  • [35] Modelling and identification of characteristic kinematic features preceding freezing of gait with convolutional neural networks and layer-wise relevance propagation
    Filtjens, Benjamin
    Ginis, Pieter
    Nieuwboer, Alice
    Afzal, Muhammad Raheel
    Spildooren, Joke
    Vanrumste, Bart
    Slaets, Peter
    [J]. BMC MEDICAL INFORMATICS AND DECISION MAKING, 2021, 21 (01)
  • [36] Handwritten Devanagari Character Recognition Using Layer-Wise Training of Deep Convolutional Neural Networks and Adaptive Gradient Methods
    Jangid, Mahesh
    Srivastava, Sumit
    [J]. JOURNAL OF IMAGING, 2018, 4 (02)
  • [37] The Layer-Wise Training Convolutional Neural Networks Using Local Loss for Sensor-Based Human Activity Recognition
    Teng, Qi
    Wang, Kun
    Zhang, Lei
    He, Jun
    [J]. IEEE SENSORS JOURNAL, 2020, 20 (13) : 7265 - 7274
  • [38] Modelling and identification of characteristic kinematic features preceding freezing of gait with convolutional neural networks and layer-wise relevance propagation
    Benjamin Filtjens
    Pieter Ginis
    Alice Nieuwboer
    Muhammad Raheel Afzal
    Joke Spildooren
    Bart Vanrumste
    Peter Slaets
    [J]. BMC Medical Informatics and Decision Making, 21
  • [39] A Layer-Wise Surface Deformation Defect Detection by Convolutional Neural Networks in Laser Powder-Bed Fusion Images
    Ansari, Muhammad Ayub
    Crampton, Andrew
    Parkinson, Simon
    [J]. MATERIALS, 2022, 15 (20)
  • [40] Layer-wise synapse optimization for implementing neural networks on general neuromorphic architectures
    Mern, John
    Gupta, Jayesh K.
    Kochenderfer, Mykel J.
    [J]. 2017 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (SSCI), 2017, : 3314 - 3321