Mutual information based weight initialization method for sigmoidal feedforward neural networks

被引:29
|
作者
Qiao, Junfei [1 ,2 ]
Li, Sanyi [1 ,2 ]
Li, Wenjing [1 ,2 ]
机构
[1] Beijing Univ Technol, Coll Elect Informat & Control Engn, Beijing 100124, Peoples R China
[2] Beijing Key Lab Corhputat Intelligence & Intellig, Beijing 100124, Peoples R China
基金
中国国家自然科学基金; 中国博士后科学基金;
关键词
Sigthoidal feedforward neural network; Weight initialization; Mutual information; SEQUENCING BATCH REACTOR; MULTILAYER PERCEPTRON; VARIABLE SELECTION; TRAINING SPEED; BACKPROPAGATION; ALGORITHM; RELEVANCE; VALUES; SERIES;
D O I
10.1016/j.neucom.2016.05.054
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
When a sigmoidal feedforward neural network (SFNN) is trained by the gradient-based algorithms, the quality of the overall learning process strongly depends on the initial weights. To improve the algorithm stability and avoid local minima, a Mutual Information based weight initialization (MIWI) method is proposed for SFNN. The useful information contained in input variables is measured with the mutual information (MI) between input variables and output variables. The initial distribution of weights is consistent with the information distribution in the input variables. The lower and upper bounds of the weights range are calculated to ensure the neurons inputs are within the active region of sigmoid function. The MIWI method makes the initial weights close to the global optimal point with a higher probability and avoids premature saturation. The efficiency of the MIWI method is evaluated based on several benchmark problems. The experimental results show that the stability and accuracy of the proposed method are better than some other weight initialization methods. (C) 2016 Elsevier B.V. All rights reserved.
引用
收藏
页码:676 / 683
页数:8
相关论文
共 50 条
  • [1] Interval Based Weight Initialization Method for Sigmoidal Feedforward Artificial Neural Networks
    Sodhi, Sartaj Singh
    Chandra, Pravin
    [J]. 2ND AASRI CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND BIOINFORMATICS, 2014, 6 : 19 - 25
  • [2] A New Weight Initialization Method for Sigmoidal Feedforward Artificial Neural Networks
    Sodhi, Sartaj Singh
    Chandra, Pravin
    Tanwar, Sharad
    [J]. PROCEEDINGS OF THE 2014 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2014, : 291 - 298
  • [3] Weight and bias initialization routines for Sigmoidal Feedforward Network
    Apeksha Mittal
    Amit Prakash Singh
    Pravin Chandra
    [J]. Applied Intelligence, 2021, 51 : 2651 - 2671
  • [4] Weight and bias initialization routines for Sigmoidal Feedforward Network
    Mittal, Apeksha
    Singh, Amit Prakash
    Chandra, Pravin
    [J]. APPLIED INTELLIGENCE, 2021, 51 (04) : 2651 - 2671
  • [5] An overview on weight initialization methods for feedforward neural networks
    de Sousa, Celso A. R.
    [J]. 2016 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2016, : 52 - 59
  • [6] Analyzing weight distribution of feedforward neural networks and efficient weight initialization
    Go, J
    Baek, B
    Lee, C
    [J]. STRUCTURAL, SYNTACTIC, AND STATISTICAL PATTERN RECOGNITION, PROCEEDINGS, 2004, 3138 : 840 - 849
  • [7] An interval approach for weight's initialization of feedforward neural networks
    Jamett, Marcela
    Acuna, Gonzalo
    [J]. MICAI 2006: ADVANCES IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2006, 4293 : 305 - +
  • [8] An effective SteinGLM initialization scheme for training multi-layer feedforward sigmoidal neural networks
    Yang, Zebin
    Zhang, Hengtao
    Sudjianto, Agus
    Zhang, Aijun
    [J]. NEURAL NETWORKS, 2021, 139 : 149 - 157
  • [9] Feedforward neural networks initialization based on discriminant learning
    Chumachenko, Kateryna
    Iosifidis, Alexandros
    Gabbouj, Moncef
    [J]. NEURAL NETWORKS, 2022, 146 : 220 - 229
  • [10] A new weight initialization method for sigmoidal FFANN
    Bhatia, M. P. S.
    Veenu
    Chandra, Pravin
    [J]. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2018, 35 (05) : 5193 - 5201