Mutual information based weight initialization method for sigmoidal feedforward neural networks

被引：29

作者：

Qiao, Junfei ^{[1
,2
]}

Li, Sanyi ^{[1
,2
]}

Li, Wenjing ^{[1
,2
]}

机构：

[1] Beijing Univ Technol, Coll Elect Informat & Control Engn, Beijing 100124, Peoples R China

[2] Beijing Key Lab Corhputat Intelligence & Intellig, Beijing 100124, Peoples R China

来源：

NEUROCOMPUTING | 2016年 / 207卷

基金：

中国国家自然科学基金; 中国博士后科学基金;

关键词：

Sigthoidal feedforward neural network; Weight initialization; Mutual information; SEQUENCING BATCH REACTOR; MULTILAYER PERCEPTRON; VARIABLE SELECTION; TRAINING SPEED; BACKPROPAGATION; ALGORITHM; RELEVANCE; VALUES; SERIES;

D O I：

10.1016/j.neucom.2016.05.054

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

When a sigmoidal feedforward neural network (SFNN) is trained by the gradient-based algorithms, the quality of the overall learning process strongly depends on the initial weights. To improve the algorithm stability and avoid local minima, a Mutual Information based weight initialization (MIWI) method is proposed for SFNN. The useful information contained in input variables is measured with the mutual information (MI) between input variables and output variables. The initial distribution of weights is consistent with the information distribution in the input variables. The lower and upper bounds of the weights range are calculated to ensure the neurons inputs are within the active region of sigmoid function. The MIWI method makes the initial weights close to the global optimal point with a higher probability and avoids premature saturation. The efficiency of the MIWI method is evaluated based on several benchmark problems. The experimental results show that the stability and accuracy of the proposed method are better than some other weight initialization methods. (C) 2016 Elsevier B.V. All rights reserved.

引用

页码：676 / 683

页数：8

共 50 条

[1] Interval Based Weight Initialization Method for Sigmoidal Feedforward Artificial Neural Networks
Sodhi, Sartaj Singh
Chandra, Pravin
[J]. 2ND AASRI CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND BIOINFORMATICS, 2014, 6 : 19 - 25
[2] A New Weight Initialization Method for Sigmoidal Feedforward Artificial Neural Networks
Sodhi, Sartaj Singh
Chandra, Pravin
Tanwar, Sharad
[J]. PROCEEDINGS OF THE 2014 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2014, : 291 - 298
[3] Weight and bias initialization routines for Sigmoidal Feedforward Network
Apeksha Mittal
Amit Prakash Singh
Pravin Chandra
[J]. Applied Intelligence, 2021, 51 : 2651 - 2671
[4] Weight and bias initialization routines for Sigmoidal Feedforward Network
Mittal, Apeksha
Singh, Amit Prakash
Chandra, Pravin
[J]. APPLIED INTELLIGENCE, 2021, 51 (04) : 2651 - 2671
[5] An overview on weight initialization methods for feedforward neural networks
de Sousa, Celso A. R.
[J]. 2016 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2016, : 52 - 59
[6] Analyzing weight distribution of feedforward neural networks and efficient weight initialization
Go, J
Baek, B
Lee, C
[J]. STRUCTURAL, SYNTACTIC, AND STATISTICAL PATTERN RECOGNITION, PROCEEDINGS, 2004, 3138 : 840 - 849
[7] An interval approach for weight's initialization of feedforward neural networks
Jamett, Marcela
Acuna, Gonzalo
[J]. MICAI 2006: ADVANCES IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2006, 4293 : 305 - +
[8] An effective SteinGLM initialization scheme for training multi-layer feedforward sigmoidal neural networks
Yang, Zebin
Zhang, Hengtao
Sudjianto, Agus
Zhang, Aijun
[J]. NEURAL NETWORKS, 2021, 139 : 149 - 157
[9] Feedforward neural networks initialization based on discriminant learning
Chumachenko, Kateryna
Iosifidis, Alexandros
Gabbouj, Moncef
[J]. NEURAL NETWORKS, 2022, 146 : 220 - 229
[10] A new weight initialization method for sigmoidal FFANN
Bhatia, M. P. S.
Veenu
Chandra, Pravin
[J]. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2018, 35 (05) : 5193 - 5201

← 1 2 3 4 5 →