An exploration of the uncertainty relation satisfied by BP network learning ability and generalization ability

被引:6
|
作者
Li, ZY
Peng, LH [1 ]
机构
[1] Xiamen Univ, Environm Sci Res Ctr, Xiamen 361005, Peoples R China
[2] Chengdu Univ Informat Technol, Chengdu 610041, Peoples R China
来源
基金
中国国家自然科学基金;
关键词
BP network; learning ability; generalization ability; overfit relation; network structure optimization;
D O I
10.1360/02yf0331
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper analyses the intrinsic relationship between the BP network learning ability and generalization ability and other influencing factors when the overfit occurs, and introduces the multiple correlation coefficient to describe the complexity of samples; it follows the calculation uncertainty principle and the minimum principle of neural network structural design, provides an analogy of the general uncertainty relation in the information transfer process, and ascertains the uncertainty relation between the training relative error of the training sample set, which reflects the network learning ability, and the test relative error of the test sample set, which represents the network generalization ability; through the simulation of BP network overfit numerical modeling test with different types of functions, it is ascertained that the overfit parameter q in the relation generally has a span of 7 x 10(-3) to 7 x 10(-2); the uncertainty relation then helps to obtain the formula for calculating the number of hidden nodes of a network with good generalization ability under the condition that multiple correlation coefficient is used to describe sample complexity and the given approximation error requirement is satisfied; the rationality of this formula is verified; this paper also points out that applying the BP network to the training process of the given sample set is the best method for stopping training that improves the generalization ability.
引用
收藏
页码:137 / 150
页数:14
相关论文
共 50 条
  • [21] Uncertainty relation suited to overfitting of BP neural network
    Li, ZY
    Deng, XM
    [J]. JOURNAL OF INFRARED AND MILLIMETER WAVES, 2000, 19 (02) : 142 - 144
  • [22] The Evaluation of Scientific Reasoning Ability Based on BP Neural Network
    Peng, Liangyu
    Bao, Lei
    Du, Chunhui
    [J]. HIGH PERFORMANCE NETWORKING, COMPUTING, AND COMMUNICATION SYSTEMS, 2011, 163 : 133 - +
  • [23] Analysis of quantum neural network learning ability
    Zhong, Yan-hua
    Man, Chang-qing
    [J]. APPLIED MATHEMATICS & INFORMATION SCIENCES, 2013, 7 (02): : 679 - 683
  • [24] SteganoCNN: Image Steganography with Generalization Ability Based on Convolutional Neural Network
    Duan, Xintao
    Liu, Nao
    Gou, Mengxiao
    Wang, Wenxin
    Qin, Chuan
    [J]. ENTROPY, 2020, 22 (10) : 1 - 15
  • [25] A deep convolutional neural network for topology optimization with perceptible generalization ability
    Wang, Dalei
    Xiang, Cheng
    Pan, Yue
    Chen, Airong
    Zhou, Xiaoyi
    Zhang, Yiquan
    [J]. ENGINEERING OPTIMIZATION, 2022, 54 (06) : 973 - 988
  • [26] Regularization theory in the study of generalization ability of a biological neural network model
    Swietlicka, Aleksandra
    [J]. ADVANCES IN COMPUTATIONAL MATHEMATICS, 2019, 45 (04) : 1793 - 1805
  • [27] Regularization theory in the study of generalization ability of a biological neural network model
    Aleksandra Świetlicka
    [J]. Advances in Computational Mathematics, 2019, 45 : 1793 - 1805
  • [28] GENERALIZATION ABILITY OF EXTENDED CASCADED ARTIFICIAL NEURAL-NETWORK ARCHITECTURE
    KAMRUZZAMAN, J
    KUMAGAI, Y
    HIKITA, H
    [J]. IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 1993, E76A (10) : 1877 - 1883
  • [29] Role of function complexity and network size in the generalization ability of feedforward networks
    Franco, L
    Jerez, JM
    Bravo, JM
    [J]. COMPUTATIONAL INTELLIGENCE AND BIOINSPIRED SYSTEMS, PROCEEDINGS, 2005, 3512 : 1 - 8
  • [30] Exploration and Practice on Undergraduate Research Ability Cultivation in Network Environment
    Hao, Shangfu
    Hao, Hui
    Guo, Zhenghong
    Ren, Honghong
    [J]. ADVANCES IN COMPUTER SCIENCE, ENVIRONMENT, ECOINFORMATICS, AND EDUCATION, PT III, 2011, 216 : 568 - +