An exploration of the uncertainty relation satisfied by BP network learning ability and generalization ability

被引:4
|
作者
Li, ZY
Peng, LH [1 ]
机构
[1] Xiamen Univ, Environm Sci Res Ctr, Xiamen 361005, Peoples R China
[2] Chengdu Univ Informat Technol, Chengdu 610041, Peoples R China
来源
基金
中国国家自然科学基金;
关键词
BP network; learning ability; generalization ability; overfit relation; network structure optimization;
D O I
10.1360/02yf0331
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper analyses the intrinsic relationship between the BP network learning ability and generalization ability and other influencing factors when the overfit occurs, and introduces the multiple correlation coefficient to describe the complexity of samples; it follows the calculation uncertainty principle and the minimum principle of neural network structural design, provides an analogy of the general uncertainty relation in the information transfer process, and ascertains the uncertainty relation between the training relative error of the training sample set, which reflects the network learning ability, and the test relative error of the test sample set, which represents the network generalization ability; through the simulation of BP network overfit numerical modeling test with different types of functions, it is ascertained that the overfit parameter q in the relation generally has a span of 7 x 10(-3) to 7 x 10(-2); the uncertainty relation then helps to obtain the formula for calculating the number of hidden nodes of a network with good generalization ability under the condition that multiple correlation coefficient is used to describe sample complexity and the given approximation error requirement is satisfied; the rationality of this formula is verified; this paper also points out that applying the BP network to the training process of the given sample set is the best method for stopping training that improves the generalization ability.
引用
收藏
页码:137 / 150
页数:14
相关论文
共 50 条
  • [31] Role of function complexity and network size in the generalization ability of feedforward networks
    Franco, L
    Jerez, JM
    Bravo, JM
    COMPUTATIONAL INTELLIGENCE AND BIOINSPIRED SYSTEMS, PROCEEDINGS, 2005, 3512 : 1 - 8
  • [32] GENERALIZATION ABILITY OF EXTENDED CASCADED ARTIFICIAL NEURAL-NETWORK ARCHITECTURE
    KAMRUZZAMAN, J
    KUMAGAI, Y
    HIKITA, H
    IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 1993, E76A (10) : 1877 - 1883
  • [33] Exploration and Practice on Undergraduate Research Ability Cultivation in Network Environment
    Hao, Shangfu
    Hao, Hui
    Guo, Zhenghong
    Ren, Honghong
    ADVANCES IN COMPUTER SCIENCE, ENVIRONMENT, ECOINFORMATICS, AND EDUCATION, PT III, 2011, 216 : 568 - +
  • [34] Spatial generalization ability analysis of deep learning crop classification models
    Ge S.
    Zhang J.
    Zhu S.
    National Remote Sensing Bulletin, 2023, 27 (12) : 2796 - 2814
  • [35] Generalization ability of extreme learning machine with uniformly ergodic Markov chains
    Yuan, Peipei
    Chen, Hong
    Zhou, Yicong
    Deng, Xiaoyan
    Zou, Bin
    NEUROCOMPUTING, 2015, 167 : 528 - 534
  • [36] Random CNN structure: tool to increase generalization ability in deep learning
    Bartosz Swiderski
    Stanislaw Osowski
    Grzegorz Gwardys
    Jaroslaw Kurek
    Monika Slowinska
    Iwona Lugowska
    EURASIP Journal on Image and Video Processing, 2022
  • [37] Random CNN structure: tool to increase generalization ability in deep learning
    Swiderski, Bartosz
    Osowski, Stanislaw
    Gwardys, Grzegorz
    Kurek, Jaroslaw
    Slowinska, Monika
    Lugowska, Iwona
    EURASIP JOURNAL ON IMAGE AND VIDEO PROCESSING, 2022, 2022 (01)
  • [38] IMPROVING THE GENERALIZATION ABILITY OF DEEPFAKE DETECTION VIA DISENTANGLED REPRESENTATION LEARNING
    Hu, Jiashang
    Wang, Shilin
    Li, Xiaoyong
    2021 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2021, : 3577 - 3581
  • [39] Individual Differences in Syntactic Ability and Construction Learning: An Exploration of the Relationship
    Riches, Nick
    Jackson, Laura
    LANGUAGE LEARNING, 2018, 68 (04) : 973 - 1000
  • [40] Exploration of innovative learning ability cultivation based on logistic regression
    Qi, Chengming
    Hu, Lishuan
    APPLIED MATHEMATICS AND NONLINEAR SCIENCES, 2022, 7 (02) : 1085 - 1092