An exploration of the uncertainty relation satisfied by BP network learning ability and generalization ability

被引:6
|
作者
Li, ZY
Peng, LH [1 ]
机构
[1] Xiamen Univ, Environm Sci Res Ctr, Xiamen 361005, Peoples R China
[2] Chengdu Univ Informat Technol, Chengdu 610041, Peoples R China
来源
基金
中国国家自然科学基金;
关键词
BP network; learning ability; generalization ability; overfit relation; network structure optimization;
D O I
10.1360/02yf0331
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper analyses the intrinsic relationship between the BP network learning ability and generalization ability and other influencing factors when the overfit occurs, and introduces the multiple correlation coefficient to describe the complexity of samples; it follows the calculation uncertainty principle and the minimum principle of neural network structural design, provides an analogy of the general uncertainty relation in the information transfer process, and ascertains the uncertainty relation between the training relative error of the training sample set, which reflects the network learning ability, and the test relative error of the test sample set, which represents the network generalization ability; through the simulation of BP network overfit numerical modeling test with different types of functions, it is ascertained that the overfit parameter q in the relation generally has a span of 7 x 10(-3) to 7 x 10(-2); the uncertainty relation then helps to obtain the formula for calculating the number of hidden nodes of a network with good generalization ability under the condition that multiple correlation coefficient is used to describe sample complexity and the given approximation error requirement is satisfied; the rationality of this formula is verified; this paper also points out that applying the BP network to the training process of the given sample set is the best method for stopping training that improves the generalization ability.
引用
收藏
页码:137 / 150
页数:14
相关论文
共 50 条
  • [41] An Accurate Outlier Rejection Network With Higher Generalization Ability for Point Cloud Registration
    Guo, Shiyi
    Tang, Fulin
    Liu, Bingxi
    Fu, Yujie
    Wu, Yihong
    [J]. IEEE ROBOTICS AND AUTOMATION LETTERS, 2023, 8 (08) : 4649 - 4656
  • [42] Enhancing the generalization ability of deep learning model for radio signal modulation recognition
    Faquan Wang
    Yucheng Zhou
    Hanzhi Yan
    Ruisen Luo
    [J]. Applied Intelligence, 2023, 53 : 18758 - 18774
  • [43] INTELLIGENCE AND BRAIN-DAMAGE IN THEIR RELATION TO INTELLECTUAL LEARNING ABILITY
    BECKER, P
    SCHMIDTKE, A
    [J]. HEILPADAGOGISCHE FORSCHUNG, 1977, 7 (02): : 186 - 207
  • [44] Generalization Ability of Bagging and Boosting Type Deep Learning Models in Evapotranspiration Estimation
    Kumar, Manoranjan
    Agrawal, Yash
    Adamala, Sirisha
    Subbarao, A. V. M.
    Singh, V. K.
    Srivastava, Ankur
    [J]. WATER, 2024, 16 (16)
  • [45] RELATION BETWEEN INTELLECTUAL ABILITY AND WORKING METHOD AS PREDICTORS OF LEARNING
    ELSHOUT, JJ
    VEENMAN, MVJ
    [J]. JOURNAL OF EDUCATIONAL RESEARCH, 1992, 85 (03): : 134 - 143
  • [46] Enhancing the generalization ability of deep learning model for radio signal modulation recognition
    Wang, Faquan
    Zhou, Yucheng
    Yan, Hanzhi
    Luo, Ruisen
    [J]. APPLIED INTELLIGENCE, 2023, 53 (15) : 18758 - 18774
  • [47] Heterogeneity in generalized reinforcement learning and its relation to cognitive ability
    Chen, Shu-Heng
    Du, Ye-Rong
    [J]. COGNITIVE SYSTEMS RESEARCH, 2017, 42 : 1 - 22
  • [48] Improvement of generalization ability for identifying dynamical systems by using universal learning networks
    Hirasawa, K
    Kim, S
    Hu, JL
    Murata, J
    Han, M
    Jin, CZ
    [J]. NEURAL NETWORKS, 2001, 14 (10) : 1389 - 1404
  • [49] Improvement of generalization ability for identifying dynamic systems by using universal learning networks
    Kim, SH
    Hirasawa, K
    Hu, JL
    [J]. IJCNN'01: INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-4, PROCEEDINGS, 2001, : 1203 - 1208
  • [50] Accurate Estimates of the Generalization Ability for Symmetric Set of Predictors and Randomized Learning Algorithms
    Frei A.I.
    [J]. Pattern Recognition and Image Analysis, 2010, 20 (3) : 241 - 250