An exploration of the uncertainty relation satisfied by BP network learning ability and generalization ability

被引:4
|
作者
Li, ZY
Peng, LH [1 ]
机构
[1] Xiamen Univ, Environm Sci Res Ctr, Xiamen 361005, Peoples R China
[2] Chengdu Univ Informat Technol, Chengdu 610041, Peoples R China
来源
基金
中国国家自然科学基金;
关键词
BP network; learning ability; generalization ability; overfit relation; network structure optimization;
D O I
10.1360/02yf0331
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper analyses the intrinsic relationship between the BP network learning ability and generalization ability and other influencing factors when the overfit occurs, and introduces the multiple correlation coefficient to describe the complexity of samples; it follows the calculation uncertainty principle and the minimum principle of neural network structural design, provides an analogy of the general uncertainty relation in the information transfer process, and ascertains the uncertainty relation between the training relative error of the training sample set, which reflects the network learning ability, and the test relative error of the test sample set, which represents the network generalization ability; through the simulation of BP network overfit numerical modeling test with different types of functions, it is ascertained that the overfit parameter q in the relation generally has a span of 7 x 10(-3) to 7 x 10(-2); the uncertainty relation then helps to obtain the formula for calculating the number of hidden nodes of a network with good generalization ability under the condition that multiple correlation coefficient is used to describe sample complexity and the given approximation error requirement is satisfied; the rationality of this formula is verified; this paper also points out that applying the BP network to the training process of the given sample set is the best method for stopping training that improves the generalization ability.
引用
收藏
页码:137 / 150
页数:14
相关论文
共 50 条
  • [21] Solution of an optimal routing problem by reinforcement learning with generalization ability
    Iima H.
    Oonishi H.
    IEEJ Transactions on Electronics, Information and Systems, 2019, 139 (12) : 1494 - 1500
  • [22] Enhancing Generalization Ability in Deepfake Detection via Continual Learning
    Usmani, Shaheen
    Kumar, Sunil
    Sadhya, Debanjan
    PROCEEDINGS OF FIFTEENTH INDIAN CONFERENCE ON COMPUTER VISION, GRAPHICS AND IMAGE PROCESSING, ICVGIP 2024, 2024,
  • [23] A learning algorithm for enhancing the generalization ability of support vector machines
    Guo, J
    Takahashi, N
    Nishi, T
    2005 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), VOLS 1-6, CONFERENCE PROCEEDINGS, 2005, : 3631 - 3634
  • [24] Uncertainty relation suited to overfitting of BP neural network
    Li, ZY
    Deng, XM
    JOURNAL OF INFRARED AND MILLIMETER WAVES, 2000, 19 (02) : 142 - 144
  • [25] The Evaluation of Scientific Reasoning Ability Based on BP Neural Network
    Peng, Liangyu
    Bao, Lei
    Du, Chunhui
    HIGH PERFORMANCE NETWORKING, COMPUTING, AND COMMUNICATION SYSTEMS, 2011, 163 : 133 - +
  • [26] Analysis of quantum neural network learning ability
    Zhong, Yan-hua
    Man, Chang-qing
    APPLIED MATHEMATICS & INFORMATION SCIENCES, 2013, 7 (02): : 679 - 683
  • [27] SteganoCNN: Image Steganography with Generalization Ability Based on Convolutional Neural Network
    Duan, Xintao
    Liu, Nao
    Gou, Mengxiao
    Wang, Wenxin
    Qin, Chuan
    ENTROPY, 2020, 22 (10) : 1 - 15
  • [28] Regularization theory in the study of generalization ability of a biological neural network model
    Aleksandra Świetlicka
    Advances in Computational Mathematics, 2019, 45 : 1793 - 1805
  • [29] A deep convolutional neural network for topology optimization with perceptible generalization ability
    Wang, Dalei
    Xiang, Cheng
    Pan, Yue
    Chen, Airong
    Zhou, Xiaoyi
    Zhang, Yiquan
    ENGINEERING OPTIMIZATION, 2022, 54 (06) : 973 - 988
  • [30] Regularization theory in the study of generalization ability of a biological neural network model
    Swietlicka, Aleksandra
    ADVANCES IN COMPUTATIONAL MATHEMATICS, 2019, 45 (04) : 1793 - 1805