The Slow Deterioration of the Generalization Error of the Random Feature Model

被引:0
|
作者
Ma, Chao [1 ]
Wu, Lei [1 ]
Weinan, E. [1 ,2 ]
机构
[1] Princeton Univ, PACM, Princeton, NJ 08544 USA
[2] Princeton Univ, Dept Math, Princeton, NJ 08544 USA
关键词
Random feature model; gradient descent; generalization error; early stopping; Gram matrix;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The random feature model exhibits a kind of resonance behavior when the number of parameters is close to the training sample size. This behavior is characterized by the appearance of large generalization gap, and is due to the occurrence of very small eigenvalues for the associated Gram matrix. In this paper, we examine the dynamic behavior of the gradient descent algorithm in this regime. We show, both theoretically and experimentally, that there is a dynamic self-correction mechanism at work: The larger the eventual generalization gap, the slower it develops, both because of the small eigenvalues. This gives us ample time to stop the training process and obtain solutions with good generalization property.
引用
收藏
页码:373 / +
页数:18
相关论文
共 50 条
  • [1] Conditioning of random Fourier feature matrices: double descent and generalization error
    Chen, Zhijun
    Schaeffer, Hayden
    [J]. INFORMATION AND INFERENCE-A JOURNAL OF THE IMA, 2024, 13 (02)
  • [2] Exact Gap between Generalization Error and Uniform Convergence in Random Feature Models
    Yang, Zitong
    Bai, Yu
    Mei, Song
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
  • [3] Generalization error of random feature and kernel methods: Hypercontractivity and kernel matrix concentration
    Mei, Song
    Misiakiewicz, Theodor
    Montanari, Andrea
    [J]. APPLIED AND COMPUTATIONAL HARMONIC ANALYSIS, 2022, 59 : 3 - 84
  • [4] Feature Weighting for RBFNN based on Genetic Algorithm and Localized Generalization Error Model
    Chen, Tianyu
    Wang, Ting
    Ng, Wing W. Y.
    Zhang, Jianjun
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON INTERNET OF THINGS (ITHINGS) AND IEEE GREEN COMPUTING AND COMMUNICATIONS (GREENCOM) AND IEEE CYBER, PHYSICAL AND SOCIAL COMPUTING (CPSCOM) AND IEEE SMART DATA (SMARTDATA), 2017, : 306 - 311
  • [5] The random deterioration rate model with measurement error based on the inverse Gaussian distribution
    Morita, Lia H. M.
    Tomazella, Vera L. D.
    Ramos, Pedro L.
    Ferreira, Paulo H.
    Louzada, Francisco
    [J]. BRAZILIAN JOURNAL OF PROBABILITY AND STATISTICS, 2021, 35 (01) : 187 - 204
  • [6] Random Feature Amplification: Feature Learning and Generalization in Neural Networks
    Frei, Spencer
    Chatterji, Niladri S.
    Bartlett, Peter L.
    [J]. JOURNAL OF MACHINE LEARNING RESEARCH, 2023, 24
  • [7] Estimation of generalization error: Random and fixed inputs
    Wang, Junhui
    Shen, Xiaotong
    [J]. STATISTICA SINICA, 2006, 16 (02) : 569 - 588
  • [8] Generalization bounds for sparse random feature expansions ?
    Hashemi, Abolfazl
    Schaeffer, Hayden
    Shi, Robert
    Topcu, Ufuk
    Tran, Giang
    Ward, Rachel
    [J]. APPLIED AND COMPUTATIONAL HARMONIC ANALYSIS, 2023, 62 : 310 - 330
  • [9] Confidence intervals for the random forest generalization error
    Marques, F. Paulo C.
    [J]. PATTERN RECOGNITION LETTERS, 2022, 158 : 171 - 175
  • [10] SENSITIVITY BASED GENERALIZATION ERROR FOR SUPERVISED LEARNING PROBLEM WITH APPLICATIONS IN MODEL SELECTION AND FEATURE SELECTION
    Yeung, Daniel S.
    [J]. ICINCO 2009: PROCEEDINGS OF THE 6TH INTERNATIONAL CONFERENCE ON INFORMATICS IN CONTROL, AUTOMATION AND ROBOTICS, VOL 1: INTELLIGENT CONTROL SYSTEMS AND OPTIMIZATION, 2009, : IS5 - IS5