On the Asymptotic Learning Curves of Kernel Ridge Regression under Power-law Decay

Cited by: 0
Authors
Li, Yicheng [1]
Zhang, Haobo [1]
Lin, Qian [1]
Affiliations
[1] Tsinghua Univ, Ctr Stat Sci, Dept Ind Engn, Beijing, Peoples R China
Funding
Beijing Natural Science Foundation; National Natural Science Foundation of China
Keywords
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
The widely observed 'benign overfitting phenomenon' in the neural network literature challenges the 'bias-variance trade-off' doctrine in statistical learning theory. Since the generalization ability of the 'lazy trained' over-parametrized neural network can be well approximated by that of neural tangent kernel regression, the curve of the excess risk (namely, the learning curve) of kernel ridge regression has attracted increasing attention recently. However, most recent arguments on the learning curve are heuristic and are based on the 'Gaussian design' assumption. In this paper, under mild and more realistic assumptions, we rigorously provide a full characterization of the learning curve in the asymptotic sense under a power-law decay condition on the eigenvalues of the kernel and on the target function. The learning curve elaborates the effect and the interplay of the choice of the regularization parameter, the source condition and the noise. In particular, our results suggest that the 'benign overfitting phenomenon' exists in over-parametrized neural networks only when the noise level is small.
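As a rough illustration of the quantity the abstract refers to, the sketch below estimates the excess risk of kernel ridge regression on synthetic data. Everything in it is an assumption made for illustration and is not taken from the paper: the kernel k(x, y) = min(x, y) on [0, 1] (whose eigenvalues under the uniform distribution decay like j^{-2}, a power law), the target function sin(2πx), and the function names `min_kernel` and `krr_excess_risk`.

```python
import numpy as np


def min_kernel(x, y):
    # Assumed kernel: k(x, y) = min(x, y) on [0, 1].
    # Its eigenvalues are 1 / ((j - 1/2)^2 * pi^2), i.e. power-law decay j^{-2}.
    return np.minimum(x[:, None], y[None, :])


def krr_excess_risk(n, lam, sigma, rng, n_test=2000):
    """Monte Carlo estimate of the excess risk of kernel ridge regression.

    n     : number of training samples
    lam   : regularization parameter (must be > 0 here)
    sigma : standard deviation of the additive Gaussian noise
    rng   : numpy random Generator
    """
    # Hypothetical target function, chosen so that f(0) = 0 like the kernel.
    def f_star(x):
        return np.sin(2 * np.pi * x)

    x = rng.uniform(0.0, 1.0, n)
    y = f_star(x) + sigma * rng.standard_normal(n)

    # Standard KRR estimator: f_hat(x) = k(x, X) (K + n * lam * I)^{-1} y
    K = min_kernel(x, x)
    alpha = np.linalg.solve(K + n * lam * np.eye(n), y)

    # Excess risk approximated by the mean squared error on a uniform grid.
    x_test = np.linspace(0.0, 1.0, n_test)
    pred = min_kernel(x_test, x) @ alpha
    return float(np.mean((pred - f_star(x_test)) ** 2))


if __name__ == "__main__":
    # Sweep the regularization parameter at two noise levels.
    for sigma in (0.0, 1.0):
        for lam in (1e-6, 1e-3, 1e-1):
            rng = np.random.default_rng(0)
            risk = krr_excess_risk(200, lam, sigma, rng)
            print(f"sigma={sigma}, lam={lam:g}, excess risk={risk:.4g}")
```

In this toy setting, a nearly interpolating fit (very small `lam`) should give a small excess risk when the noise is absent and a much larger one when the noise is large, which is consistent in spirit with the abstract's closing claim. It is a numerical illustration only and does not reproduce the paper's asymptotic rates.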
Pages: 24