Empirical risk minimization in the interpolating regime with application to neural network learning

被引:0
|
作者
Muecke, Nicole [1 ]
Steinwart, Ingo [2 ]
机构
[1] Tech Univ Carolo Wilhelmina Braunschweig, Inst Math Stochast, Braunschweig, Germany
[2] Univ Stuttgart, Inst Stochast & Applicat, Stuttgart, Germany
关键词
Neural network learning; Overparameterization; Interpolation; Empirical risk minimization; DEEP; CONVERGENCE;
D O I
10.1007/s10994-025-06738-9
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A common strategy to train deep neural networks (DNNs) is to use very large architectures and to train them until they (almost) achieve zero training error. Empirically observed good generalization performance on test data, even in the presence of lots of label noise, corroborate such a procedure. On the other hand, in statistical learning theory it is known that over-fitting models may lead to poor generalization properties, occurring in e.g. empirical risk minimization (ERM) over too large hypotheses classes. Inspired by this contradictory behavior, so-called interpolation methods have recently received much attention, leading to consistent and optimally learning methods for, e.g., some local averaging schemes with zero training error. We extend this analysis to ERM-like methods for least squares regression and show that for certain, large hypotheses classes called inflated histograms, some interpolating empirical risk minimizers enjoy very good statistical guarantees while others fail in the worst sense. Moreover, we show that the same phenomenon occurs for DNNs with zero training error and sufficiently large architectures.
引用
收藏
页数:52
相关论文
共 50 条
  • [41] Application of a neural network in the empirical design of underground excavation spans
    Wang, J.
    Milne, D.
    Pakalnis, R.
    Transactions of the Institutions of Mining and Metallurgy, Section A: Mining Technology, 2002, 111
  • [42] Application of a neural network in the empirical design of underground excavation spans
    Wang, J
    Milne, D
    Pakalnis, R
    TRANSACTIONS OF THE INSTITUTION OF MINING AND METALLURGY SECTION A-MINING TECHNOLOGY, 2002, 111 : A73 - A81
  • [43] Interpolating paleovegetation data with an artificial neural network approach
    Grieger, B
    GLOBAL AND PLANETARY CHANGE, 2002, 34 (3-4) : 199 - 208
  • [44] Application of Neural Network in Seismic Risk Survey
    Zhou, Changxian
    Ye, Youquan
    Wang, Hao
    Fu, Ping
    JOURNAL OF ELECTRICAL SYSTEMS, 2024, 20 (02) : 1005 - 1011
  • [46] Energy consumption minimization on LoRaWAN sensor network by using an Artificial Neural Network based application
    Kromes, Roland
    Russo, Adrien
    Miramond, Benoit
    Verdier, Francois
    2019 IEEE SENSORS APPLICATIONS SYMPOSIUM (SAS), 2019,
  • [47] Application of RBF Neural Network in Network Security Risk Assessment
    Zheng, Zhao-na
    Sun, Peng
    2011 INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND APPLICATIONS, 2011, : 43 - 46
  • [48] Learning ability of interpolating deep convolutional neural networks
    Zhou, Tian-Yi
    Huo, Xiaoming
    APPLIED AND COMPUTATIONAL HARMONIC ANALYSIS, 2024, 68
  • [49] Excess risk bounds in robust empirical risk minimization
    Mathieu, Timothee
    Minsker, Stanislav
    INFORMATION AND INFERENCE-A JOURNAL OF THE IMA, 2021, 10 (04) : 1423 - 1490
  • [50] Insuring against the perils in distributed learning: privacy-preserving empirical risk minimization
    Owusu-Agyemang, Kwabena
    Qin, Zhen
    Benjamin, Appiah
    Xiong, Hu
    Qin, Zhiguang
    MATHEMATICAL BIOSCIENCES AND ENGINEERING, 2021, 18 (04) : 3006 - 3033