Generalization Error in Deep Learning

Cited by: 49
Authors
Jakubovitz, Daniel [1]
Giryes, Raja [1]
Rodrigues, Miguel R. D. [2]
Affiliations
[1] Tel Aviv Univ, Sch Elect Engn, Tel Aviv, Israel
[2] UCL, Dept Elect & Elect Engn, London, England
Keywords
SAMPLE COMPLEXITY; SPARSE;
DOI
10.1007/978-3-319-73074-5_5
Chinese Library Classification
O29 [Applied Mathematics];
Discipline classification code
070104;
Abstract
Deep learning models have recently shown strong performance in fields such as computer vision, speech recognition, speech translation, and natural language processing. Despite this state-of-the-art performance, the source of their generalization ability remains generally unclear. An important question is therefore what enables deep neural networks to generalize well from the training set to new data. In this chapter, we provide an overview of the existing theory and bounds that characterize the generalization error of deep neural networks, combining classical and more recent theoretical and empirical results.
Pages: 153-193
Page count: 41