To understand double descent, we need to understand VC theory

被引:3
|
作者
Cherkassky, Vladimir [1 ]
Lee, Eng Hock [1 ]
机构
[1] Univ Minnesota Twin Cities, Dept Elect & Comp Engn, Minneapolis, MN 55455 USA
基金
美国国家科学基金会;
关键词
Double descent; Deep learning; Complexity control; Structural risk minimization; VC-dimension; VC-generalization bounds; NEURAL-NETWORKS; BOUNDS; DIMENSION;
D O I
10.1016/j.neunet.2023.10.014
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We analyze generalization performance of over-parameterized learning methods for classification, under VC theoretical framework. Recently, practitioners in Deep Learning discovered 'double descent' phenomenon, when large networks can fit perfectly available training data, and at the same time, achieve good generalization for future (test) data. The current consensus view is that VC-theoretical results cannot account for good generalization performance of Deep Learning networks. In contrast, this paper shows that double descent can be explained by VC-theoretical concepts, such as VC-dimension and Structural Risk Minimization. We also present empirical results showing that double descent generalization curves can be accurately modeled using classical VC-generalization bounds. Proposed VC-theoretical analysis enables better understanding of generalization curves for data sets with different statistical characteristics, such as low vs high-dimensional data and noisy data. In addition, we analyze generalization performance of transfer learning using pre-trained Deep Learning networks.
引用
收藏
页码:242 / 256
页数:15
相关论文
共 50 条
  • [41] We need to understand why viral infections lead to acute asthma
    Wark, Peter A. B.
    EUROPEAN RESPIRATORY JOURNAL, 2022, 60 (01)
  • [42] Do We Need to Understand Social Robots to Grant Them Rights?
    Gerdes, Anne
    CULTURALLY SUSTAINABLE SOCIAL ROBOTICS, 2020, 335 : 575 - 577
  • [44] Why do we need supercomputers to understand the electrocardiographic T wave?
    Potse, Mark
    LeBlanc, A. -Robert
    Vinet, Alain
    ANADOLU KARDIYOLOJI DERGISI-THE ANATOLIAN JOURNAL OF CARDIOLOGY, 2007, 7 : 123 - 124
  • [45] Malperfusion in aortic dissections: a game of pressures we need to understand better
    Tsilimparis, Nikolaos
    Spath, Paolo
    JOURNAL OF CARDIOVASCULAR SURGERY, 2022, 63 (02): : 115 - 116
  • [46] Why we need to see the dark matter to understand the dark energy
    Kunz, Martin
    2007 EUROPHYSICS CONFERENCE ON HIGH ENERGY PHYSICS, PTS 1-12, 2008, 110
  • [47] Covid-19: We need to understand the risks to tackle them
    Godlee, Fiona
    BMJ-BRITISH MEDICAL JOURNAL, 2020, 371
  • [48] Titin cardiomyopathy: why we need to go big to understand the giant
    Tayal, Upasana
    Prasad, Sanjay K.
    EUROPEAN HEART JOURNAL, 2018, 39 (10) : 874 - 875
  • [49] Why we need to understand the mechanics of developing cardiac sarcomeres in humans
    Solaro, R. John
    JOURNAL OF PHYSIOLOGY-LONDON, 2016, 594 (02): : 255 - 255
  • [50] THE NEED TO UNDERSTAND SYSTEMS OF SYSTEMS
    Siemieniuch, C. E.
    Sinclair, M. A.
    Henshaw, M. J. de C.
    CONTEMPORARY ERGONOMICS AND HUMAN FACTORS 2013, 2013, : 211 - 218