A Theoretical-Empirical Approach to Estimating Sample Complexity of DNNs

Cited by: 1
Authors
Bisla, Devansh [1]
Saridena, Apoorva Nandini [1]
Choromanska, Anna [1]
Affiliations
[1] NYU, Tandon Sch Engn, Dept Elect & Comp Engn, New York, NY 10012 USA
Keywords
BOUNDS; NUMBER;
DOI
10.1109/CVPRW53098.2021.00365
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
This paper focuses on understanding how the generalization error scales with the amount of training data for deep neural networks (DNNs). Existing techniques in statistical learning theory require the computation of capacity measures, such as the VC dimension, to provably bound this error. It is, however, unclear how to extend these measures to DNNs, and therefore the existing analyses apply only to simple neural networks that are not used in practice, e.g., linear or shallow (at most two-layer) ones, or otherwise multi-layer perceptrons. Moreover, many theoretical error bounds are not empirically verifiable. In this paper we derive estimates of the generalization error that hold for deep networks and do not rely on unattainable capacity measures. The enabling technique in our approach hinges on two major assumptions: i) the network achieves zero training error, and ii) the probability of making an error on a test point is proportional to the distance between this point and its nearest training point in the feature space, and it saturates at a certain maximal distance (which we call the radius). Based on these assumptions we estimate the generalization error of DNNs. The obtained estimate scales as $O\left(\frac{1}{\delta N^{1/d}}\right)$, where $N$ is the size of the training data, and is parameterized by two quantities, the effective dimensionality of the data as perceived by the network ($d$) and the aforementioned radius ($\delta$), both of which we find empirically. We show that our estimates match the experimentally obtained behavior of the error on multiple learning tasks using benchmark datasets and realistic models. Estimating training data requirements is essential for the deployment of safety-critical applications such as autonomous driving and medical diagnostics. Furthermore, collecting and annotating training data requires a huge amount of financial, computational, and human resources. Our empirical estimates will help to allocate these resources efficiently.
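The scaling stated in the abstract can be explored numerically. The sketch below is illustrative only and is not the authors' procedure: it assumes the estimate takes the exact form c / (delta * N^(1/d)), where the constant `c` is a hypothetical placeholder (the abstract gives only the big-O rate), and the function names `estimated_error`, `samples_needed`, and `fit_effective_dim` are invented for this example. It shows how the rate could be inverted to get a rough training-set size for a target error, and how an effective dimensionality might be read off the log-log slope of measured errors.

```python
import math


def estimated_error(n_train, d, delta, c=1.0):
    """Assumed error model c / (delta * N^(1/d)), capped at 1.

    c is a hypothetical constant; the abstract only states the O(.) rate.
    """
    return min(1.0, c / (delta * n_train ** (1.0 / d)))


def samples_needed(target_error, d, delta, c=1.0):
    """Smallest N for which the assumed model reaches target_error."""
    return math.ceil((c / (delta * target_error)) ** d)


def fit_effective_dim(sizes, errors):
    """Estimate d from (N, error) pairs via a least-squares line in log-log space.

    Under the assumed model, log(error) = const - (1/d) * log(N),
    so the fitted slope equals -1/d.
    """
    xs = [math.log(n) for n in sizes]
    ys = [math.log(e) for e in errors]
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var = sum((x - mean_x) ** 2 for x in xs)
    slope = cov / var
    return -1.0 / slope


if __name__ == "__main__":
    # Synthetic errors generated from the assumed model with d = 2, delta = 1.
    sizes = [10, 100, 1000, 10000]
    errors = [estimated_error(n, d=2, delta=1.0) for n in sizes]
    print(fit_effective_dim(sizes, errors))
    print(samples_needed(0.25, d=2, delta=1.0))
```

Because the required N grows as the d-th power of 1/(delta * target error) under this model, even a modest increase in the effective dimensionality sharply raises the amount of training data needed for the same target error.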
Pages: 3264-3274
Page count: 11