A Theoretical-Empirical Approach to Estimating Sample Complexity of DNNs

被引:1
|
作者
Bisla, Devansh [1 ]
Saridena, Apoorva Nandini [1 ]
Choromanska, Anna [1 ]
机构
[1] NYU, Tandon Sch Engn, Dept Elect & Comp Engn, New York, NY 10012 USA
关键词
BOUNDS; NUMBER;
D O I
10.1109/CVPRW53098.2021.00365
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper focuses on understanding how the generalization error scales with the amount of the training data for deep neural networks (DNNs). Existing techniques in statistical learning theory require a computation of capacity measures, such as VC dimension, to provably bound this error. It is however unclear how to extend these measures to DNNs and therefore the existing analyses are applicable to simple neural networks, which are not used in practice, e.g., linear or shallow (at most two-layer) ones or otherwise multi-layer perceptrons. Moreover many theoretical error bounds are not empirically verifiable. In this paper we derive estimates of the generalization error that hold for deep networks and do not rely on unattainable capacity measures. The enabling technique in our approach hinges on two major assumptions: i) the network achieves zero training error, ii) the probability of making an error on a test point is proportional to the distance between this point and its nearest training point in the feature space and at certain maximal distance (that we call radius) it saturates. Based on these assumptions we estimate the generalization error of DNNs. The obtained estimate scales as O (1/delta N-1/d), where N is the size of the training data, and is parameterized by two quantities, the effective dimensionality of the data as perceived by the network (d) and the aforementioned radius (delta), both of which we find empirically. We show that our estimates match with the experimentally-obtained behavior of the error on multiple learning tasks using benchmark data-sets and realistic models. Estimating training data requirements is essential for deployment of safety critical applications such as autonomous driving, medical diagnostics etc. Furthermore, collecting and annotating training data requires a huge amount of financial, computational and human resources. Our empirical estimates will help to efficiently allocate resources.
引用
收藏
页码:3264 / 3274
页数:11
相关论文
共 50 条
  • [21] MONETARY AND FINANCIAL POLICY OF UKRAINE: THEORETICAL-EMPIRICAL CONNECTIONS AND PRIORITIES OF STATE REGULATION
    Vasyltsiv, T. H.
    Klipkova, O., I
    Lupak, R. L.
    Mitsenko, N. G.
    Mishchuk, I. P.
    [J]. FINANCIAL AND CREDIT ACTIVITY-PROBLEMS OF THEORY AND PRACTICE, 2019, 4 (31): : 320 - 330
  • [22] A Theoretical-Empirical Analysis on the Initial Dissolution Rate of Drugs from Polydispersed Particles
    Takano, Ryusuke
    Takata, Noriyuki
    Shiraki, Koji
    Higo, Shoichi
    Hayashi, Yoshiki
    Yamashita, Shinji
    [J]. BIOLOGICAL & PHARMACEUTICAL BULLETIN, 2009, 32 (11) : 1885 - 1891
  • [23] Establishment and application of theoretical-empirical prediction model of VMA in hot mix asphalt mixture
    Zhang, Huiqin
    Ma, Zhenghao
    Ji, Ping
    Bi, Yufeng
    Cao, Weidong
    Liu, Shutang
    [J]. CASE STUDIES IN CONSTRUCTION MATERIALS, 2023, 18
  • [24] T. Parsons' "SocietalCommunity" inG. Sciortino's Theoretical-empirical Interpretation
    Trotsu, Irina
    [J]. SOCIOLOGICESKOE OBOZRENIE, 2024, 23 (02): : 204 - 230
  • [25] On the Sample Complexity of Estimating Small Singular Modes
    Xu, Xiangxiang
    Wang, Weida
    Huang, Shao-Lun
    [J]. 2020 IEEE INTERNATIONAL SYMPOSIUM ON INFORMATION THEORY (ISIT), 2020, : 1189 - 1194
  • [26] The growth rebound effect: A theoretical-empirical investigation into the relation between rebound effects and economic growth
    Lange, Steffen
    Berner, Anne
    [J]. JOURNAL OF CLEANER PRODUCTION, 2022, 371
  • [27] Communication as a Challenge. A theoretical-empirical Study on Coach-Athlete Communication in Elite Sports
    Koenig, Stefan
    [J]. EUROPEAN JOURNAL FOR SPORT AND SOCIETY, 2015, 12 (04) : 421 - 426
  • [28] Transnational social spaces. A theoretical-empirical sketch based on Mexican-American labor migration
    Pries, L
    [J]. ZEITSCHRIFT FUR SOZIOLOGIE, 1996, 25 (06): : 456 - +
  • [29] Female delinquency.: Theoretical-empirical study to the problems of legal standards and value orientation of delinquent females
    Ehnertová, E
    [J]. SOCIOLOGICKY CASOPIS-CZECH SOCIOLOGICAL REVIEW, 2006, 42 (01): : 229 - 232
  • [30] Looking up and down and round and round: a theoretical-empirical, individual-level analysis of income comparisons
    Lehr, Alex
    [J]. SOCIO-ECONOMIC REVIEW, 2023, 22 (02) : 501 - 532