Beyond the Storage Capacity: Data-Driven Satisfiability Transition

Cited by: 13
Authors
Rotondo, Pietro [1, 2]
Pastore, Mauro [1, 2]
Gherardi, Marco [1, 2]
Affiliations
[1] Ist Nazl Fis Nucl, Sez Milano, Via Celoria 16, I-20133 Milan, Italy
[2] Univ Milan, Via Celoria 16, I-20133 Milan, Italy
Funding
European Union Horizon 2020
Copyright
© 2020 American Physical Society
DOI
10.1103/PhysRevLett.125.120601
Chinese Library Classification number
O4 [Physics]
Discipline classification code
0702
Abstract
Data structure has a dramatic impact on the properties of neural networks, yet its significance in the established theoretical frameworks is poorly understood. Here we compute the Vapnik-Chervonenkis entropy of a kernel machine operating on data grouped into equally labeled subsets. At variance with the unstructured scenario, entropy is nonmonotonic in the size of the training set, and displays an additional critical point besides the storage capacity. Remarkably, the same behavior occurs in margin classifiers even with randomly labeled data, as is elucidated by identifying the synaptic volume encoding the transition. These findings reveal aspects of expressivity lying beyond the condensed description provided by the storage capacity, and they indicate the path towards more realistic bounds for the generalization error of neural networks.
Pages: 6