Beyond the Storage Capacity: Data-Driven Satisfiability Transition

被引:13
|
作者
Rotondo, Pietro [1 ,2 ]
Pastore, Mauro [1 ,2 ]
Gherardi, Marco [1 ,2 ]
机构
[1] Ist Nazl Fis Nucl, Sez Milano, Via Celoria 16, I-20133 Milan, Italy
[2] Univ Milan, Via Celoria 16, I-20133 Milan, Italy
基金
欧盟地平线“2020”;
关键词
Data structure has a dramatic impact on the properties of neural networks; yet its significance in the established theoretical frameworks is poorly understood. Here we compute the Vapnik-Chervonenkis entropy of a kernel machine operating on data grouped into equally labeled subsets. At variance with the unstructured scenario; entropy is nonmonotonic in the size of the training set; and displays an additional critical point besides the storage capacity. Remarkably; the same behavior occurs in margin classifiers even with randomly labeled data; as is elucidated by identifying the synaptic volume encoding the transition. These findings reveal aspects of expressivity lying beyond the condensed description provided by the storage capacity; and they indicate the path towards more realistic bounds for the generalization error of neural networks. © 2020 American Physical Society;
D O I
10.1103/PhysRevLett.125.120601
中图分类号
O4 [物理学];
学科分类号
0702 ;
摘要
Data structure has a dramatic impact on the properties of neural networks, yet its significance in the established theoretical frameworks is poorly understood. Hem we compute the Vapnik-Chervonenkis entropy of a kernel machine operating on data grouped into equally labeled subsets. At variance with the unstructured scenario, entropy is nonmonotonic in the size of the training set, and displays an additional critical point besides the storage capacity. Remarkably, the same behavior occurs in margin classifiers even with randomly labeled data, as is elucidated by identifying the synaptic volume encoding the transition. These findings reveal aspects of expressivity lying beyond the condensed description provided by the storage capacity, and they indicate the path towards more realistic bounds for the generalization error of neural networks.
引用
收藏
页数:6
相关论文
共 50 条
  • [1] A Data-Driven Approach for Building the Profile of Water Storage Capacity of Soils
    Zhou, Jiang
    Briciu-Burghina, Ciprian
    Regan, Fiona
    Ali, Muhammad Intizar
    SENSORS, 2023, 23 (12)
  • [2] Data-driven cellular capacity optimization
    Egbert, Robert
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2018, 256
  • [3] Data-Driven Design: Beyond A/B Testing
    Kumar, Ranjitha
    PROCEEDINGS OF THE 2019 CONFERENCE ON HUMAN INFORMATION INTERACTION AND RETRIEVAL (CHIIR'19), 2019, : 1 - 2
  • [4] Data-Driven Capacity Bidding for Frequency Regulation
    Li, Sen
    Qin, Junjie
    Shetty, Akhil
    Poolla, Kameshwar
    Varaiya, Pravin
    2019 AMERICAN CONTROL CONFERENCE (ACC), 2019, : 3901 - 3908
  • [5] Data-Driven Estimation of Capacity Upper Bounds
    Hager, Christian
    Agrell, Erik
    IEEE COMMUNICATIONS LETTERS, 2022, 26 (12) : 2939 - 2943
  • [6] Data-Driven Serverless Functions for Object Storage
    Sampe, Josep
    Sanchez-Artigas, Marc
    Garcia-Lopez, Pedro
    Paris, Gerard
    PROCEEDINGS OF THE 2017 INTERNATIONAL MIDDLEWARE CONFERENCE (MIDDLEWARE'17), 2017, : 121 - 133
  • [7] Structured Data Storage for Data-Driven Process Optimisation in Bioprinting
    Schmieg, Barbara
    Brandt, Nico
    Schnepp, Vera J.
    Radosevic, Luka
    Gretzinger, Sarah
    Selzer, Michael
    Hubbuch, Juergen
    APPLIED SCIENCES-BASEL, 2022, 12 (15):
  • [8] Data-Driven Tire Capacity Estimation With Experimental Verification
    Xu, Nan
    Hashemi, Ehsan
    Tang, Zepeng
    Khajepour, Amir
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (11) : 21569 - 21581
  • [9] Data-Driven Evaluation of Building Demand Response Capacity
    Jung, Deokwoo
    Krishna, Varun Badrinath
    Temple, William G.
    Yau, David K. Y.
    2014 IEEE INTERNATIONAL CONFERENCE ON SMART GRID COMMUNICATIONS (SMARTGRIDCOMM), 2014, : 541 - 547
  • [10] Data-Driven Capacity Planning for Vehicular Fog Computing
    Mao, Wencan
    Akgul, Ozgur Umut
    Mehrabi, Abbas
    Cho, Byungjin
    Xiao, Yu
    Yla-Jaaski, Antti
    IEEE INTERNET OF THINGS JOURNAL, 2022, 9 (15) : 13179 - 13194