Assessing Local Generalization Capability in Deep Models

Authors
Wang, Huan [1]
Keskar, Nitish Shirish [1]
Xiong, Caiming [1]
Socher, Richard [1]
Affiliations
[1] Salesforce Res, Palo Alto, CA 94301 USA
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
While it has not yet been proven, empirical evidence suggests that model generalization is related to local properties of the optima, which can be described via the Hessian. We connect model generalization with the local property of a solution under the PAC-Bayes paradigm. In particular, we prove that model generalization ability is related to the Hessian, the higher-order "smoothness" terms characterized by the Lipschitz constant of the Hessian, and the scales of the parameters. Guided by the proof, we propose a metric to score the generalization capability of a model, as well as an algorithm that optimizes the perturbed model accordingly.
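The abstract ties generalization to the Hessian at a solution. As a rough illustration only, the sketch below estimates the Hessian of two toy losses by finite differences and compares their traces as a simple curvature proxy. This is not the metric proposed in the paper, which according to the abstract also involves the Lipschitz constant of the Hessian and the parameter scales; the function names `hessian_fd`, `trace`, `sharp_loss`, and `flat_loss` are hypothetical and chosen for this example.

```python
def hessian_fd(f, w, eps=1e-3):
    """Central finite-difference estimate of the Hessian of scalar f at w."""
    n = len(w)
    H = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(n):
            def shifted(di, dj):
                # Evaluate f with coordinate i moved by di and coordinate j by dj.
                v = list(w)
                v[i] += di
                v[j] += dj
                return f(v)
            H[i][j] = (
                shifted(eps, eps) - shifted(eps, -eps)
                - shifted(-eps, eps) + shifted(-eps, -eps)
            ) / (4.0 * eps * eps)
    return H


def trace(H):
    """Sum of the diagonal entries, used here as a crude curvature summary."""
    return sum(H[i][i] for i in range(len(H)))


# Two toy losses with the same minimizer (the origin) but different curvature.
def sharp_loss(w):
    return 50.0 * w[0] ** 2 + 50.0 * w[1] ** 2


def flat_loss(w):
    return 0.5 * w[0] ** 2 + 0.5 * w[1] ** 2


if __name__ == "__main__":
    w_star = [0.0, 0.0]
    print("sharp minimum, Hessian trace:", trace(hessian_fd(sharp_loss, w_star)))
    print("flat minimum, Hessian trace:", trace(hessian_fd(flat_loss, w_star)))
```

Both losses reach zero at the same point, yet the first has a much larger Hessian trace there. Curvature differences of this kind are what the empirical "flat versus sharp minima" observations mentioned in the abstract refer to.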
Pages: 10