Assessing Local Generalization Capability in Deep Models

Cited by: 0
Authors
Wang, Huan [1]
Keskar, Nitish Shirish [1]
Xiong, Caiming [1]
Socher, Richard [1]
Affiliations
[1] Salesforce Research, Palo Alto, CA 94301, USA
Keywords
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
While it has not yet been proven, empirical evidence suggests that model generalization is related to local properties of the optima, which can be described via the Hessian. We connect model generalization with the local property of a solution under the PAC-Bayes paradigm. In particular, we prove that model generalization ability is related to the Hessian, the higher-order "smoothness" terms characterized by the Lipschitz constant of the Hessian, and the scales of the parameters. Guided by the proof, we propose a metric to score the generalization capability of a model, as well as an algorithm that optimizes the perturbed model accordingly.
Pages: 10
Related papers
50 in total
  • [41] Stacking ensemble with parsimonious base models to improve generalization capability in the characterization of steel bolted components
    Pernia-Espinoza, A.
    Fernandez-Ceniceros, J.
    Antonanzas, J.
    Urraca, R.
    Martinez-de-Pison, F. J.
    APPLIED SOFT COMPUTING, 2018, 70 : 737 - 750
  • [42] Generalization Capability of Neural Network Models for Temperature-Frequency Correlation Using Monitoring Data
    Ni, Y. Q.
    Zhou, H. F.
    Ko, J. M.
    JOURNAL OF STRUCTURAL ENGINEERING, 2009, 135 (10) : 1290 - 1300
  • [43] Enhancing the Generalization of Synthetic Image Detection Models through the Exploration of Features in Deep Detection Models
    Javaheri, Alireza Hajabdollah
    Motamednia, Hossein
    Mahmoudi-Azanveh, Ahmad
    PROCEEDINGS OF THE 13TH IRANIAN/3RD INTERNATIONAL MACHINE VISION AND IMAGE PROCESSING CONFERENCE, MVIP, 2024, : 199 - 204
  • [44] Assessing Local Influence in Restricted Regression Models
    Paula, G. A.
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 1993, 16 (01) : 63 - 79
  • [45] Assessing Bias and Fit of Global and Local Hazard Models
    Wu, L. L.
    Tuma, N. B.
    SOCIOLOGICAL METHODS & RESEARCH, 1991, 19 (03) : 354 - 387
  • [46] Assessing Deep Generative Models in Chemical Composition Space
    Tuerk, Hanna
    Landini, Elisabetta
    Kunkel, Christian
    Margraf, Johannes T.
    Reuter, Karsten
    CHEMISTRY OF MATERIALS, 2022, 34 (21) : 9455 - 9467
  • [47] Improving Generalization for Hyperspectral Image Classification: The Impact of Disjoint Sampling on Deep Models
    Ahmad, Muhammad
    Mazzara, Manuel
    Distefano, Salvatore
    Khan, Adil Mehmood
    Altuwaijri, Hamad Ahmed
    Computers, Materials and Continua, 2024, 81 (01) : 503 - 532
  • [48] Generalization Ability of Bagging and Boosting Type Deep Learning Models in Evapotranspiration Estimation
    Kumar, Manoranjan
    Agrawal, Yash
    Adamala, Sirisha
    Subbarao, A. V. M.
    Singh, V. K.
    Srivastava, Ankur
    WATER, 2024, 16 (16)
  • [49] Assessing the VANET's Local Information Storage Capability under Different Traffic Mobility
    Liu, Bojin
    Khorashadi, Behrooz
    Ghosal, Dipak
    Chuah, Chen-Nee
    Zhang, Michael H.
    2010 PROCEEDINGS IEEE INFOCOM, 2010,
  • [50] Transferring deep convolutional neural network models for generalization mapping of autumn crops
    Zhang F.
    Zhang J.
    Duan Y.
    Yang Z.
    National Remote Sensing Bulletin, 2024, 28 (03) : 661 - 676