Optimization and Bayes: A Trade-off for Overparameterized Neural Networks

Cited by: 0
Authors
Hu, Zhengmian [1]
Huang, Heng [1]
Affiliations
[1] Univ Maryland, Dept Comp Sci, College Pk, MD 20740 USA
DOI
Not available
Chinese Library Classification
TP18 [Theory of artificial intelligence]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
This paper proposes a novel algorithm, Transformative Bayesian Learning (TransBL), which bridges the gap between empirical risk minimization (ERM) and Bayesian learning for neural networks. We compare ERM, which optimizes with gradient descent, and Bayesian learning with importance sampling in terms of generalization and computational complexity. We derive the first algorithm-dependent PAC-Bayesian generalization bound for infinitely wide networks, based on an exact KL divergence between the trained posterior distribution obtained by gradient descent with infinitesimal step size and a Gaussian prior. Moreover, we show how to transform gradient-based optimization into importance sampling by incorporating a weight. While Bayesian learning generalizes better, it suffers from low sampling efficiency. Optimization methods, on the other hand, have good sampling efficiency but generalize poorly. TransBL enables a trade-off between generalization and sampling efficiency.
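The low sampling efficiency of importance-sampling-based Bayesian learning mentioned in the abstract can be illustrated with a toy example. The sketch below is not TransBL and is not taken from the paper; it is a generic self-normalized importance sampler, assuming a hypothetical one-parameter linear model, a standard Gaussian prior used as the proposal, and made-up data. The function names and the noise level are illustrative assumptions.

```python
import math
import random


def log_likelihood(w, xs, ys, noise_std=0.5):
    """Gaussian log-likelihood (up to a constant) of the toy model y = w * x."""
    return sum(-0.5 * ((y - w * x) / noise_std) ** 2 for x, y in zip(xs, ys))


def importance_sample_posterior(xs, ys, n_samples=2000, prior_std=1.0, seed=0):
    """Draw weights from the Gaussian prior and reweight them by the likelihood."""
    rng = random.Random(seed)
    samples = [rng.gauss(0.0, prior_std) for _ in range(n_samples)]
    log_w = [log_likelihood(w, xs, ys) for w in samples]

    # Subtract the largest log-weight before exponentiating for numerical stability.
    max_log_w = max(log_w)
    unnormalized = [math.exp(lw - max_log_w) for lw in log_w]
    total = sum(unnormalized)
    weights = [u / total for u in unnormalized]

    posterior_mean = sum(wt * s for wt, s in zip(weights, samples))
    # Effective sample size: equals n_samples for uniform weights and
    # approaches 1 when a single sample carries almost all the weight.
    ess = 1.0 / sum(wt * wt for wt in weights)
    return posterior_mean, ess


# Hypothetical data, roughly following y = 2x.
xs = [0.5, 1.0, 1.5, 2.0]
ys = [1.1, 1.9, 3.2, 3.9]
mean, ess = importance_sample_posterior(xs, ys)
print(f"posterior mean ~ {mean:.3f}, effective sample size ~ {ess:.1f} of 2000")
```

Because most prior draws fall far from the region the data supports, only a small fraction of the samples contribute meaningfully to the estimate. That gap between the nominal and effective sample size is the kind of inefficiency the abstract contrasts with gradient-based optimization.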
Pages: 26