Optimization and Bayes: A Trade-off for Overparameterized Neural Networks

被引:0
|
作者
Hu, Zhengmian [1 ]
Huang, Heng [1 ]
机构
[1] Univ Maryland, Dept Comp Sci, College Pk, MD 20740 USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper proposes a novel algorithm, Transformative Bayesian Learning (TransBL), which bridges the gap between empirical risk minimization (ERM) and Bayesian learning for neural networks. We compare ERM, which uses gradient descent to optimize, and Bayesian learning with importance sampling for their generalization and computational complexity. We derive the first algorithm-dependent PAC-Bayesian generalization bound for infinitely wide networks based on an exact KL divergence between the trained posterior distribution obtained by infinitesimal step size gradient descent and a Gaussian prior. Moreover, we show how to transform gradient-based optimization into importance sampling by incorporating a weight. While Bayesian learning has better generalization, it suffers from low sampling efficiency. Optimization methods, on the other hand, have good sampling efficiency but poor generalization. Our proposed algorithm TransBL enables a trade-off between generalization and sampling efficiency.
引用
收藏
页数:26
相关论文
共 50 条
  • [1] A trade-off formula in designing asymmetric neural networks
    Xiong, Daxing
    Zhao, Hong
    JOURNAL OF PHYSICS A-MATHEMATICAL AND THEORETICAL, 2013, 46 (43)
  • [2] ACCURACY-ROBUSTNESS TRADE-OFF FOR POSITIVELY WEIGHTED NEURAL NETWORKS
    Neacsu, Ana
    Pesquet, Jean-Christophe
    Burileanu, Corneliu
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 8389 - 8393
  • [3] A Dynamical View on Optimization Algorithms of Overparameterized Neural Networks
    Bu, Zhiqi
    Xu, Shiyun
    Chen, Kan
    24TH INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS (AISTATS), 2021, 130
  • [4] Trade-off studies in blackbox optimization
    Audet, Charles
    Dennis, J. E., Jr.
    Le Digabel, Sebastien
    OPTIMIZATION METHODS & SOFTWARE, 2012, 27 (4-5): : 613 - 624
  • [5] Runtime Configurable Deep Neural Networks for Energy-Accuracy Trade-off
    Tann, Hokchhay
    Hashemi, Soheil
    Bahar, R. Iris
    Reda, Sherief
    2016 INTERNATIONAL CONFERENCE ON HARDWARE/SOFTWARE CODESIGN AND SYSTEM SYNTHESIS (CODES+ISSS), 2016,
  • [6] Performance Metric Sensitivity Computation for Optimization and Trade-off Analysis in Wireless Networks
    Baras, John S.
    Tabatabaee, Vahid
    Papageorgiou, George
    Rentz, Nicolas
    GLOBECOM 2008 - 2008 IEEE GLOBAL TELECOMMUNICATIONS CONFERENCE, 2008,
  • [7] Universality of efficiency at unified trade-off optimization
    Zhang, Yanchao
    Huang, Chuankun
    Lin, Guoxing
    Chen, Jincan
    PHYSICAL REVIEW E, 2016, 93 (03)
  • [8] The efficiency/security trade-off in criminal networks
    Morselli, Carlo
    Giguere, Cynthia
    Petit, Katia
    SOCIAL NETWORKS, 2007, 29 (01) : 143 - 153
  • [9] Rate-Accuracy Trade-Off in Video Classification With Deep Convolutional Neural Networks
    Jubran, Mohammad
    Abbas, Alhabib
    Chadha, Aaron
    Andreopoulos, Yiannis
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2020, 30 (01) : 145 - 154
  • [10] Understanding the Energy vs. Adversarial Robustness Trade-Off in Deep Neural Networks
    Lee, Kyungmi
    Chandrakasan, Anantha P.
    2021 IEEE WORKSHOP ON SIGNAL PROCESSING SYSTEMS (SIPS 2021), 2021, : 46 - 51