Optimization and Bayes: A Trade-off for Overparameterized Neural Networks

被引:0
|
作者
Hu, Zhengmian [1 ]
Huang, Heng [1 ]
机构
[1] Univ Maryland, Dept Comp Sci, College Pk, MD 20740 USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper proposes a novel algorithm, Transformative Bayesian Learning (TransBL), which bridges the gap between empirical risk minimization (ERM) and Bayesian learning for neural networks. We compare ERM, which uses gradient descent to optimize, and Bayesian learning with importance sampling for their generalization and computational complexity. We derive the first algorithm-dependent PAC-Bayesian generalization bound for infinitely wide networks based on an exact KL divergence between the trained posterior distribution obtained by infinitesimal step size gradient descent and a Gaussian prior. Moreover, we show how to transform gradient-based optimization into importance sampling by incorporating a weight. While Bayesian learning has better generalization, it suffers from low sampling efficiency. Optimization methods, on the other hand, have good sampling efficiency but poor generalization. Our proposed algorithm TransBL enables a trade-off between generalization and sampling efficiency.
引用
收藏
页数:26
相关论文
共 50 条
  • [11] Understanding the Energy vs. Adversarial Robustness Trade-Off in Deep Neural Networks
    Lee, Kyungmi
    Chandrakasan, Anantha P.
    IEEE OPEN JOURNAL OF CIRCUITS AND SYSTEMS, 2021, 2 : 843 - 855
  • [12] The trade-off
    Rothschild, M
    COMMUNICATIONS NEWS, 2004, 41 (09): : 19 - 21
  • [13] RATE-ACCURACY TRADE-OFF IN VIDEO CLASSIFICATION WITH DEEP CONVOLUTIONAL NEURAL NETWORKS
    Abbas, Alhabib
    Jubran, Mohammad
    Chadha, Aaron
    Andreopoulos, Yiannis
    2018 25TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2018, : 793 - 797
  • [14] NO TRADE-OFF
    NICOLINI, M
    NATION, 1977, 224 (20) : 610 - 610
  • [15] TRADE-OFF
    MANKIW, NG
    NEW REPUBLIC, 1991, 204 (13) : 4 - 4
  • [16] Throughput-Delay Trade-Off for Cognitive Radio Networks: A Convex Optimization Perspective
    Hu, Hang
    Zhang, Hang
    Yu, Hong
    ABSTRACT AND APPLIED ANALYSIS, 2014,
  • [17] Online convex optimization in wireless networks and beyond: The feedback-performance trade-off
    Belmega, E. Veronica
    Mertikopoulos, Panayotis
    Negrel, Romain
    2022 20TH INTERNATIONAL SYMPOSIUM ON MODELING AND OPTIMIZATION IN MOBILE, AD HOC, AND WIRELESS NETWORKS (WIOPT 2022), 2022, : 298 - 305
  • [18] Uncertainty trade-off and disturbance trade-off for quantum measurements
    Srinivas, M. D.
    Mandayam, Prabha
    CURRENT SCIENCE, 2015, 109 (11): : 2044 - 2051
  • [19] Neural dynamics of the speed-accuracy trade-off
    Dominic Standage
    Da-Hui Wang
    Gunnar Blohm
    BMC Neuroscience, 15 (Suppl 1)
  • [20] On the neural implementation of the speed-accuracy trade-off
    Standage, Dominic
    Blohm, Gunnar
    Dorris, Michael C.
    FRONTIERS IN NEUROSCIENCE, 2014, 8