Optimization and Bayes: A Trade-off for Overparameterized Neural Networks

Cited by: 0
Authors
Hu, Zhengmian [1]
Huang, Heng [1]
Affiliations
[1] Univ Maryland, Dept Comp Sci, College Pk, MD 20740 USA
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
This paper proposes a novel algorithm, Transformative Bayesian Learning (TransBL), which bridges the gap between empirical risk minimization (ERM) and Bayesian learning for neural networks. We compare ERM, which uses gradient descent to optimize, and Bayesian learning with importance sampling for their generalization and computational complexity. We derive the first algorithm-dependent PAC-Bayesian generalization bound for infinitely wide networks based on an exact KL divergence between the trained posterior distribution obtained by infinitesimal step size gradient descent and a Gaussian prior. Moreover, we show how to transform gradient-based optimization into importance sampling by incorporating a weight. While Bayesian learning has better generalization, it suffers from low sampling efficiency. Optimization methods, on the other hand, have good sampling efficiency but poor generalization. Our proposed algorithm TransBL enables a trade-off between generalization and sampling efficiency.
Pages: 26
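
The abstract contrasts Bayesian learning via importance sampling (low sampling efficiency) with gradient-based ERM (high sampling efficiency). The sketch below illustrates only that contrast on a toy one-parameter linear model; it is not the paper's TransBL algorithm. The model, the standard normal prior, the noise level, and every function name are assumptions chosen for illustration. It draws weights from the prior, reweights them by the likelihood, and reports the effective sample size (ESS) as a rough measure of sampling efficiency, next to a plain gradient descent point estimate.

```python
import numpy as np

def log_likelihood(w, x, y, noise_std):
    # Gaussian log-likelihood of y = w * x + noise, up to an additive constant.
    # w has shape (S,), x and y have shape (N,); the result has shape (S,).
    resid = y[None, :] - w[:, None] * x[None, :]
    return -0.5 * np.sum(resid ** 2, axis=1) / noise_std ** 2

def importance_posterior_mean(x, y, noise_std, num_samples, rng):
    # Proposal = prior N(0, 1) (an assumption of this sketch), so the
    # unnormalized importance weight of each sample is its likelihood.
    w = rng.standard_normal(num_samples)
    log_wts = log_likelihood(w, x, y, noise_std)
    log_wts -= log_wts.max()              # stabilize before exponentiating
    wts = np.exp(log_wts)
    wts /= wts.sum()                      # self-normalized weights
    ess = 1.0 / np.sum(wts ** 2)          # effective sample size
    return float(np.sum(wts * w)), float(ess)

def erm_gradient_descent(x, y, steps=500, lr=0.1):
    # Plain gradient descent on the mean squared error, started at w = 0.
    w = 0.0
    for _ in range(steps):
        grad = -2.0 * np.mean((y - w * x) * x)
        w -= lr * grad
    return w

rng = np.random.default_rng(0)
noise_std = 0.5
x = rng.standard_normal(20)
y = 1.5 * x + noise_std * rng.standard_normal(20)

num_samples = 5000
is_mean, ess = importance_posterior_mean(x, y, noise_std, num_samples, rng)
# Closed-form posterior mean for a N(0, 1) prior and Gaussian likelihood.
exact_mean = float(np.sum(x * y) / (np.sum(x ** 2) + noise_std ** 2))
erm_w = erm_gradient_descent(x, y)

print(f"importance-sampling posterior mean: {is_mean:.3f} (exact {exact_mean:.3f})")
print(f"effective sample size: {ess:.0f} of {num_samples}")
print(f"gradient descent (ERM) estimate: {erm_w:.3f}")
```

In this toy setting the prior is much wider than the posterior, so only a fraction of the prior draws carry appreciable weight, while gradient descent reaches its single point estimate without discarding any computation.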