Optimization and Bayes: A Trade-off for Overparameterized Neural Networks

被引：0

作者：

Hu, Zhengmian ^{[1
]}

Huang, Heng ^{[1
]}

机构：

[1] Univ Maryland, Dept Comp Sci, College Pk, MD 20740 USA

来源：

ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023) | 2023年

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

This paper proposes a novel algorithm, Transformative Bayesian Learning (TransBL), which bridges the gap between empirical risk minimization (ERM) and Bayesian learning for neural networks. We compare ERM, which uses gradient descent to optimize, and Bayesian learning with importance sampling for their generalization and computational complexity. We derive the first algorithm-dependent PAC-Bayesian generalization bound for infinitely wide networks based on an exact KL divergence between the trained posterior distribution obtained by infinitesimal step size gradient descent and a Gaussian prior. Moreover, we show how to transform gradient-based optimization into importance sampling by incorporating a weight. While Bayesian learning has better generalization, it suffers from low sampling efficiency. Optimization methods, on the other hand, have good sampling efficiency but poor generalization. Our proposed algorithm TransBL enables a trade-off between generalization and sampling efficiency.

引用

页数：26

共 50 条

[41] Throughput-delay trade-off in wireless networks
El Gamal, A
Mammen, J
Prabhakar, B
Shah, D
IEEE INFOCOM 2004: THE CONFERENCE ON COMPUTER COMMUNICATIONS, VOLS 1-4, PROCEEDINGS, 2004, : 464 - 475
[42] Delay and Throughput Trade-Off in WiMAX Mesh Networks
Bastani, Saeed
Yousefi, Saleh
Mazoochi, Mojtaba
Ghiamatyoun, Alireza
PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON COMMUNICATION SOFTWARE AND NETWORKS, 2009, : 283 - +
[43] On the Throughput-Delay Trade-off in Georouting Networks
Jacquet, Philippe
Malik, Salman
Mans, Bernard
Silva, Alonso
2012 PROCEEDINGS IEEE INFOCOM, 2012, : 765 - 773
[44] Trade-off and Optimization of Supply Chain Carbon Management
Wang Yunqi
ENTERPRISE GROWS IN SUSTAINING EFFICIENCY AND EFFECTIVENESS: 2010 INTERNATIONAL CONFERENCE ON THE DEVELOPMENT OF SMALL AND MEDIUM-SIZED ENTERPRISES, 2010, : 221 - 226
[45] Modeling and Optimization Trade-off in Meta-learning
Gao, Katelyn
Sener, Ozan
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
[46] Analyzing the Energy-Latency-Area-Accuracy Trade-off Across Contemporary Neural Networks
Jain, Vikram
Mei, Linyan
Verhelst, Marian
2021 IEEE 3RD INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE CIRCUITS AND SYSTEMS (AICAS), 2021,
[47] Dynamic Energy-Accuracy Trade-off Using Stochastic Computing in Deep Neural Networks
Kim, Kyounghoon
Kim, Jungki
Yu, Joonsang
Seo, Jungwoo
Lee, Jongeun
Choi, Kiyoung
2016 ACM/EDAC/IEEE DESIGN AUTOMATION CONFERENCE (DAC), 2016,
[48] Trade-off analysis using synthetic training data for neural networks in the automotive development process
Pfeffer, Raphael
Bredow, Kai
Sax, Eric
2019 IEEE INTELLIGENT TRANSPORTATION SYSTEMS CONFERENCE (ITSC), 2019, : 4115 - 4120
[49] Optimized artificial neural network assisted trade-off between transmission and delay in LTE networks
Shanthi, D. L.
Arumugam, K.
Swamy, V. M. M.
Farithkhan, A.
Manikandan, R.
Saravanan, D.
MATERIALS TODAY-PROCEEDINGS, 2022, 56 : 1790 - 1794
[50] Can we have it all? On the Trade-off between Spatial and Adversarial Robustness of Neural Networks
Kamath, Sandesh
Deshpande, Amit
Subrahmanyam, K. V.
Balasubramanian, Vineeth N.
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34

← 1 2 3 4 5 →