A PAC-Bayesian Analysis of Randomized Learning with Application to Stochastic Gradient Descent

被引:0
|
作者
London, Ben [1 ]
机构
[1] Amazon AI, Seattle, WA 98109 USA
关键词
STABILITY;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We study the generalization error of randomized learning algorithms-focusing on stochastic gradient descent (SGD)-using a novel combination of PAC-Bayes and algorithmic stability. Importantly, our generalization bounds hold for all posterior distributions on an algorithm's random hyperparameters, including distributions that depend on the training data. This inspires an adaptive sampling algorithm for SGD that optimizes the posterior at runtime. We analyze this algorithm in the context of our generalization bounds and evaluate it on a benchmark dataset. Our experiments demonstrate that adaptive sampling can reduce empirical risk faster than uniform sampling while also improving out-of-sample accuracy.
引用
收藏
页数:10
相关论文
共 50 条
  • [1] PAC-Bayesian stochastic model selection
    McAllester, DA
    [J]. MACHINE LEARNING, 2003, 51 (01) : 5 - 21
  • [2] PAC-Bayesian Stochastic Model Selection
    David A. McAllester
    [J]. Machine Learning, 2003, 51 : 5 - 21
  • [3] PAC-Bayesian Theory for Transductive Learning
    Begin, Luc
    Germain, Pascal
    Laviolette, Francois
    Roy, Jean-Francis
    [J]. ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 33, 2014, 33 : 105 - 113
  • [4] A PAC-Bayesian Bound for Lifelong Learning
    Pentina, Anastasia
    Lampert, Christoph H.
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 32 (CYCLE 2), 2014, 32 : 991 - 999
  • [5] On the Importance of Gradient Norm in PAC-Bayesian Bounds
    Gat, Itai
    Adi, Yossi
    Schwing, Alexander
    Hazan, Tamir
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35, NEURIPS 2022, 2022,
  • [6] PAC-Bayesian theory for stochastic LTI systems
    Eringis, Deividas
    Leth, John
    Tan, Zheng-Hua
    Wisniewski, Rafal
    Esfahani, Alireza Fakhrizadeh
    Petreczky, Mihaly
    [J]. 2021 60TH IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2021, : 6626 - 6633
  • [7] PAC-Bayesian Contrastive Unsupervised Representation Learning
    Nozawa, Kento
    Germain, Pascal
    Guedj, Benjamin
    [J]. CONFERENCE ON UNCERTAINTY IN ARTIFICIAL INTELLIGENCE (UAI 2020), 2020, 124 : 21 - 30
  • [8] PAC-Bayesian Bounds for Randomized Empirical Risk Minimizers
    Alquier, P.
    [J]. MATHEMATICAL METHODS OF STATISTICS, 2008, 17 (04) : 279 - 304
  • [9] Wide stochastic networks: Gaussian limit and PAC-Bayesian training
    Clerico, Eugenio
    Deligiannidis, George
    Doucet, Arnaud
    [J]. INTERNATIONAL CONFERENCE ON ALGORITHMIC LEARNING THEORY, VOL 201, 2023, 201 : 447 - 470
  • [10] PAC-Bayesian offline Meta-reinforcement learning
    Sun, Zheng
    Jing, Chenheng
    Guo, Shangqi
    An, Lingling
    [J]. APPLIED INTELLIGENCE, 2023, 53 (22) : 27128 - 27147