On the role of data in PAC-Bayes bounds

Authors
Dziugaite, Gintare Karolina [1, 2]
Hsu, Kyle [3, 4]
Gharbieh, Waseem [1]
Arpino, Gabriel [3, 4]
Roy, Daniel M. [3, 4]
Affiliations
[1] Element AI ServiceNow, Montreal, PQ, Canada
[2] Mila, Montreal, PQ, Canada
[3] Univ Toronto, Toronto, ON, Canada
[4] Vector Inst, Toronto, ON, Canada
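Funding
Natural Sciences and Engineering Research Council of Canada;
Keywords
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104; 0812; 0835; 1405;
Abstract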
The dominant term in PAC-Bayes bounds is often the Kullback-Leibler divergence between the posterior and prior. For so-called linear PAC-Bayes risk bounds based on the empirical risk of a fixed posterior kernel, it is possible to minimize the expected value of the bound by choosing the prior to be the expected posterior, which we call the oracle prior because it is distribution dependent. In this work, we show that the bound based on the oracle prior can be suboptimal: in some cases, a stronger bound is obtained by using a data-dependent oracle prior, i.e., a conditional expectation of the posterior, given a subset of the training data that is then excluded from the empirical risk term. While using data to learn a prior is a known heuristic, its essential role in optimal bounds is new. In fact, we show that using data can mean the difference between vacuous and nonvacuous bounds. We apply this new principle in the setting of nonconvex learning, simulating data-dependent oracle priors on MNIST and Fashion MNIST with and without held-out data, and demonstrating new nonvacuous bounds in both cases.
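The claim that the expected posterior minimizes the expected KL term can be checked numerically on a toy problem. The sketch below is an illustration only, not the paper's experimental setup: it assumes a finite hypothesis set, a handful of equally likely training sets, and randomly generated posteriors, and every name in it (`kl`, `expected_kl`, `oracle_prior`, `random_distribution`) is hypothetical. It relies on the standard identity E[KL(Q || P)] = E[KL(Q || P*)] + KL(P* || P), where P* is the mixture of the posteriors, so any other prior P can only increase the expected KL.

```python
import math
import random


def kl(q, p):
    """KL(q || p) for discrete distributions given as lists of probabilities."""
    return sum(qi * math.log(qi / pi) for qi, pi in zip(q, p) if qi > 0)


def expected_kl(posteriors, weights, prior):
    """Average of KL(Q(S) || prior) over training sets S, weighted by P(S)."""
    return sum(w * kl(q, prior) for q, w in zip(posteriors, weights))


def oracle_prior(posteriors, weights):
    """Expected posterior: the mixture of the posteriors under the weights."""
    k = len(posteriors[0])
    return [sum(w * q[i] for q, w in zip(posteriors, weights)) for i in range(k)]


def random_distribution(k, rng):
    """Random strictly positive probability vector of length k (toy data)."""
    raw = [rng.random() + 1e-3 for _ in range(k)]
    total = sum(raw)
    return [r / total for r in raw]


rng = random.Random(0)

# Toy assumption: 4 equally likely training sets, each inducing a
# posterior over 5 hypotheses.
posteriors = [random_distribution(5, rng) for _ in range(4)]
weights = [0.25] * 4

p_star = oracle_prior(posteriors, weights)
best = expected_kl(posteriors, weights, p_star)

# Gap between the expected KL under 1000 alternative priors and under p_star.
candidates = [random_distribution(5, rng) for _ in range(1000)]
gaps = [expected_kl(posteriors, weights, p) - best for p in candidates]
min_gap = min(gaps)

# The gap for each candidate should equal KL(p_star || candidate).
identity_error = max(abs(g - kl(p_star, p)) for g, p in zip(gaps, candidates))

print("expected KL at oracle prior:", best)
print("smallest gap over alternative priors:", min_gap)
```

In this toy setting the smallest gap is nonnegative up to floating-point error, which matches the abstract's statement about the distribution-dependent oracle prior. The paper's further point, that conditioning the prior on a subset of the data can give a stronger bound once the empirical risk term is taken into account, is not captured by this sketch.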
Pages: 604 / +
Number of pages: 11