PAC-Bayesian Contrastive Unsupervised Representation Learning

被引:0
|
作者
Nozawa, Kento [1 ,2 ]
Germain, Pascal [3 ]
Guedj, Benjamin [4 ,5 ]
机构
[1] Univ Tokyo, Tokyo, Japan
[2] RIKEN, Wako, Saitama, Japan
[3] Univ Laval, Quebec City, PQ, Canada
[4] Inria, Le Chesnay, France
[5] UCL, London, England
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Contrastive unsupervised representation learning (CURL) is the state-of-the-art technique to learn representations (as a set of features) from unlabelled data. While CURL has collected several empirical successes recently, theoretical understanding of its performance was still missing. In a recent work, Arora et al. (2019) provide the first generalisation bounds for CURL, relying on a Rademacher complexity. We extend their framework to the flexible PAC-Bayes setting, allowing us to deal with the non-iid setting. We present PAC-Bayesian generalisation bounds for CURL, which are then used to derive a new representation learning algorithm. Numerical experiments on real-life datasets illustrate that our algorithm achieves competitive accuracy, and yields nonvacuous generalisation bounds.
引用
下载
收藏
页码:21 / 30
页数:10
相关论文
共 50 条
  • [1] PAC-Bayesian Theory for Transductive Learning
    Begin, Luc
    Germain, Pascal
    Laviolette, Francois
    Roy, Jean-Francis
    ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 33, 2014, 33 : 105 - 113
  • [2] A PAC-Bayesian Bound for Lifelong Learning
    Pentina, Anastasia
    Lampert, Christoph H.
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 32 (CYCLE 2), 2014, 32 : 991 - 999
  • [3] PAC-Bayesian Inequalities for Martingales
    Seldin, Yevgeny
    Laviolette, Francois
    Cesa-Bianchi, Nicolo
    Shawe-Taylor, John
    Auer, Peter
    IEEE TRANSACTIONS ON INFORMATION THEORY, 2012, 58 (12) : 7086 - 7093
  • [4] PAC-Bayesian offline Meta-reinforcement learning
    Sun, Zheng
    Jing, Chenheng
    Guo, Shangqi
    An, Lingling
    APPLIED INTELLIGENCE, 2023, 53 (22) : 27128 - 27147
  • [5] PAC-Bayesian Learning with Asymmetric Cost (June 2011)
    Llorens, Ashley J.
    Wang, I-Jeng
    2011 IEEE STATISTICAL SIGNAL PROCESSING WORKSHOP (SSP), 2011, : 765 - 768
  • [6] Misclassification bounds for PAC-Bayesian sparse deep learning
    The Tien Mai
    Machine Learning, 2025, 114 (1)
  • [7] PAC-Bayesian offline Meta-reinforcement learning
    Zheng Sun
    Chenheng Jing
    Shangqi Guo
    Lingling An
    Applied Intelligence, 2023, 53 : 27128 - 27147
  • [8] Some PAC-Bayesian Theorems
    David A. McAllester
    Machine Learning, 1999, 37 : 355 - 363
  • [9] PAC-Bayesian generic chaining
    Audibert, JY
    Bousquet, O
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 16, 2004, 16 : 1125 - 1132
  • [10] PAC-Bayesian Collective Stability
    London, Ben
    Huang, Bert
    Taskar, Ben
    Getoor, Lise
    ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 33, 2014, 33 : 585 - 594