Unbiased Generative Semi-Supervised Learning

被引:0
|
作者
Fox-Roberts, Patrick [1 ]
Rosten, Edward [2 ]
机构
[1] Univ Cambridge, Dept Engn, Cambridge CB2 1PZ, England
[2] Comp Vis Consulting, London EC2A 4BX, England
基金
英国工程与自然科学研究理事会;
关键词
Kullback-Leibler; semi-supervised; asymptotic bounds; bias; generative model; UNLABELED SAMPLES; MODELS;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Reliable semi-supervised learning, where a small amount of labelled data is complemented by a large body of unlabelled data, has been a long-standing goal of the machine learning community. However, while it seems intuitively obvious that unlabelled data can aid the learning process, in practise its performance has often been disappointing. We investigate this by examining generative maximum likelihood semi-supervised learning and derive novel upper and lower bounds on the degree of bias introduced by the unlabelled data. These bounds improve upon those provided in previous work, and are specifically applicable to the challenging case where the model is unable to exactly fit to the underlying distribution a situation which is common in practise, but for which fewer guarantees of semi-supervised performance have been found. Inspired by this new framework for analysing bounds, we propose a new, simple reweighing scheme which provides a provably unbiased estimator for arbitrary model/distribution pairs-an unusual property for a semi-supervised algorithm. This reweighing introduces no additional computational complexity and can be applied to very many models. Additionally, we provide specific conditions demonstrating the circumstance under which the unlabelled data will lower the estimator variance, thereby improving convergence.
引用
收藏
页码:367 / 443
页数:77
相关论文
共 50 条
  • [41] Semi-Supervised Incremental Learning
    Bouchachia, Abdelhamid
    Prossegger, Markus
    Duman, Hakan
    [J]. 2010 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS (FUZZ-IEEE 2010), 2010,
  • [42] A Weak Coupling of Semi-Supervised Learning with Generative Adversarial Networks for Malware Classification
    Wang, Shuwei
    Wang, Qiuyun
    Jiang, Zhengwei
    Wang, Xuren
    Jing, Rongqi
    [J]. 2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 3775 - 3782
  • [43] Semi-Supervised Learning by Disagreement
    Zhou, Zhi-Hua
    [J]. 2008 IEEE INTERNATIONAL CONFERENCE ON GRANULAR COMPUTING, VOLS 1 AND 2, 2008, : 93 - 93
  • [44] Reliable Semi-supervised Learning
    Shao, Junming
    Huang, Chen
    Yang, Qinli
    Luo, Guangchun
    [J]. 2016 IEEE 16TH INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2016, : 1197 - 1202
  • [45] Deep Semi-Supervised Learning
    Hailat, Zeyad
    Komarichev, Artem
    Chen, Xue-Wen
    [J]. 2018 24TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2018, : 2154 - 2159
  • [46] Semi-supervised Learning with Transfer Learning
    Zhou, Huiwei
    Zhang, Yan
    Huang, Degen
    Li, Lishuang
    [J]. CHINESE COMPUTATIONAL LINGUISTICS AND NATURAL LANGUAGE PROCESSING BASED ON NATURALLY ANNOTATED BIG DATA, 2013, 8208 : 109 - 119
  • [47] Semi-supervised Learning with Flexible Discriminator Objective in Generative Adversarial Networks Framework
    Guo, Heng
    Wang, Wenqing
    Fan, Qifu
    Weng, Zhengxin
    [J]. 2018 37TH CHINESE CONTROL CONFERENCE (CCC), 2018, : 9238 - 9243
  • [48] Semi-Supervised Learning for Seismic Impedance Inversion Using Generative Adversarial Networks
    Wu, Bangyu
    Meng, Delin
    Zhao, Haixia
    [J]. REMOTE SENSING, 2021, 13 (05) : 1 - 17
  • [49] Semi-supervised learning with dropouts
    Abhishek
    Yadav, Rakesh Kumar
    Verma, Shekhar
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2023, 215
  • [50] PRIVILEGED SEMI-SUPERVISED LEARNING
    Chen, Xingyu
    Gong, Chen
    Ma, Chao
    Huang, Xiaolin
    Yang, Jie
    [J]. 2018 25TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2018, : 2999 - 3003