Unbiased Generative Semi-Supervised Learning

被引:0
|
作者
Fox-Roberts, Patrick [1 ]
Rosten, Edward [2 ]
机构
[1] Univ Cambridge, Dept Engn, Cambridge CB2 1PZ, England
[2] Comp Vis Consulting, London EC2A 4BX, England
基金
英国工程与自然科学研究理事会;
关键词
Kullback-Leibler; semi-supervised; asymptotic bounds; bias; generative model; UNLABELED SAMPLES; MODELS;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Reliable semi-supervised learning, where a small amount of labelled data is complemented by a large body of unlabelled data, has been a long-standing goal of the machine learning community. However, while it seems intuitively obvious that unlabelled data can aid the learning process, in practise its performance has often been disappointing. We investigate this by examining generative maximum likelihood semi-supervised learning and derive novel upper and lower bounds on the degree of bias introduced by the unlabelled data. These bounds improve upon those provided in previous work, and are specifically applicable to the challenging case where the model is unable to exactly fit to the underlying distribution a situation which is common in practise, but for which fewer guarantees of semi-supervised performance have been found. Inspired by this new framework for analysing bounds, we propose a new, simple reweighing scheme which provides a provably unbiased estimator for arbitrary model/distribution pairs-an unusual property for a semi-supervised algorithm. This reweighing introduces no additional computational complexity and can be applied to very many models. Additionally, we provide specific conditions demonstrating the circumstance under which the unlabelled data will lower the estimator variance, thereby improving convergence.
引用
收藏
页码:367 / 443
页数:77
相关论文
共 50 条
  • [31] Unbiased Subclass Regularization for Semi-Supervised Semantic Segmentation
    Guan, Dayan
    Huang, Jiaxing
    Xiao, Aoran
    Lu, Shijian
    [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 9958 - 9968
  • [32] Camouflaged Chinese Spam Content Detection with Semi-supervised Generative Active Learning
    Jiang, Zhuoren
    Gao, Zhe
    Duan, Yu
    Kang, Yangyang
    Sun, Changlong
    Zhang, Qiong
    Liu, Xiaozhong
    [J]. 58TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2020), 2020, : 3080 - 3085
  • [33] Semi-supervised generative adversarial learning for denoising adaptive optics retinal images
    Wang, Shidan
    Li, Kaiwen
    Yin, Qi
    Ren, Ji
    Zhang, Jie
    [J]. BIOMEDICAL OPTICS EXPRESS, 2024, 15 (03) : 1437 - 1452
  • [34] Label-Noise Robust Deep Generative Model for Semi-Supervised Learning
    Yoon, Heegeon
    Kim, Heeyoung
    [J]. TECHNOMETRICS, 2023, 65 (01) : 83 - 95
  • [35] Multimodal deep generative adversarial models for scalable doubly semi-supervised learning
    Du, Changde
    Du, Changying
    He, Huiguang
    [J]. INFORMATION FUSION, 2021, 68 : 118 - 130
  • [36] Semi-Supervised Learning with Generative Adversarial Networks on Digital Signal Modulation Classification
    Tu, Ya
    Lin, Yun
    Wang, Jin
    Kim, Jeong-Uk
    [J]. CMC-COMPUTERS MATERIALS & CONTINUA, 2018, 55 (02): : 243 - 254
  • [37] Semi-supervised Sequence Learning
    Dai, Andrew M.
    Le, Quoc V.
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 28 (NIPS 2015), 2015, 28
  • [38] Semi-supervised learning by disagreement
    Zhou, Zhi-Hua
    Li, Ming
    [J]. KNOWLEDGE AND INFORMATION SYSTEMS, 2010, 24 (03) : 415 - 439
  • [39] A survey on semi-supervised learning
    Jesper E. van Engelen
    Holger H. Hoos
    [J]. Machine Learning, 2020, 109 : 373 - 440
  • [40] A Weak Coupling of Semi-Supervised Learning with Generative Adversarial Networks for Malware Classification
    Wang, Shuwei
    Wang, Qiuyun
    Jiang, Zhengwei
    Wang, Xuren
    Jing, Rongqi
    [J]. 2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 3775 - 3782