A boosting framework for positive-unlabeled learning

被引:0
|
作者
Zhao, Yawen [1 ]
Zhang, Mingzhe [1 ]
Zhang, Chenhao [1 ]
Chen, Weitong [2 ]
Ye, Nan [1 ]
Xu, Miao [1 ]
机构
[1] Univ Queensland, Brisbane, Qld, Australia
[2] Univ Adelaide, Adelaide, SA, Australia
基金
澳大利亚研究理事会;
关键词
Boosting; Weakly supervised learning; PU learning; Ensemble;
D O I
10.1007/s11222-024-10529-y
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Positive-unlabeled (PU) learning deals with binary classification problems where only positive and unlabeled data are available. In this paper, we introduce a novel boosting framework, Adaptive PU (AdaPU), for learning from PU data. AdaPU builds an ensemble of weak classifiers using weak learners tailored to PU data. We propose two main approaches for learning the weak classifiers: a direct loss minimization approach that learns weak classifiers to greedily minimize PU-data-based estimates of the exponential loss, specifically, the unbiased PU estimate and the non-negative PU estimate; and a constrained loss minimization approach that learns weak classifiers to greedily minimize the unbiased PU estimate of the exponential loss, subject to regularization constraints. The direct loss minimization approach, while natural and simple, often yields weak learners prone to overfitting or leads to computationally expensive algorithms. On the other hand, the constrained loss minimization approach can effectively alleviate overfitting and allow the design of efficient weak learners. In particular, we propose a tailored weak learner for the simple class of decision stumps, or one-level decision trees, which interestingly demonstrates strong performance in comparison to various other weak classifiers. Furthermore, we provide several theoretical results on the performance of AdaPU. We performed extensive experiments to evaluate the variants of AdaPU and various baseline algorithms. Our results demonstrate the effectiveness of the constrained loss minimization approach.
引用
收藏
页数:22
相关论文
共 50 条
  • [41] Biometric identity recognition based on contrastive positive-unlabeled learning
    Sun, Le
    Hua, Yiwen
    Muhammad, Ghulam
    JOURNAL OF INFORMATION SECURITY AND APPLICATIONS, 2024, 83
  • [42] Positive-unlabeled learning for coronary artery segmentation in CCTA images
    Chen, Fei
    Li, Sulei
    Wei, Chen
    Zhang, Yue
    Guo, Kaitai
    Zheng, Yang
    Cao, Feng
    Liang, Jimin
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2024, 87
  • [43] Positive-Unlabeled Learning with Non-Negative Risk Estimator
    Kiryo, Ryuichi
    Niu, Gang
    du Plessis, Marthinus C.
    Sugiyama, Masashi
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 30 (NIPS 2017), 2017, 30
  • [44] A flexible procedure for mixture proportion estimation in positive-unlabeled learning
    Lin, Zhenfeng
    Long, James P.
    STATISTICAL ANALYSIS AND DATA MINING, 2020, 13 (02) : 178 - 187
  • [45] Information-Theoretic Representation Learning for Positive-Unlabeled Classification
    Sakai, Tomoya
    Niu, Gang
    Sugiyama, Masashi
    NEURAL COMPUTATION, 2021, 33 (01) : 244 - 268
  • [46] A multi-task positive-unlabeled learning framework to predict secreted proteins in human body fluids
    Kai He
    Yan Wang
    Xuping Xie
    Dan Shao
    Complex & Intelligent Systems, 2024, 10 : 1319 - 1331
  • [47] A multi-task positive-unlabeled learning framework to predict secreted proteins in human body fluids
    He, Kai
    Wang, Yan
    Xie, Xuping
    Shao, Dan
    COMPLEX & INTELLIGENT SYSTEMS, 2024, 10 (01) : 1319 - 1331
  • [48] Unsupervised Body Hair Detection by Positive-Unlabeled Learning in Photoacoustic Image
    Kikkawa, Ryo
    Kajita, Hiroki
    Imanishi, Nobuaki
    Aiso, Sadakazu
    Bise, Ryoma
    2021 43RD ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE & BIOLOGY SOCIETY (EMBC), 2021, : 3349 - 3352
  • [49] Entropy Weight Allocation: Positive-unlabeled Learning via Optimal Transport
    Gu, Wen
    Zhang, Teng
    Jin, Hai
    PROCEEDINGS OF THE 2022 SIAM INTERNATIONAL CONFERENCE ON DATA MINING, SDM, 2022, : 37 - 45
  • [50] Positive-unlabeled learning for the prediction of conformational B-cell epitopes
    Jing Ren
    Qian Liu
    John Ellis
    Jinyan Li
    BMC Bioinformatics, 16