Principled analytic classifier for positive-unlabeled learning via weighted integral probability metric

被引:0
|
作者
Yongchan Kwon
Wonyoung Kim
Masashi Sugiyama
Myunghee Cho Paik
机构
[1] Seoul National University,Department of Statistics
[2] RIKEN,Center for Advanced Intelligence Project
[3] The University of Tokyo,Department of Complexity Science and Engineering, Graduate School of Frontier Sciences
来源
Machine Learning | 2020年 / 109卷
关键词
Positive and unlabeled learning; Integral probability metric; Excess risk bound; Approximation error; Reproducing kernel Hilbert space;
D O I
暂无
中图分类号
学科分类号
摘要
We consider the problem of learning a binary classifier from only positive and unlabeled observations (called PU learning). Recent studies in PU learning have shown superior performance theoretically and empirically. However, most existing algorithms may not be suitable for large-scale datasets because they face repeated computations of a large Gram matrix or require massive hyperparameter optimization. In this paper, we propose a computationally efficient and theoretically grounded PU learning algorithm. The proposed PU learning algorithm produces a closed-form classifier when the hypothesis space is a closed ball in reproducing kernel Hilbert space. In addition, we establish upper bounds of the estimation error and the excess risk. The obtained estimation error bound is sharper than existing results and the derived excess risk bound has an explicit form, which vanishes as sample sizes increase. Finally, we conduct extensive numerical experiments using both synthetic and real datasets, demonstrating improved accuracy, scalability, and robustness of the proposed algorithm.
引用
收藏
页码:513 / 532
页数:19
相关论文
共 50 条
  • [21] Temporal Positive-unlabeled Learning for Biomedical Hypothesis Generation via Risk Estimation
    Akujuobi, Uchenna
    Chen, Jun
    Elhoseiny, Mohamed
    Spranger, Michael
    Zhang, Xiangliang
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
  • [22] Estimating Visited Stores Through Positive-Unlabeled Learning
    Shirai, Ryo
    Imai, Ryo
    Liew, Seng Pei
    Amagata, Daichi
    Takahashi, Tsubasa
    Hara, Takahiro
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, PT VII, DASFAA 2024, 2024, 14856 : 377 - 389
  • [23] A Positive-Unlabeled Metric Learning Framework for Document-Level Relation Extraction with Incomplete Labeling
    Wang, Ye
    Pan, Huazheng
    Zhang, Tao
    Wu, Wen
    Hu, Wenxin
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 17, 2024, : 19197 - 19205
  • [24] ROPU: A robust online positive-unlabeled learning algorithm
    Liang, Xijun
    Zhu, Kaili
    Xiao, An
    Wen, Ya
    Zhang, Kaili
    Wang, Suhang
    Jian, Ling
    KNOWLEDGE-BASED SYSTEMS, 2025, 309
  • [25] Construction of Fatigue Criteria Through Positive-Unlabeled Learning
    Coudray, Olivier
    Bristiel, Philippe
    Dinis, Miguel
    Keribin, Christine
    Pamphile, Patrick
    FATIGUE & FRACTURE OF ENGINEERING MATERIALS & STRUCTURES, 2025, 48 (01) : 101 - 117
  • [26] Positive-unlabeled learning for open set domain adaptation
    Loghmani, Mohammad Reza
    Vincze, Markus
    Tommasi, Tatiana
    PATTERN RECOGNITION LETTERS, 2020, 136 : 198 - 204
  • [27] Partial Optimal Transport with Applications on Positive-Unlabeled Learning
    Chapel, Laetitia
    Alaya, Mokhtar Z.
    Gasso, Gilles
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
  • [28] Spotting Fake Reviews using Positive-Unlabeled Learning
    Li, Huayi
    Liu, Bing
    Mukherjee, Arjun
    Shao, Jidong
    COMPUTACION Y SISTEMAS, 2014, 18 (03): : 467 - 475
  • [29] Theoretical Comparisons of Positive-Unlabeled Learning against Positive-Negative Learning
    Niu, Gang
    du Plessis, Marthinus C.
    Sakai, Tomoya
    Ma, Yao
    Sugiyama, Masashi
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 29 (NIPS 2016), 2016, 29
  • [30] Investigating Active Positive-Unlabeled Learning with Deep Networks
    Han, Kun
    Chen, Weitong
    Xu, Miao
    AI 2021: ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, 13151 : 607 - 618