Classification from Positive, Unlabeled and Biased Negative Data

Authors
Hsieh, Yu-Guan [1]
Niu, Gang [2]
Sugiyama, Masashi [2, 3]
Affiliations
[1] Ecole Normale Super, Paris, France
[2] RIKEN, Tokyo, Japan
[3] Univ Tokyo, Tokyo, Japan
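Keywords
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104; 0812; 0835; 1405;
Abstract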
In binary classification, there are situations where negative (N) data are too diverse to be fully labeled and we often resort to positive-unlabeled (PU) learning in these scenarios. However, collecting a non-representative N set that contains only a small portion of all possible N data can often be much easier in practice. This paper studies a novel classification framework which incorporates such biased N (bN) data in PU learning. We provide a method based on empirical risk minimization to address this PUbN classification problem. Our approach can be regarded as a novel example-weighting algorithm, with the weight of each example computed through a preliminary step that draws inspiration from PU learning. We also derive an estimation error bound for the proposed method. Experimental results demonstrate the effectiveness of our algorithm in not only PUbN learning scenarios but also ordinary PU learning scenarios on several benchmark datasets.
Pages: 10