Cost-Sensitive Learning with Noisy Labels

被引:0
|
作者
Natarajan, Nagarajan [1 ,6 ]
Dhillon, Inderjit S. [2 ]
Ravikumar, Pradeep [3 ]
Tewari, Ambuj [4 ,5 ]
机构
[1] Microsoft Res, Bangalore 560001, Karnataka, India
[2] Univ Texas Austin, Dept Comp Sci, Austin, TX 78701 USA
[3] Carnegie Mellon Univ, Machine Learning Dept, Pittsburgh, PA 15213 USA
[4] Univ Michigan, Dept Stat, Ann Arbor, MI 48109 USA
[5] Univ Michigan, Dept Elect Engn & Comp Sci, Ann Arbor, MI 48109 USA
[6] Univ Texas Austin, Austin, TX 78712 USA
关键词
class-conditional label noise; statistical consistency; cost-sensitive learning; CLASSIFICATION; PERCEPTRONS;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We study binary classification in the presence of class-conditional random noise, where the learner gets to see labels that are flipped independently with some probability, and where the flip probability depends on the class. Our goal is to devise learning algorithms that are efficient and statistically consistent with respect to commonly used utility measures. In particular, we look at a family of measures motivated by their application in domains where cost-sensitive learning is necessary (for example, when there is class imbalance). In contrast to most of the existing literature on consistent classification that are limited to the classical 0-1 loss, our analysis includes more general utility measures such as the AM measure (arithmetic mean of True Positive Rate and True Negative Rate). For this problem of cost-sensitive learning under class-conditional random noise, we develop two approaches that are based on suitably modifying surrogate losses. First, we provide a simple unbiased estimator of any loss, and obtain performance bounds for empirical utility maximization in the presence of i.i.d. data with noisy labels. If the loss function satisfies a simple symmetry condition, we show that using unbiased estimator leads to an efficient algorithm for empirical maximization. Second, by leveraging a reduction of risk minimization under noisy labels to classification with weighted 0-1 loss, we suggest the use of a simple weighted surrogate loss, for which we are able to obtain strong utility bounds. This approach implies that methods already used in practice, such as biased SVM and weighted logistic regression, are provably noise-tolerant. For two practically important measures in our family, we show that the proposed methods are competitive with respect to recently proposed methods for dealing with label noise in several benchmark data sets.
引用
收藏
页码:1 / 33
页数:33
相关论文
共 50 条
  • [1] Cost-Sensitive Learning
    Zhou, Zlii-Hua
    [J]. MODELING DECISIONS FOR ARTIFICIAL INTELLIGENCE, MDAI 2011, 2011, 6820 : 17 - 18
  • [2] Cost-Sensitive Learning to Rank
    McBride, Ryan
    Wang, Ke
    Ren, Zhouyang
    Li, Wenyuan
    [J]. THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 4570 - 4577
  • [3] Active Cost-Sensitive Learning
    Margineantu, Dragos D.
    [J]. 19TH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE (IJCAI-05), 2005, : 1622 - 1623
  • [4] Cost-sensitive learning of SVM for ranking
    Xu, Jun
    Cao, Yunbo
    Li, Hang
    Huang, Yalou
    [J]. MACHINE LEARNING: ECML 2006, PROCEEDINGS, 2006, 4212 : 833 - 840
  • [5] Active Learning for Cost-Sensitive Classification
    Krishnamurthy, Akshay
    Agarwal, Alekh
    Huang, Tzu-Kuo
    Daume, Hal, III
    Langford, John
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 70, 2017, 70
  • [6] Active Learning for Cost-Sensitive Classification
    Krishnamurthy, Akshay
    Agarwal, Alekh
    Huang, Tzu-Kuo
    Daume, Hal, III
    Langford, John
    [J]. JOURNAL OF MACHINE LEARNING RESEARCH, 2019, 20
  • [7] Cost-Sensitive Learning in Answer Extraction
    Wiegand, Michael
    Leidner, Jochen L.
    Klakow, Dietrich
    [J]. SIXTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, LREC 2008, 2008, : 711 - 714
  • [8] Adversarial Learning With Cost-Sensitive Classes
    Shen, Haojing
    Chen, Sihong
    Wang, Ran
    Wang, Xizhao
    [J]. IEEE TRANSACTIONS ON CYBERNETICS, 2023, 53 (08) : 4855 - 4866
  • [9] Robust SVM for Cost-Sensitive Learning
    Jiangzhang Gan
    Jiaye Li
    Yangcai Xie
    [J]. Neural Processing Letters, 2022, 54 : 2737 - 2758
  • [10] Cost-Sensitive Action Model Learning
    Rao, Dongning
    Jiang, Zhihua
    [J]. INTERNATIONAL JOURNAL OF UNCERTAINTY FUZZINESS AND KNOWLEDGE-BASED SYSTEMS, 2016, 24 (02) : 167 - 193