Hypothesis Testing for Class-Conditional Label Noise

Authors
Poyiadzi, Rafael [1]
Yang, Weisong [1]
Twomey, Niall [1]
Santos-Rodriguez, Raul [1]
Affiliations
[1] Univ Bristol, Bristol, Avon, England
Keywords
CLASSIFICATION;
DOI
10.1007/978-3-031-26409-2_11
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
In this paper we aim to provide machine learning practitioners with tools to answer the question: have the labels in a dataset been corrupted? To simplify the problem, we assume the practitioner already has preconceptions about the possible distortions that may have affected the labels, which allows us to pose the task as the design of hypothesis tests. As a first approach, we focus on scenarios where a given dataset of instance-label pairs has been corrupted with class-conditional label noise, as opposed to uniform label noise; the former biases learning, while the latter, under mild conditions, does not. While previous works explore the direct estimation of the noise rates, this is known to be hard in practice and does not offer a real understanding of how trustworthy the estimates are. These methods typically require anchor points, i.e. examples whose true posterior is either 0 or 1. In contrast, in this paper we assume we have access to a set of anchor points whose true posterior is approximately 1/2. The proposed hypothesis tests are built upon the asymptotic properties of Maximum Likelihood Estimators for Logistic Regression models. We establish the main properties of the tests, including a theoretical and empirical analysis of the dependence of the power of the test on the training sample size, the number of anchor points, the difference of the noise rates and the use of relaxed anchors.
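The idea in the abstract can be illustrated with a minimal sketch. It is not the paper's exact test statistic; it is a plain Wald test under stated assumptions. If labels of class 1 are flipped with rate rho_pos and labels of class 0 with rate rho_neg (both names are hypothetical), the noisy posterior at an anchor with true posterior 1/2 is 1/2 + (rho_neg - rho_pos)/2, so under equal noise rates the fitted log-odds at the anchor should be close to zero. The sketch fits a logistic regression by Newton-Raphson, uses the inverse Fisher information as the asymptotic covariance of the estimator, and tests whether the log-odds at a single anchor differ from zero. The simulated data, the helper names, and the choice to ignore model misspecification under noise are all assumptions made for illustration.

```python
import numpy as np
from math import erfc, sqrt


def fit_logistic_mle(X, y, n_iter=100, tol=1e-10):
    """Newton-Raphson MLE for logistic regression.

    Returns the estimate and its asymptotic covariance
    (inverse of the Fisher information at the estimate).
    """
    theta = np.zeros(X.shape[1])
    for _ in range(n_iter):
        p = 1.0 / (1.0 + np.exp(-(X @ theta)))
        grad = X.T @ (y - p)                          # score vector
        info = (X * (p * (1.0 - p))[:, None]).T @ X   # Fisher information
        step = np.linalg.solve(info, grad)
        theta = theta + step
        if np.max(np.abs(step)) < tol:
            break
    p = 1.0 / (1.0 + np.exp(-(X @ theta)))
    info = (X * (p * (1.0 - p))[:, None]).T @ X
    return theta, np.linalg.inv(info)


def anchor_wald_test(theta, cov, anchor):
    """Two-sided Wald test of H0: log-odds at the anchor equal zero."""
    z = float(anchor @ theta) / sqrt(float(anchor @ cov @ anchor))
    p_value = erfc(abs(z) / sqrt(2.0))  # equals 2 * (1 - Phi(|z|))
    return z, p_value


# Simulated example (all values below are illustrative assumptions).
rng = np.random.default_rng(0)
n = 5000
X = np.column_stack([np.ones(n), rng.normal(size=n)])  # intercept + one feature
theta_true = np.array([0.0, 2.0])
y_clean = (rng.random(n) < 1.0 / (1.0 + np.exp(-(X @ theta_true)))).astype(float)

# Class-conditional noise: flip class 1 with rate rho_pos, class 0 with rho_neg.
rho_pos, rho_neg = 0.3, 0.0
flip = rng.random(n) < np.where(y_clean == 1.0, rho_pos, rho_neg)
y_noisy = np.where(flip, 1.0 - y_clean, y_clean)

# Anchor point: the true posterior is exactly 1/2 where the feature is 0.
anchor = np.array([1.0, 0.0])

theta_hat, cov_hat = fit_logistic_mle(X, y_noisy)
z_noisy, p_noisy = anchor_wald_test(theta_hat, cov_hat, anchor)
print("z statistic:", z_noisy, "p-value:", p_noisy)
```

A small p-value here suggests that the two noise rates differ. With several anchor points, one would need to combine the evidence across anchors, and the abstract indicates that the number of anchors affects the power of the test; this sketch does not attempt that.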
Pages: 171-186
Number of pages: 16
Related papers
50 in total
  • [1] Class-Conditional Label Noise in Astroparticle Physics
    Bunse, Mirko
    Pfahler, Lukas
    [J]. MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES: APPLIED DATA SCIENCE AND DEMO TRACK, ECML PKDD 2023, PT VI, 2023, 14174 : 19 - 35
  • [2] Latent Class-Conditional Noise Model
    Yao, Jiangchao
    Han, Bo
    Zhou, Zhihan
    Zhang, Ya
    Tsang, Ivor W.
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (08) : 9964 - 9980
  • [3] CCMN: A General Framework for Learning With Class-Conditional Multi-Label Noise
    Xie, Ming-Kun
    Huang, Sheng-Jun
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (01) : 154 - 166
  • [4] Classification with unknown class-conditional label noise on non-compact feature spaces
    Reeve, Henry W. J.
    Kaban, Ata
    [J]. CONFERENCE ON LEARNING THEORY, VOL 99, 2019, 99
  • [5] Beyond Class-Conditional Assumption: A Primary Attempt to Combat Instance-Dependent Label Noise
    Chen, Pengfei
    Ye, Junjie
    Chen, Guangyong
    Zhao, Jingwei
    Heng, Pheng-Ann
    [J]. THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 11442 - 11450
  • [6] Learning a metric for class-conditional KNN
    Im, Daniel Jiwoong
    Taylor, Graham W.
    [J]. 2016 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2016, : 1932 - 1939
  • [7] Species Distribution Modeling of Citizen Science Data as a Classification Problem with Class-Conditional Noise
    Hutchinson, Rebecca A.
    He, Liqiang
    Emerson, Sarah C.
    [J]. THIRTY-FIRST AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 4516 - 4523
  • [8] Class-conditional domain adaptation for semantic segmentation
    Wang, Yue
    Li, Yuke
    Elder, James H.
    Wu, Runmin
    Lu, Huchuan
    [J]. COMPUTATIONAL VISUAL MEDIA, 2024, 10 (03) : 425 - 438
  • [9] Class-Conditional Conformal Prediction with Many Classes
    Ding, Tiffany
    Angelopoulos, Anastasios N.
    Bates, Stephen
    Jordan, Michael I.
    Tibshirani, Ryan J.
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [10] Quantification Under Class-Conditional Dataset Shift
    Spence, David
    Inskip, Christopher
    Quadrianto, Novi
    Weir, David
    [J]. PROCEEDINGS OF THE 2019 IEEE/ACM INTERNATIONAL CONFERENCE ON ADVANCES IN SOCIAL NETWORKS ANALYSIS AND MINING (ASONAM 2019), 2019, : 528 - 529