Hypothesis Testing for Class-Conditional Label Noise

被引:0
|
作者
Poyiadzi, Rafael [1 ]
Yang, Weisong [1 ]
Twomey, Niall [1 ]
Santos-Rodriguez, Raul [1 ]
机构
[1] Univ Bristol, Bristol, Avon, England
关键词
CLASSIFICATION;
D O I
10.1007/978-3-031-26409-2_11
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper we aim to provide machine learning practitioners with tools to answer the question: have the labels in a dataset been corrupted? In order to simplify the problem, we assume the practitioner already has preconceptions on possible distortions that may have affected the labels, which allow us to pose the task as the design of hypothesis tests. As a first approach, we focus on scenarios where a given dataset of instance-label pairs has been corrupted with class-conditional label noise, as opposed to uniform label noise, with the former biasing learning, while the latter - under mild conditions - does not. While previous works explore the direct estimation of the noise rates, this is known to be hard in practice and does not offer a real understanding of how trustworthy the estimates are. These methods typically require anchor points - examples whose true posterior is either 0 or 1. Differently, in this paper we assume we have access to a set of anchor points whose true posterior is approximately 1/2. The proposed hypothesis tests are built upon the asymptotic properties of Maximum Likelihood Estimators for Logistic Regression models. We establish the main properties of the tests, including a theoretical and empirical analysis of the dependence of the power on the test on the training sample size, the number of anchor points, the difference of the noise rates and the use of relaxed anchors.
引用
收藏
页码:171 / 186
页数:16
相关论文
共 50 条
  • [21] Naive and Robust: Class-Conditional Independence in Human Classification Learning
    Jarecki, Jana B.
    Meder, Bjoern
    Nelson, Jonathan D.
    [J]. COGNITIVE SCIENCE, 2018, 42 (01) : 4 - 42
  • [22] Selection of Class-conditional Filters for Semantic Shifted OOD Detection
    Yu, Yeonguk
    Shin, Sungho
    Kim, Jongwon
    Lee, Kyoobin
    [J]. 2021 18TH INTERNATIONAL CONFERENCE ON UBIQUITOUS ROBOTS (UR), 2021, : 43 - 47
  • [23] Multinomial classification with class-conditional overlapping sparse feature groups
    Li, Xiangrui
    Zhu, Dongxiao
    Dong, Ming
    [J]. PATTERN RECOGNITION LETTERS, 2018, 101 : 37 - 43
  • [24] Class-conditional Importance Weighting for Deep Learning with Noisy Labels
    Nagarajan, Bhalaji
    Marques, Ricardo
    Mejia, Marcos
    Radeva, Petia
    [J]. PROCEEDINGS OF THE 17TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS (VISAPP), VOL 5, 2022, : 679 - 686
  • [25] CCGG: A Deep Autoregressive Model for Class-Conditional Graph Generation
    Ommi, Yassaman
    Yousefabadi, Matin
    Faez, Faezeh
    Sabour, Amirmojtaba
    Baghshah, Mahdieh Soleymani
    Rabiee, Hamid R.
    [J]. COMPANION PROCEEDINGS OF THE WEB CONFERENCE 2022, WWW 2022 COMPANION, 2022, : 1092 - 1098
  • [26] Unsupervised Keypoint Learning for Guiding Class-Conditional Video Prediction
    Kim, Yunji
    Nam, Seonghyeon
    Cho, In
    Kim, Seon Joo
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [27] Quantifying quality of class-conditional generative models in time series domain
    Koochali, Alireza
    Walch, Maria
    Thota, Sankrutyayan
    Schichtel, Peter
    Dengel, Andreas
    Ahmed, Sheraz
    [J]. APPLIED INTELLIGENCE, 2023, 53 (20) : 24530 - 24550
  • [28] Detection of underrepresented biological sequences using class-conditional distribution models
    Vucetic, S
    Pokrajac, D
    Xie, HB
    Obradovic, Z
    [J]. PROCEEDINGS OF THE THIRD SIAM INTERNATIONAL CONFERENCE ON DATA MINING, 2003, : 279 - 283
  • [29] Compact class-conditional domain invariant learning for multi-class domain adaptation
    Lee, Woojin
    Kim, Hoki
    Lee, Jaewook
    [J]. PATTERN RECOGNITION, 2021, 112
  • [30] Multiclass object recognition using class-conditional independent component analysis
    Bressan, M
    Guillamet, D
    Vitrià, J
    [J]. CYBERNETICS AND SYSTEMS, 2004, 35 (01) : 35 - 61