Hypothesis Testing for Class-Conditional Label Noise

Cited by: 0

Authors:

Poyiadzi, Rafael [1]

Yang, Weisong [1]

Twomey, Niall [1]

Santos-Rodriguez, Raul [1]

Affiliations:

[1] Univ Bristol, Bristol, Avon, England

Source:

MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2022, PT III | 2023 / Vol. 13715

Keywords:

CLASSIFICATION;

DOI:

10.1007/978-3-031-26409-2_11

Chinese Library Classification:

TP18 [Artificial Intelligence Theory];

Subject classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

In this paper we aim to provide machine learning practitioners with tools to answer the question: have the labels in a dataset been corrupted? To simplify the problem, we assume the practitioner already has preconceptions about the possible distortions that may have affected the labels, which allows us to pose the task as the design of hypothesis tests. As a first approach, we focus on scenarios where a given dataset of instance-label pairs has been corrupted with class-conditional label noise, as opposed to uniform label noise; the former biases learning, while the latter, under mild conditions, does not. Previous works explore the direct estimation of the noise rates, but this is known to be hard in practice and does not offer a real understanding of how trustworthy the estimates are. These methods typically require anchor points: examples whose true posterior is either 0 or 1. In contrast, in this paper we assume we have access to a set of anchor points whose true posterior is approximately 1/2. The proposed hypothesis tests are built upon the asymptotic properties of Maximum Likelihood Estimators for Logistic Regression models. We establish the main properties of the tests, including a theoretical and empirical analysis of the dependence of the power of the test on the training sample size, the number of anchor points, the difference of the noise rates, and the use of relaxed anchors.
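
The idea in the abstract can be illustrated with a minimal sketch. It is not the authors' exact procedure: it is a generic Wald test that relies on the asymptotic normality of the logistic regression MLE, checking whether the fitted log-odds at a single anchor point (true posterior assumed to be 1/2) differ from zero. The function names, the simulated data, and the noise rates `rho_pos` and `rho_neg` are assumptions made for illustration only.

```python
import math
import numpy as np

def fit_logistic_mle(X, y, n_iter=50, tol=1e-10):
    """Newton-Raphson MLE for logistic regression.

    Returns the estimate and the inverse Fisher information, which
    approximates the covariance of the estimate for large samples.
    """
    theta = np.zeros(X.shape[1])
    for _ in range(n_iter):
        p = 1.0 / (1.0 + np.exp(-X @ theta))
        grad = X.T @ (y - p)
        fisher = (X * (p * (1.0 - p))[:, None]).T @ X
        step = np.linalg.solve(fisher, grad)
        theta = theta + step
        if np.max(np.abs(step)) < tol:
            break
    p = 1.0 / (1.0 + np.exp(-X @ theta))
    fisher = (X * (p * (1.0 - p))[:, None]).T @ X
    return theta, np.linalg.inv(fisher)

def anchor_wald_test(X, y, anchor):
    """Two-sided Wald test of H0: fitted log-odds at the anchor equal 0.

    Under H0 (posterior 1/2 at the anchor, as with clean or uniformly
    noisy labels), z is approximately standard normal for large n.
    """
    theta, cov = fit_logistic_mle(X, y)
    z = float(anchor @ theta) / math.sqrt(float(anchor @ cov @ anchor))
    p_value = math.erfc(abs(z) / math.sqrt(2.0))  # equals 2 * (1 - Phi(|z|))
    return z, p_value

def simulate(n, rho_pos, rho_neg, seed=0):
    """Hypothetical data: intercept plus one Gaussian feature, with true
    posterior sigmoid(2 * x), so x = 0 is an anchor with posterior 1/2.
    Positive labels are flipped with rate rho_pos, negatives with rho_neg.
    """
    rng = np.random.default_rng(seed)
    X = np.column_stack([np.ones(n), rng.normal(size=n)])
    eta = 1.0 / (1.0 + np.exp(-2.0 * X[:, 1]))
    y = (rng.random(n) < eta).astype(float)
    flip = np.where(y == 1, rng.random(n) < rho_pos, rng.random(n) < rho_neg)
    return X, np.where(flip, 1.0 - y, y)

anchor = np.array([1.0, 0.0])  # intercept term, feature value 0
X_noisy, y_noisy = simulate(5000, rho_pos=0.4, rho_neg=0.0)
z, p_value = anchor_wald_test(X_noisy, y_noisy, anchor)
print(f"z = {z:.2f}, p-value = {p_value:.3g}")
```

With unequal noise rates the noisy posterior at the anchor shifts away from 1/2, so the statistic moves away from zero as the sample size grows. With equal rates the posterior at the anchor stays at 1/2 and the test should reject only at roughly its nominal level.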


Pages: 171-186

Page count: 16
