Hypothesis Testing for Class-Conditional Noise Using Local Maximum Likelihood

被引:0
|
作者
Yang, Weisong [1 ]
Poyiadzi, Rafael [2 ]
Twomey, Niall [1 ]
Santos-Rodriguez, Raul [1 ]
机构
[1] Univ Bristol, Bristol, England
[2] GSK, London, England
关键词
INTERNET; ACCURATE; HEALTH;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In supervised learning, automatically assessing the quality of the labels before any learning takes place remains an open research question. In certain particular cases, hypothesis testing procedures have been proposed to assess whether a given instance-label dataset is contaminated with class-conditional label noise, as opposed to uniform label noise. The existing theory builds on the asymptotic properties of the Maximum Likelihood Estimate for parametric logistic regression. However, the parametric assumptions on top of which these approaches are constructed are often too strong and unrealistic in practice. To alleviate this problem, in this paper we propose an alternative path by showing how similar procedures can be followed when the underlying model is a product of Local Maximum Likelihood Estimation that leads to more flexible nonparametric logistic regression models, which in turn are less susceptible to model misspecification. This different view allows for wider applicability of the tests by offering users access to a richer model class. Similarly to existing works, we assume we have access to anchor points which are provided by the users. We introduce the necessary ingredients for the adaptation of the hypothesis tests to the case of nonparametric logistic regression and empirically compare against the parametric approach presenting both synthetic and real-world case studies and discussing the advantages and limitations of the proposed approach.
引用
收藏
页码:21744 / 21752
页数:9
相关论文
共 50 条
  • [1] Hypothesis Testing for Class-Conditional Label Noise
    Poyiadzi, Rafael
    Yang, Weisong
    Twomey, Niall
    Santos-Rodriguez, Raul
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2022, PT III, 2023, 13715 : 171 - 186
  • [2] Latent Class-Conditional Noise Model
    Yao, Jiangchao
    Han, Bo
    Zhou, Zhihan
    Zhang, Ya
    Tsang, Ivor W.
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (08) : 9964 - 9980
  • [3] Class-Conditional Label Noise in Astroparticle Physics
    Bunse, Mirko
    Pfahler, Lukas
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES: APPLIED DATA SCIENCE AND DEMO TRACK, ECML PKDD 2023, PT VI, 2023, 14174 : 19 - 35
  • [4] Phylogeny estimation and hypothesis testing using maximum likelihood
    Huelsenbeck, JP
    Crandall, KA
    ANNUAL REVIEW OF ECOLOGY AND SYSTEMATICS, 1997, 28 : 437 - 466
  • [5] Improving naive Bayes using class-conditional ICA
    Bressan, M
    Vitrià, J
    ADVANCES IN ARTIFICIAL INTELLIGENCE - IBERAMIA 2002, PROCEEDINGS, 2002, 2527 : 1 - 10
  • [6] CCMN: A General Framework for Learning With Class-Conditional Multi-Label Noise
    Xie, Ming-Kun
    Huang, Sheng-Jun
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (01) : 154 - 166
  • [7] Species Distribution Modeling of Citizen Science Data as a Classification Problem with Class-Conditional Noise
    Hutchinson, Rebecca A.
    He, Liqiang
    Emerson, Sarah C.
    THIRTY-FIRST AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 4516 - 4523
  • [8] Multiclass object recognition using class-conditional independent component analysis
    Bressan, M
    Guillamet, D
    Vitrià, J
    CYBERNETICS AND SYSTEMS, 2004, 35 (01) : 35 - 61
  • [9] Detection of underrepresented biological sequences using class-conditional distribution models
    Vucetic, S
    Pokrajac, D
    Xie, HB
    Obradovic, Z
    PROCEEDINGS OF THE THIRD SIAM INTERNATIONAL CONFERENCE ON DATA MINING, 2003, : 279 - 283
  • [10] Classification with unknown class-conditional label noise on non-compact feature spaces
    Reeve, HenryW. J.
    Kaban, Ata
    CONFERENCE ON LEARNING THEORY, VOL 99, 2019, 99