Classification using Dirichlet priors when the training data are mislabeled

被引:0
|
作者
Lynch, RS [1 ]
Willett, PK [1 ]
机构
[1] Naval Undersea Warfare Ctr, Newport, RI 02841 USA
关键词
D O I
10.1109/ICASSP.1999.761387
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
The average probability of error is used to demonstrate performance of a Bayesian classification test (referred to as the Combined Bayes Test (CBT)) given the training data of each class are mislabeled. The CBT combines the information in discrete training and test data to infer symbol probabilities, where a uniform Dirichlet prior (i.e., a noninformative prior of complete ignorance) is assumed for all classes. Using this prior it is shown how classification performance degrades when mislabeling exists in the training data, and this occurs with st severity that depends on the value of the mislabeling probabilities. However, an increase in the mislabeling probabilities are also shown to cause an increase in M* (i.e., the best quantization fineness). Further, even when the actual mislabeling probabilities are known by the CBT it is not possible to achieve the classification performance obtainable without mislabeling.
引用
收藏
页码:2973 / 2976
页数:4
相关论文
共 50 条
  • [21] Detection and Correction of Mislabeled Training Samples for Hyperspectral Image Classification
    Kang, Xudong
    Duan, Puhong
    Xiang, Xuanlin
    Li, Shutao
    Benediktsson, Jon Atli
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2018, 56 (10): : 5673 - 5686
  • [22] Multi-task learning for classification with Dirichlet process priors
    Department of Electrical and Computer Engineering, Duke University, Durham, NC 27708, United States
    不详
    J. Mach. Learn. Res., 2007, (35-63):
  • [23] Multi-task learning for classification with Dirichlet process priors
    Xue, Ya
    Liao, Xuejun
    Carin, Lawrence
    Krishnapuram, Balaji
    JOURNAL OF MACHINE LEARNING RESEARCH, 2007, 8 : 35 - 63
  • [24] Filtering mislabeled data for improving time series classification
    Pelletier, C.
    Valero, S.
    Inglada, J.
    Dedieu, G.
    Champion, N.
    2017 9TH INTERNATIONAL WORKSHOP ON THE ANALYSIS OF MULTITEMPORAL REMOTE SENSING IMAGES (MULTITEMP), 2017,
  • [25] Estimation of Dirichlet process priors with monotone missing data
    Yang, Lei
    Wu, Xianyi
    JOURNAL OF NONPARAMETRIC STATISTICS, 2013, 25 (04) : 787 - 807
  • [26] Cost-sensitive elimination of mislabeled training data
    Guan, Donghai
    Yuan, Weiwei
    Ma, Tinghuai
    Khattak, Asad Masood
    Chow, Francis
    INFORMATION SCIENCES, 2017, 402 : 170 - 181
  • [27] Towards a New Understanding of the Training of Neural Networks with Mislabeled Training Data
    Gish, Herbert
    Silovsky, Jan
    Sung, Man-Ling
    Siu, Man-Hung
    Hartmann, William
    Jiang, Zhuolin
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 8394 - 8398
  • [28] Contextual Learning in Ground-Penetrating Radar Data Using Dirichlet Process Priors
    Ratto, Christopher R.
    Morton, Kenneth D., Jr.
    Collins, Leslie M.
    Torrione, Peter A.
    DETECTION AND SENSING OF MINES, EXPLOSIVE OBJECTS, AND OBSCURED TARGETS XVI, 2011, 8017
  • [29] Comparison of MLP Cost Functions to Dodge Mislabeled Training Data
    Nieminen, Paavo
    Karkkainen, Tommi
    2010 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS IJCNN 2010, 2010,
  • [30] Bayesian multiple comparisons using Dirichlet process priors
    Gopalan, R
    Berry, DA
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1998, 93 (443) : 1130 - 1139