AESNB: Active Example Selection with Naive Bayes Classifier for Learning from Imbalanced Biomedical Data

被引:11
|
作者
Lee, Min Su [1 ]
Rhee, Je-Keun [2 ]
Kim, Byoung-Hee [1 ]
Zhang, Byoung-Tak [1 ,2 ]
机构
[1] Seoul Natl Univ, Sch Comp Sci & Engn, Seoul, South Korea
[2] Seoul Natl Univ, Grad Program Bioinformat, Seoul, South Korea
关键词
active example selection; imbalanced data problem; naive Bayes classifier; resampling; cost-senseitive learning; PREDICTION; SMOTE;
D O I
10.1109/BIBE.2009.63
中图分类号
R318 [生物医学工程];
学科分类号
0831 ;
摘要
Various real-world biomedical classification tasks suffer from the imbalanced data problem which tends to make the prediction performance of some classes significantly decrease. In this paper, we present an active example selection method with naive Bayes classifier (AESNB) as a solution for the imbalanced data problem. The proposed method starts with a small balanced subset of training examples. A naive Bayes classifier is trained incrementally by actively selecting and adding informative examples regardless of the original class distribution. Informative examples are defined as examples that produce high error scores by the current classifier. We examined the performance of AESNB algorithm by using five imbalanced biomedical datasets. Our experimental results show that the naive Bayes classifier with our active example selection method achieves a competitive classification performance compared to the classifier with sampling or cost-sensitive methods.
引用
收藏
页码:15 / +
页数:2
相关论文
共 50 条
  • [31] Uncertainty Based Under-Sampling for Learning Naive Bayes Classifiers Under Imbalanced Data Sets
    Aridas, Christos K.
    Karlos, Stamatis
    Kanas, Vasileios G.
    Fazakis, Nikos
    Kotsiantis, Sotiris B.
    [J]. IEEE ACCESS, 2020, 8 : 2122 - 2133
  • [32] Robust Multiclass Classification for Learning from Imbalanced Biomedical Data
    Piyaphol Phoungphol
    [J]. Tsinghua Science and Technology, 2012, 17 (06) : 619 - 628
  • [33] Robust multiclass classification for learning from imbalanced biomedical data
    Phoungphol, Piyaphol
    Zhang, Yanqing
    Zhao, Yichuan
    [J]. Tsinghua Science and Technology, 2012, 17 (06) : 619 - 628
  • [34] Multicriteria Classifier Ensemble Learning for Imbalanced Data
    Wegier, Weronika
    Koziarski, Michal
    Wozniak, Micha
    Wegier, Weronika
    [J]. IEEE Access, 2022, 10 : 16807 - 16818
  • [35] Multicriteria Classifier Ensemble Learning for Imbalanced Data
    Wegier, Weronika
    Koziarski, Michal
    Wozniak, Micha
    [J]. IEEE ACCESS, 2022, 10 : 16807 - 16818
  • [36] An Adaptive Sampling Ensemble Classifier for Learning from Imbalanced Data Sets
    Geiler, Ordonez Jon
    Hong, Li
    Yue-Jian, Guo
    [J]. INTERNATIONAL MULTICONFERENCE OF ENGINEERS AND COMPUTER SCIENTISTS (IMECS 2010), VOLS I-III, 2010, : 513 - 517
  • [37] Learning naive Bayes for probability estimation by feature selection
    Jiang, Liangxiao
    Zhang, Harry
    [J]. ADVANCES IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2006, 4013 : 503 - 514
  • [38] Learning Naive Bayes Classifiers with Incomplete Data
    Leng, Cuiping
    Wang, Shuangcheng
    Wang, Hui
    [J]. 2009 INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND COMPUTATIONAL INTELLIGENCE, VOL IV, PROCEEDINGS, 2009, : 350 - +
  • [39] Naive Bayes Ensemble Learning Based on Oracle Selection
    Li, Kai
    Hao, Lifeng
    [J]. CCDC 2009: 21ST CHINESE CONTROL AND DECISION CONFERENCE, VOLS 1-6, PROCEEDINGS, 2009, : 665 - +
  • [40] A hybrid PSO with Naive Bayes classifier for disengagement detection in online learning
    Gopalakrishnan, T.
    Sengottuvelan, P.
    [J]. PROGRAM-ELECTRONIC LIBRARY AND INFORMATION SYSTEMS, 2016, 50 (02) : 215 - 224