Maximizing AUC to learn weighted naive Bayes for imbalanced data classification

被引:16
|
作者
Kim, Taeheung [1 ]
Lee, Jong-Seok [1 ]
机构
[1] Sungkyunkwan Univ, Dept Ind Engn, Suwon 16419, South Korea
基金
新加坡国家研究基金会;
关键词
Class imbalance; Naive Bayes; Attribute weighting; Area under ROC; Nonlinear optimization; CLASSIFIERS; ALGORITHMS; AREA;
D O I
10.1016/j.eswa.2023.119564
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Imbalanced data classification is a challenging problem frequently encountered in many real-world applica-tions. Traditional classification algorithms are generally designed to maximize overall accuracy; therefore, their effectiveness tends to be impeded by imbalanced data. Similar to other traditional classifiers, naive Bayes (NB) sometimes fails at predicting minority instances owing to its sensitivity to class distribution. To cope with this challenge, we proposed RankOptAUC NB (RNB), a novel attribute weighting method for the NB. In the proposed method, learning a weighted NB classifier was formulated as a nonlinear optimization problem with the objective of maximizing the area under the ROC (AUC). The optimization formulation enabled the RNB method to select important variables by simply adding a regularization term to the objective function. We also provided theoretical evidence that, based on the AUC metric, the proposed method improved the performance of a weighted NB classifier. The results of numerical experiments conducted using 30 real-world datasets proved that the proposed scheme successfully determined the optimal attribute weights for imbalanced data classification.
引用
收藏
页数:17
相关论文
共 50 条
  • [1] Local resampling for locally weighted Naive Bayes in imbalanced data
    Saglam, Fatih
    Cengiz, Mehmet Ali
    [J]. COMPUTING, 2024, 106 (01) : 185 - 200
  • [2] Naive Bayes Classification of Uncertain Data
    Ren, Jiangtao
    Lee, Sau Dan
    Chen, Xianlu
    Kao, Ben
    Cheng, Reynold
    Cheung, David
    [J]. 2009 9TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, 2009, : 944 - +
  • [3] Measurement Classification Using Hybrid Weighted Naive Bayes
    Hamblin, David
    Wang, Dali
    Chen, Gao
    [J]. 2016 IEEE INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND VIRTUAL ENVIRONMENTS FOR MEASUREMENT SYSTEMS AND APPLICATIONS (CIVEMSA), 2016, : 6 - 11
  • [4] Weighted Naive Bayes Approach for Imbalanced Indoor Positioning System Using UWB
    Che, Fuhu
    Bin Abbas, Waqas
    Ahmed, Qasim Zeeshan
    Amjad, Bisma
    Khan, Faheem Ahmad
    Lazaridis, Pavlos I.
    [J]. 2022 IEEE INTERNATIONAL BLACK SEA CONFERENCE ON COMMUNICATIONS AND NETWORKING (BLACKSEACOM), 2022, : 72 - 76
  • [5] Evolving Neural Networks with Maximum AUC for Imbalanced Data Classification
    Lu, Xiaofen
    Tang, Ke
    Yao, Xin
    [J]. HYBRID ARTIFICIAL INTELLIGENCE SYSTEMS, PT 1, 2010, 6076 : 335 - 342
  • [6] Artificial Immune System for Attribute Weighted Naive Bayes Classification
    Wu, Jia
    Cai, Zhihua
    Zeng, Sanyou
    Zhu, Xingquan
    [J]. 2013 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2013,
  • [7] SEIR Immune Strategy for Instance Weighted Naive Bayes Classification
    Xue, Shan
    Lu, Jie
    Zhang, Guangquan
    Xiong, Li
    [J]. NEURAL INFORMATION PROCESSING, PT I, 2015, 9489 : 283 - 292
  • [8] A gradient approach for value weighted classification learning in naive Bayes
    Lee, Chang-Hwan
    [J]. KNOWLEDGE-BASED SYSTEMS, 2015, 85 : 71 - 79
  • [9] DISCRIMINATIVELY WEIGHTED NAIVE BAYES AND ITS APPLICATION IN TEXT CLASSIFICATION
    Jiang, Liangxiao
    Wang, Dianghong
    Cai, Zhihua
    [J]. INTERNATIONAL JOURNAL ON ARTIFICIAL INTELLIGENCE TOOLS, 2012, 21 (01)
  • [10] A Double Weighted Naive Bayes for Multi-label Classification
    Yan, Xuesong
    Li, Wei
    Wu, Qinghua
    Sheng, Victor S.
    [J]. COMPUTATIONAL INTELLIGENCE AND INTELLIGENT SYSTEMS, (ISICA 2015), 2016, 575 : 382 - 389