An AdaBoost Method with K'K-Means Bayes Classifier for Imbalanced Data

被引:1
|
作者
Zhang, Yanfeng [1 ]
Wang, Lichun [1 ]
机构
[1] Beijing Jiaotong Univ, Dept Stat, Beijing 100044, Peoples R China
基金
中国国家自然科学基金;
关键词
imbalanced data; naive Bayes; imbalanced classifiers; AdaBoost method; ALGORITHM;
D O I
10.3390/math11081878
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
This article proposes a new AdaBoost method with k'k-means Bayes classifier for imbalanced data. It reduces the imbalance degree of training data through the k'k-means Bayes method and then deals with the imbalanced classification problem using multiple iterations with weight control, achieving a good effect without losing any raw data information or needing to generate more relevant data manually. The effectiveness of the proposed method is verified by comparing it with other traditional methods based on numerical experiments. In the NSL-KDD data experiment, the F-score values of each minority class are also greater than the other methods.
引用
收藏
页数:11
相关论文
共 50 条
  • [1] K-means Bayes algorithm for imbalanced fault classification and big data application
    Chen, Gecheng
    Liu, Yue
    Ge, Zhiqiang
    JOURNAL OF PROCESS CONTROL, 2019, 81 : 54 - 64
  • [2] Imbalanced data optimization combining K-means and SMOTE
    Li W.
    International Journal of Performability Engineering, 2019, 15 (08): : 2173 - 2181
  • [3] An Integration of K-Means Clustering and Naive Bayes Classifier for Intrusion Detection
    Varuna, S.
    Natesan, P.
    2015 3RD INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, COMMUNICATION AND NETWORKING (ICSCN), 2015,
  • [4] A K-means triangular synthesis large margin classifier with unified pinball loss for imbalanced data
    Shao, Danlin
    Dai, Yixi
    Li, Junjie
    Li, Shenglin
    Chen, Rui
    APPLIED SOFT COMPUTING, 2024, 167
  • [5] An Improved Native Bayes Classifier for Imbalanced Text Categorization Based on K-means and CHI-square Feature Selection
    Meng Fanbo
    Xu Linying
    2018 EIGHTH INTERNATIONAL CONFERENCE ON INSTRUMENTATION AND MEASUREMENT, COMPUTER, COMMUNICATION AND CONTROL (IMCCC 2018), 2018, : 894 - 898
  • [6] The k-means forest classifier for high dimensional data
    Chen, Zizhong
    Ding, Xin
    Xia, Shuyin
    Chen, Baiyun
    2018 9TH IEEE INTERNATIONAL CONFERENCE ON BIG KNOWLEDGE (ICBK), 2018, : 322 - 327
  • [7] 基于K-means Bayes和AdaBoost-SVM的故障分类
    黄子扬
    周凌柯
    计算机系统应用, 2022, 31 (07) : 239 - 246
  • [8] Undersampled K-means approach for handling imbalanced distributed data
    Kumar, N. Santhosh
    Rao, K. Nageswara
    Govardhan, A.
    Reddy, K. Sudheer
    Mahmood, Ali Mirza
    PROGRESS IN ARTIFICIAL INTELLIGENCE, 2014, 3 (01) : 29 - 38
  • [9] An Improved Oversampling Method for imbalanced Data-SMOTE Based on Canopy and K-means
    Guo, Chaoyou
    Ma, Yankun
    Xu, Zhe
    Cao, Mengmeng
    Yao, Qian
    2019 CHINESE AUTOMATION CONGRESS (CAC2019), 2019, : 1467 - 1469
  • [10] A K-Means Remote Sensing Image Classification Method Based On AdaBoost
    Zheng, Jian
    Cui, Zhanzhong
    Liu, Anfei
    Jia, Yu
    ICNC 2008: FOURTH INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION, VOL 4, PROCEEDINGS, 2008, : 27 - +