An AdaBoost Method with K'K-Means Bayes Classifier for Imbalanced Data

被引:1
|
作者
Zhang, Yanfeng [1 ]
Wang, Lichun [1 ]
机构
[1] Beijing Jiaotong Univ, Dept Stat, Beijing 100044, Peoples R China
基金
中国国家自然科学基金;
关键词
imbalanced data; naive Bayes; imbalanced classifiers; AdaBoost method; ALGORITHM;
D O I
10.3390/math11081878
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
This article proposes a new AdaBoost method with k'k-means Bayes classifier for imbalanced data. It reduces the imbalance degree of training data through the k'k-means Bayes method and then deals with the imbalanced classification problem using multiple iterations with weight control, achieving a good effect without losing any raw data information or needing to generate more relevant data manually. The effectiveness of the proposed method is verified by comparing it with other traditional methods based on numerical experiments. In the NSL-KDD data experiment, the F-score values of each minority class are also greater than the other methods.
引用
收藏
页数:11
相关论文
共 50 条
  • [21] Incremental k-Means Method
    Prasad, Rabinder Kumar
    Sarmah, Rosy
    Chakraborty, Subrata
    PATTERN RECOGNITION AND MACHINE INTELLIGENCE, PREMI 2019, PT I, 2019, 11941 : 38 - 46
  • [22] k-means nearest neighbor classifier for voice pathology
    Ananthakrishna, T
    Shama, K
    Niranjan, UC
    Proceedings of the IEEE INDICON 2004, 2004, : 352 - 354
  • [23] A Constructing Method of Fuzzy Classifier Using Kernel K-means Clustering Algorithm
    Yang, Aimin
    Li, Qing
    Li, Xinguang
    2009 SECOND INTERNATIONAL SYMPOSIUM ON KNOWLEDGE ACQUISITION AND MODELING: KAM 2009, VOL 2, 2009, : 73 - +
  • [24] Clusterization by the K-means method when K is unknown
    Litvinenko, Natalya
    Mamyrbayev, Orken
    Shayakhmetova, Assem
    Turdalyuly, Mussa
    AMCSE 2018 - INTERNATIONAL CONFERENCE ON APPLIED MATHEMATICS, COMPUTATIONAL SCIENCE AND SYSTEMS ENGINEERING, 2019, 24
  • [25] A K-means Clustering Based Under-Sampling Method for Imbalanced Dataset Classification
    Huang, Chih-Ming
    Hung, Chuan-Sheng
    Hsu, Yao-Yuan
    Zheng, You-Cheng
    Yu, Cheng-Han
    Lin, Chun-Hung Richard
    Chen, Shi-Huang
    38TH INTERNATIONAL CONFERENCE ON INFORMATION NETWORKING, ICOIN 2024, 2024, : 708 - 713
  • [26] Kernel Penalized K-means: A feature selection method based on Kernel K-means
    Maldonado, Sebastian
    Carrizosa, Emilio
    Weber, Richard
    INFORMATION SCIENCES, 2015, 322 : 150 - 160
  • [27] An automatic identification method of imbalanced lithology based on Deep Forest and K-means SMOTE
    Zhu, Xinyi
    Zhang, Hongbing
    Ren, Quan
    Zhang, Dailu
    Zeng, Fanxing
    Zhu, Xinjie
    Zhang, Lingyuan
    GEOENERGY SCIENCE AND ENGINEERING, 2023, 224
  • [28] Improving imbalanced learning through a heuristic oversampling method based on k-means and SMOTE
    Douzas, Georgios
    Bacao, Fernando
    Last, Felix
    INFORMATION SCIENCES, 2018, 465 : 1 - 20
  • [29] Comparative Analysis of K-Means Method and Naive Bayes Method for Brute Force Attack Visualization
    Stiawan, Deris
    Alzahrani, Esam
    Sandra, Sari
    Budiarto, Rahmat
    2017 2ND INTERNATIONAL CONFERENCE ON ANTI-CYBER CRIMES (ICACC), 2017, : 177 - 182
  • [30] K-means - a fast and efficient K-means algorithms
    Nguyen C.D.
    Duong T.H.
    Nguyen, Cuong Duc (nguyenduccuong@tdt.edu.vn), 2018, Inderscience Publishers, 29, route de Pre-Bois, Case Postale 856, CH-1215 Geneva 15, CH-1215, Switzerland (11) : 27 - 45