An AdaBoost Method with K'K-Means Bayes Classifier for Imbalanced Data

被引：1

作者：

Zhang, Yanfeng ^{[1
]}

Wang, Lichun ^{[1
]}

机构：

[1] Beijing Jiaotong Univ, Dept Stat, Beijing 100044, Peoples R China

来源：

MATHEMATICS | 2023年 / 11卷 / 08期

基金：

中国国家自然科学基金;

关键词：

imbalanced data; naive Bayes; imbalanced classifiers; AdaBoost method; ALGORITHM;

D O I：

10.3390/math11081878

中图分类号：

O1 [数学];

学科分类号：

0701 ; 070101 ;

摘要：

This article proposes a new AdaBoost method with k'k-means Bayes classifier for imbalanced data. It reduces the imbalance degree of training data through the k'k-means Bayes method and then deals with the imbalanced classification problem using multiple iterations with weight control, achieving a good effect without losing any raw data information or needing to generate more relevant data manually. The effectiveness of the proposed method is verified by comparing it with other traditional methods based on numerical experiments. In the NSL-KDD data experiment, the F-score values of each minority class are also greater than the other methods.

引用

页数：11

共 50 条

[21] Incremental k-Means Method
Prasad, Rabinder Kumar
Sarmah, Rosy
Chakraborty, Subrata
PATTERN RECOGNITION AND MACHINE INTELLIGENCE, PREMI 2019, PT I, 2019, 11941 : 38 - 46
[22] k-means nearest neighbor classifier for voice pathology
Ananthakrishna, T
Shama, K
Niranjan, UC
Proceedings of the IEEE INDICON 2004, 2004, : 352 - 354
[23] A Constructing Method of Fuzzy Classifier Using Kernel K-means Clustering Algorithm
Yang, Aimin
Li, Qing
Li, Xinguang
2009 SECOND INTERNATIONAL SYMPOSIUM ON KNOWLEDGE ACQUISITION AND MODELING: KAM 2009, VOL 2, 2009, : 73 - +
[24] Clusterization by the K-means method when K is unknown
Litvinenko, Natalya
Mamyrbayev, Orken
Shayakhmetova, Assem
Turdalyuly, Mussa
AMCSE 2018 - INTERNATIONAL CONFERENCE ON APPLIED MATHEMATICS, COMPUTATIONAL SCIENCE AND SYSTEMS ENGINEERING, 2019, 24
[25] A K-means Clustering Based Under-Sampling Method for Imbalanced Dataset Classification
Huang, Chih-Ming
Hung, Chuan-Sheng
Hsu, Yao-Yuan
Zheng, You-Cheng
Yu, Cheng-Han
Lin, Chun-Hung Richard
Chen, Shi-Huang
38TH INTERNATIONAL CONFERENCE ON INFORMATION NETWORKING, ICOIN 2024, 2024, : 708 - 713
[26] Kernel Penalized K-means: A feature selection method based on Kernel K-means
Maldonado, Sebastian
Carrizosa, Emilio
Weber, Richard
INFORMATION SCIENCES, 2015, 322 : 150 - 160
[27] An automatic identification method of imbalanced lithology based on Deep Forest and K-means SMOTE
Zhu, Xinyi
Zhang, Hongbing
Ren, Quan
Zhang, Dailu
Zeng, Fanxing
Zhu, Xinjie
Zhang, Lingyuan
GEOENERGY SCIENCE AND ENGINEERING, 2023, 224
[28] Improving imbalanced learning through a heuristic oversampling method based on k-means and SMOTE
Douzas, Georgios
Bacao, Fernando
Last, Felix
INFORMATION SCIENCES, 2018, 465 : 1 - 20
[29] Comparative Analysis of K-Means Method and Naive Bayes Method for Brute Force Attack Visualization
Stiawan, Deris
Alzahrani, Esam
Sandra, Sari
Budiarto, Rahmat
2017 2ND INTERNATIONAL CONFERENCE ON ANTI-CYBER CRIMES (ICACC), 2017, : 177 - 182
[30] K-means - a fast and efficient K-means algorithms
Nguyen C.D.
Duong T.H.
Nguyen, Cuong Duc (nguyenduccuong@tdt.edu.vn), 2018, Inderscience Publishers, 29, route de Pre-Bois, Case Postale 856, CH-1215 Geneva 15, CH-1215, Switzerland (11) : 27 - 45

← 1 2 3 4 5 →