An AdaBoost Method with K'K-Means Bayes Classifier for Imbalanced Data

被引：1

作者：

Zhang, Yanfeng ^{[1
]}

Wang, Lichun ^{[1
]}

机构：

[1] Beijing Jiaotong Univ, Dept Stat, Beijing 100044, Peoples R China

来源：

MATHEMATICS | 2023年 / 11卷 / 08期

基金：

中国国家自然科学基金;

关键词：

imbalanced data; naive Bayes; imbalanced classifiers; AdaBoost method; ALGORITHM;

D O I：

10.3390/math11081878

中图分类号：

O1 [数学];

学科分类号：

0701 ; 070101 ;

摘要：

This article proposes a new AdaBoost method with k'k-means Bayes classifier for imbalanced data. It reduces the imbalance degree of training data through the k'k-means Bayes method and then deals with the imbalanced classification problem using multiple iterations with weight control, achieving a good effect without losing any raw data information or needing to generate more relevant data manually. The effectiveness of the proposed method is verified by comparing it with other traditional methods based on numerical experiments. In the NSL-KDD data experiment, the F-score values of each minority class are also greater than the other methods.

引用

页数：11

共 50 条

[1] K-means Bayes algorithm for imbalanced fault classification and big data application
Chen, Gecheng
Liu, Yue
Ge, Zhiqiang
JOURNAL OF PROCESS CONTROL, 2019, 81 : 54 - 64
[2] Imbalanced data optimization combining K-means and SMOTE
Li W.
International Journal of Performability Engineering, 2019, 15 (08): : 2173 - 2181
[3] An Integration of K-Means Clustering and Naive Bayes Classifier for Intrusion Detection
Varuna, S.
Natesan, P.
2015 3RD INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, COMMUNICATION AND NETWORKING (ICSCN), 2015,
[4] A K-means triangular synthesis large margin classifier with unified pinball loss for imbalanced data
Shao, Danlin
Dai, Yixi
Li, Junjie
Li, Shenglin
Chen, Rui
APPLIED SOFT COMPUTING, 2024, 167
[5] An Improved Native Bayes Classifier for Imbalanced Text Categorization Based on K-means and CHI-square Feature Selection
Meng Fanbo
Xu Linying
2018 EIGHTH INTERNATIONAL CONFERENCE ON INSTRUMENTATION AND MEASUREMENT, COMPUTER, COMMUNICATION AND CONTROL (IMCCC 2018), 2018, : 894 - 898
[6] The k-means forest classifier for high dimensional data
Chen, Zizhong
Ding, Xin
Xia, Shuyin
Chen, Baiyun
2018 9TH IEEE INTERNATIONAL CONFERENCE ON BIG KNOWLEDGE (ICBK), 2018, : 322 - 327
[7] 基于K-means Bayes和AdaBoost-SVM的故障分类
黄子扬
周凌柯
计算机系统应用, 2022, 31 (07) : 239 - 246
[8] Undersampled K-means approach for handling imbalanced distributed data
Kumar, N. Santhosh
Rao, K. Nageswara
Govardhan, A.
Reddy, K. Sudheer
Mahmood, Ali Mirza
PROGRESS IN ARTIFICIAL INTELLIGENCE, 2014, 3 (01) : 29 - 38
[9] An Improved Oversampling Method for imbalanced Data-SMOTE Based on Canopy and K-means
Guo, Chaoyou
Ma, Yankun
Xu, Zhe
Cao, Mengmeng
Yao, Qian
2019 CHINESE AUTOMATION CONGRESS (CAC2019), 2019, : 1467 - 1469
[10] A K-Means Remote Sensing Image Classification Method Based On AdaBoost
Zheng, Jian
Cui, Zhanzhong
Liu, Anfei
Jia, Yu
ICNC 2008: FOURTH INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION, VOL 4, PROCEEDINGS, 2008, : 27 - +

← 1 2 3 4 5 →