Neighborhood attribute reduction for imbalanced data

被引：0

作者：

Wendong Zhang

Xun Wang

Xibei Yang

Xiangjian Chen

Pingxin Wang

机构：

[1] Jiangsu University of Science and Technology,School of Computer

[2] Jiangsu University of Science and Technology,School of Science

来源：

Granular Computing | 2019年 / 4卷

关键词：

Attribute reduction; Granular computing; K-means; Neighborhood decision error rate; Neighborhood classifier; SMOTE;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

From the viewpoint of rough granular computing, neighborhood decision error rate-based attribute reduction aims to improve the classification performance of the neighborhood classifier. Nevertheless, for imbalanced data which can be seen everywhere in real-world applications, such reduction does not pay much attention to the classification results of samples in minority class. Therefore, a new strategy to attribute reduction is proposed, which is embedded with preprocessing of the imbalanced data. First, the widely accepted SMOTE algorithm and K-means algorithm are used for oversampling and undersampling, respectively. Second, the neighborhood decision error rate-based attribute reduction is designed for those updated data. Finally, the neighborhood classifier can be tested with the attributes in reducts. The experimental results on some UCI and PROMISE data sets show that our approach is superior to the traditional attribute reduction based on the evaluations of F-measure and G-mean. Therefore, the contribution of this paper is to construct the attribute reduction strategy for imbalanced data, which can select useful attributes for improving the classification performance in such data.

引用

页码：301 / 311

页数：10

共 50 条

[31] Numerical attribute reduction based on neighborhood granulation and rough approximation
College of Energy Science and Engineering, Harbin Institute of Technology, Harbin 150001, China
[J]. Ruan Jian Xue Bao, 2008, 3 (640-649):
[32] Spectral Clustering with Neighborhood Attribute Reduction Based on Information Entropy
Jia, Hongjie
Ding, Shifei
Ma, Heng
Xing, Wanqiu
[J]. JOURNAL OF COMPUTERS, 2014, 9 (06) : 1316 - 1324
[33] Attribute reduction on measuring data in neighborhood rough set with common-test-cost and error ranges
刘忠慧
[J]. 科技展望, 2017, (18) : 257 - 261+266
[34] A Mixed Sampling Method for Imbalanced Data Based on Neighborhood Density
Hu, Feng
Yu, Chunlin
Dai, Jin
Liu, Ke
[J]. 2019 IEEE 4TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND BIG DATA ANALYSIS (ICCCBDA), 2019, : 94 - 98
[35] The research of attribute reduction algorithm based on extension neighborhood relation
Department of Information Engineering, Taiyuan University of Technology, Taiyuan 030024, China
[J]. J. Comput. Inf. Syst., 16 (6613-6620):
[36] Attribute reduction based on neighborhood constrained fuzzy rough sets
Hu, Meng
Guo, Yanting
Chen, Degang
Tsang, Eric C. C.
Zhang, Qingshuo
[J]. KNOWLEDGE-BASED SYSTEMS, 2023, 274
[37] Ensemble-Based Neighborhood Attribute Reduction: A Multigranularity View
Gao, Yuan
Chen, Xiangjian
Yang, Xibei
Wang, Pingxin
Mi, Jusheng
[J]. COMPLEXITY, 2019, 2019
[38] Neighborhood Attribute Reduction: A Multicriterion Strategy Based on Sample Selection
Gao, Yuan
Chen, Xiangjian
Yang, Xibei
Wang, Pingxin
[J]. INFORMATION, 2018, 9 (11)
[39] Feature selection for imbalanced data based on neighborhood rough sets
Chen, Hongmei
Li, Tianrui
Fan, Xin
Luo, Chuan
[J]. INFORMATION SCIENCES, 2019, 483 : 1 - 20
[40] Explicitly Semantic Guidance for Face Sketch Attribute Recognition With Imbalanced Data
Shahed, Shahadat
Lin, Yuhao
Hong, Jiangnan
Zhou, Jinglin
Gao, Fei
[J]. IEEE SIGNAL PROCESSING LETTERS, 2023, 30 : 1502 - 1506

← 1 2 3 4 5 →