Efficient feature selection and classification algorithm based on PSO and rough sets

Cited by: 23
Authors
Huda, Ramesh Kumar [1 ]
Banka, Haider [1 ]
Affiliations
[1] Indian Inst Technol ISM, Dhanbad, Bihar, India
Source
NEURAL COMPUTING & APPLICATIONS | 2019 / Vol. 31 / Issue 08
Keywords
Feature selection; Rough sets; New quick reduct; Inconsistency handler; Classification; Fitness function; Particle Swarm Optimization; OPTIMIZATION; REDUCTION;
DOI
10.1007/s00521-017-3317-9
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
High-dimensional data are often characterized by a large number of features and a small number of instances. Many of these features are irrelevant or redundant. They can be especially harmful when the number of features is extreme, because representing the dataset then raises memory-usage problems, and when the training set is relatively small, because irrelevancy and redundancy are then harder to evaluate. Hence, in this paper we propose an efficient feature selection and classification method based on Particle Swarm Optimization (PSO) and rough sets. We propose an inconsistency handler algorithm for handling inconsistency in the dataset, a new quick reduct algorithm for handling irrelevant/noisy features, and a fitness function with three parameters: the classification quality of the feature subset, the remaining features, and the accuracy of approximation. The proposed method is compared with two traditional feature selection methods and three existing methods that fuse PSO and rough sets. Decision Tree and Naive Bayes classifiers are used to calculate the classification accuracy of the selected feature subset on nine benchmark datasets. The results show that the proposed method can automatically select a small feature subset with better classification accuracy than using all features. The proposed method also outperforms the two traditional and three existing PSO and rough set-based feature selection methods in terms of classification accuracy, feature-subset cardinality, and stability indices. It is also observed that increasing the weight on the classification quality of the feature subset in the fitness function significantly reduces the cardinality of the selected features while also achieving better classification accuracy.
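The three fitness terms named in the abstract can be illustrated with standard rough-set quantities. The sketch below is not the paper's formula, which the abstract does not give. It assumes a plain weighted sum, hypothetical weights `w`, and the fraction of removed features as a stand-in for the "remaining features" term. The function names `partition`, `approximations`, and `fitness` are likewise illustrative. Classification quality and accuracy of approximation follow the usual rough-set definitions based on lower and upper approximations of the decision classes.

```python
from collections import defaultdict


def partition(rows, attrs):
    """Group row indices into equivalence classes by their values on attrs."""
    blocks = defaultdict(set)
    for i, row in enumerate(rows):
        blocks[tuple(row[a] for a in attrs)].add(i)
    return list(blocks.values())


def approximations(rows, labels, attrs):
    """Return total lower and upper approximation sizes over all decision classes."""
    blocks = partition(rows, attrs)
    lower = upper = 0
    for cls in set(labels):
        members = {i for i, y in enumerate(labels) if y == cls}
        for block in blocks:
            if block <= members:   # block lies entirely inside the class
                lower += len(block)
            if block & members:    # block overlaps the class
                upper += len(block)
    return lower, upper


def fitness(rows, labels, attrs, n_features, w=(0.6, 0.2, 0.2)):
    """Weighted sum of three terms (assumed form, hypothetical weights)."""
    lower, upper = approximations(rows, labels, attrs)
    quality = lower / len(rows)                      # classification quality
    removed = (n_features - len(attrs)) / n_features  # reward for smaller subsets
    accuracy = lower / upper                         # accuracy of approximation
    return w[0] * quality + w[1] * removed + w[2] * accuracy


# Toy data: feature 0 determines the label, feature 1 is noise.
rows = [(0, 0), (0, 1), (1, 0), (1, 1)]
labels = [0, 0, 1, 1]
for subset in ([0], [1], [0, 1]):
    print(subset, fitness(rows, labels, subset, n_features=2))
```

In a PSO wrapper, each particle would encode a candidate subset such as `[0]` or `[0, 1]`, and a function of this kind would score it. On the toy data the single informative feature scores highest, because it keeps full classification quality while using fewer features.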
Pages: 4287 - 4303
Page count: 17
Related papers
50 in total
  • [1] Efficient feature selection and classification algorithm based on PSO and rough sets
    Ramesh Kumar Huda
    Haider Banka
    [J]. Neural Computing and Applications, 2019, 31 : 4287 - 4303
  • [2] Supervised hybrid feature selection based on PSO and rough sets for medical diagnosis
    Inbarani, H. Hannah
    Azar, Ahmad Taher
    Jothi, G.
    [J]. COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2014, 113 (01) : 175 - 185
  • [3] A New Heuristic Feature Selection Algorithm Based on Rough Sets
    Zhao, Hua
    Qin, Keyun
    Qiu, Xiaoping
    [J]. ADVANCED INTELLIGENT COMPUTING THEORIES AND APPLICATIONS, 2010, 93 : 147 - +
  • [4] A new algorithm for feature selection based on rough sets theory
    Caballero, Yaile
    Alvarez, Delia
    Balta, Analay
    Bello, Rafael
    Garcia, Maria
    [J]. REVISTA FACULTAD DE INGENIERIA-UNIVERSIDAD DE ANTIOQUIA, 2007, (41): : 132 - 144
  • [5] A Rough Based Hybrid Binary PSO Algorithm for Flat Feature Selection and Classification in Gene Expression Data
    Dara S.
    Banka H.
    Annavarapu C.S.R.
    [J]. Annals of Data Science, 2017, 4 (3) : 341 - 360
  • [6] Feature selection with rough sets for web page classification
    An, AJ
    Huang, YH
    Huang, XJ
    Cercone, N
    [J]. TRANSACTIONS ON ROUGH SETS II: ROUGH SETS AND FUZZY SETS, 2004, 3135 : 1 - 13
  • [7] Fuzzy Rough Sets-Based Incremental Feature Selection for Hierarchical Classification
    Huang, Wanli
    She, Yanhong
    He, Xiaoli
    Ding, Weiping
    [J]. IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2023, 31 (10) : 3721 - 3733
  • [8] Generalized rough sets based feature selection
    Quafafou, Mohamed
    Boussouf, Moussa
    [J]. Intelligent Data Analysis, 2000, 4 (01) : 3 - 17
  • [9] A rough sets based approach to feature selection
    Zhang, M
    Yao, JT
    [J]. NAFIPS 2004: ANNUAL MEETING OF THE NORTH AMERICAN FUZZY INFORMATION PROCESSING SOCIETY, VOLS 1 AND 2: FUZZY SETS IN THE HEART OF THE CANADIAN ROCKIES, 2004, : 434 - 439
  • [10] Parallel Feature Selection Algorithm based on Rough Sets and Particle Swarm Optimization
    Adamczyk, Mateusz
    [J]. FEDERATED CONFERENCE ON COMPUTER SCIENCE AND INFORMATION SYSTEMS, 2014, 2014, 2 : 43 - 50