A group incremental feature selection for classification using rough set theory based genetic algorithm

被引:107
|
作者
Das, Asit K. [1 ]
Sengupta, Shampa [2 ]
Bhattacharyya, Siddhartha [3 ]
机构
[1] Indian Inst Engn Sci & Technol, Dept Comp Sci & Technol, Howrah 711103, W Bengal, India
[2] MCKV Inst Engn, Dept Informat Technol, Howrah 711204, W Bengal, India
[3] RCC Inst Informat Technol, Dept Comp Applicat, Kolkata, India
关键词
Data mining; Rough set theory; Genetic algorithm; Incremental data; Feature selection; Classification; ATTRIBUTE REDUCTION;
D O I
10.1016/j.asoc.2018.01.040
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Data Mining is one of the most challenging tasks in a dynamic environment due to rapid growth of data with respect to time. Dimension reduction, the key process of relevant feature selection, is applied prior to extracting interesting patterns or information from large repositories of data. In a dynamic environment, newly generated group of data together with the information extracted from the previous data are analyzed to select the most relevant and important features of the entire data set. As a result, efficiency and acceptability of the incremental feature selection model increase in the field of data mining. In our paper, a group incremental feature selection algorithm is proposed using rough set theory based genetic algorithm for selecting the optimized and relevant feature subset, called reduct. The objective function of the genetic algorithm used for incremental feature selection is defined using the previously generated reduct and positive region of the target set, concepts of rough set theory. The method may be applied in a regular basis in the dynamic environment after small to moderate volume of data being added into the system and thus the computational time, the major issue of the genetic algorithm does not affect the proposed method. Experimental results on benchmark datasets demonstrate that the proposed method provides satisfactory results in terms of number of selected features, computation time and classification accuracies of various classifiers. (c) 2018 Elsevier B.V. All rights reserved.
引用
收藏
页码:400 / 411
页数:12
相关论文
共 50 条
  • [41] Attribute clustering using rough set theory for feature selection in fault severity classification of rotating machinery
    Pacheco, Fannia
    Cerrada, Mariela
    Sanchez, Rene-Vinicio
    Cabrera, Diego
    Li, Chuan
    de Oliveira, Jose Valente
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2017, 71 : 69 - 86
  • [42] Trajectory Classification Using Feature Selection by Genetic Algorithm
    Saini, Rajkumar
    Kumar, Pradeep
    Roy, Partha Pratim
    Pal, Umapada
    [J]. PROCEEDINGS OF 3RD INTERNATIONAL CONFERENCE ON COMPUTER VISION AND IMAGE PROCESSING, CVIP 2018, VOL 2, 2020, 1024 : 377 - 388
  • [43] An approach for selective ensemble feature selection based on rough set theory
    Yang, Yong
    Wang, Guoyin
    He, Kun
    [J]. ROUGH SETS AND KNOWLEDGE TECHNOLOGY, PROCEEDINGS, 2007, 4481 : 518 - +
  • [44] Feature Selection Based on Ant Colony Optimization and Rough Set Theory
    He, Ming
    [J]. ISCSCT 2008: INTERNATIONAL SYMPOSIUM ON COMPUTER SCIENCE AND COMPUTATIONAL TECHNOLOGY, VOL 1, PROCEEDINGS, 2008, : 247 - 250
  • [45] Covering rough set-based incremental feature selection for mixed decision system
    Yang, Yanyan
    Chen, Degang
    Zhang, Xiao
    Ji, Zhenyan
    [J]. SOFT COMPUTING, 2022, 26 (06) : 2651 - 2669
  • [46] Rough Set Based Unsupervised Feature Selection in Mammogram Image Classification Using Entropy Measure
    Thangavel, K.
    Velayutham, C.
    [J]. JOURNAL OF MEDICAL IMAGING AND HEALTH INFORMATICS, 2012, 2 (03) : 320 - 326
  • [47] Covering rough set-based incremental feature selection for mixed decision system
    Yanyan Yang
    Degang Chen
    Xiao Zhang
    Zhenyan Ji
    [J]. Soft Computing, 2022, 26 : 2651 - 2669
  • [48] Rough set theory in discretization method based on genetic algorithm
    Huang, Lei
    [J]. PROCEEDINGS OF THE 4TH INTERNATIONAL CONFERENCE ON MECHATRONICS, MATERIALS, CHEMISTRY AND COMPUTER ENGINEERING 2015 (ICMMCCE 2015), 2015, 39 : 2089 - 2092
  • [49] A neurofuzzy system based on rough set theory and genetic algorithm
    罗健旭
    邵惠鹤
    [J]. Journal of Harbin Institute of Technology(New series), 2005, (03) : 278 - 282
  • [50] Fractional Calculus-Based Slime Mould Algorithm for Feature Selection Using Rough Set
    Ibrahim, Rehab Ali
    Yousri, Dalia
    Abd Elaziz, Mohamed
    Alshathri, Samah
    Attiya, Ibrahim
    [J]. IEEE ACCESS, 2021, 9 : 131625 - 131636