A group incremental feature selection for classification using rough set theory based genetic algorithm

被引:107
|
作者
Das, Asit K. [1 ]
Sengupta, Shampa [2 ]
Bhattacharyya, Siddhartha [3 ]
机构
[1] Indian Inst Engn Sci & Technol, Dept Comp Sci & Technol, Howrah 711103, W Bengal, India
[2] MCKV Inst Engn, Dept Informat Technol, Howrah 711204, W Bengal, India
[3] RCC Inst Informat Technol, Dept Comp Applicat, Kolkata, India
关键词
Data mining; Rough set theory; Genetic algorithm; Incremental data; Feature selection; Classification; ATTRIBUTE REDUCTION;
D O I
10.1016/j.asoc.2018.01.040
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Data Mining is one of the most challenging tasks in a dynamic environment due to rapid growth of data with respect to time. Dimension reduction, the key process of relevant feature selection, is applied prior to extracting interesting patterns or information from large repositories of data. In a dynamic environment, newly generated group of data together with the information extracted from the previous data are analyzed to select the most relevant and important features of the entire data set. As a result, efficiency and acceptability of the incremental feature selection model increase in the field of data mining. In our paper, a group incremental feature selection algorithm is proposed using rough set theory based genetic algorithm for selecting the optimized and relevant feature subset, called reduct. The objective function of the genetic algorithm used for incremental feature selection is defined using the previously generated reduct and positive region of the target set, concepts of rough set theory. The method may be applied in a regular basis in the dynamic environment after small to moderate volume of data being added into the system and thus the computational time, the major issue of the genetic algorithm does not affect the proposed method. Experimental results on benchmark datasets demonstrate that the proposed method provides satisfactory results in terms of number of selected features, computation time and classification accuracies of various classifiers. (c) 2018 Elsevier B.V. All rights reserved.
引用
收藏
页码:400 / 411
页数:12
相关论文
共 50 条
  • [1] Rough set Theory-Based group incremental approach to feature selection
    Zhao, Jie
    Wu, Dai-yang
    Zhou, Yong-xin
    Liang, Jia-ming
    Wei, WenHong
    Li, Yun
    [J]. INFORMATION SCIENCES, 2024, 675
  • [2] A hybrid genetic algorithm for feature subset selection in rough set theory
    Si-Yuan Jing
    [J]. Soft Computing, 2014, 18 : 1373 - 1382
  • [3] A hybrid genetic algorithm for feature subset selection in rough set theory
    Jing, Si-Yuan
    [J]. SOFT COMPUTING, 2014, 18 (07) : 1373 - 1382
  • [4] An Exact Feature Selection Algorithm Based on Rough Set Theory
    Rezvan, Mohammad Taghi
    Hamadani, Ali Zeinal
    Hejazi, Seyed Reza
    [J]. COMPLEXITY, 2015, 20 (05) : 50 - 62
  • [5] A Novel Algorithm for Feature Selection Based on Rough set Theory
    Zhou Feng-xiang
    Mu Chun-ge
    Xu Qun-san
    Zhang Xiao-feng
    [J]. 2008 CHINESE CONTROL AND DECISION CONFERENCE, VOLS 1-11, 2008, : 800 - +
  • [6] Consistency approximation: Incremental feature selection based on fuzzy rough set theory
    Zhao, Jie
    Wu, Daiyang
    Wu, Jiaxin
    Ye, Wenhao
    Huang, Faliang
    Wang, Jiahai
    See-To, Eric W. K.
    [J]. PATTERN RECOGNITION, 2024, 155
  • [7] Clustering algorithm using rough set theory for unsupervised feature selection
    Pacheco, Fannia
    Cerrada, Mariela
    Li, Chuan
    Sanchez, Rene Vinicio
    Cabrera, Diego
    de Oliveira, Jose Valente
    [J]. 2016 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2016, : 3493 - 3499
  • [8] A Group Incremental Approach to Feature Selection Applying Rough Set Technique
    Liang, Jiye
    Wang, Feng
    Dang, Chuangyin
    Qian, Yuhua
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2014, 26 (02) : 294 - 308
  • [9] Network traffic classification using rough set theory and genetic algorithm
    Li, Ning
    Chen, Zilong
    Zhou, Gang
    [J]. INTELLIGENT CONTROL AND AUTOMATION, 2006, 344 : 945 - 950
  • [10] Breast Cancer Classification Based on Improved Rough Set Theory Feature Selection
    Farouk, R. M.
    Mustafa, Heba, I
    Ali, Abd Elmounem
    [J]. FILOMAT, 2020, 34 (01) : 19 - 34