An efficient feature selection algorithm for hybrid data

Cited by: 27
Authors
Wang, Feng [1 ,2 ]
Liang, Jiye [1 ,2 ]
Affiliations
[1] Minist Educ, Key Lab Computat Intelligence & Chinese Informat, Taiyuan, Shanxi, Peoples R China
[2] Shanxi Univ, Sch Comp & Informat Technol, Taiyuan 030006, Shanxi, Peoples R China
Keywords
Feature selection; Hybrid data; Rough set theory; Large-scale data sets; ROUGH SET; ATTRIBUTE REDUCTION; MUTUAL INFORMATION; DIMENSIONALITY REDUCTION; INCREMENTAL APPROACH; GRANULATION; KNOWLEDGE; RELEVANCE; ENTROPY; SYSTEMS;
DOI
10.1016/j.neucom.2016.01.056
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Feature selection for large-scale data sets is widely regarded as a very important data preprocessing step in machine learning. Data sets in real databases usually take on hybrid forms, i.e., categorical and numerical data coexist. In this paper, based on the idea of decomposition and fusion, an efficient feature selection approach for large-scale hybrid data sets is studied. With this approach, one can obtain an effective feature subset in a much shorter time. By employing two common classifiers as the evaluation function, experiments have been carried out on twelve UCI data sets. The experimental results show that the proposed approach is effective and efficient. (C) 2016 Elsevier B.V. All rights reserved.
Pages: 33 - 41
Number of pages: 9
Related papers
50 in total
  • [1] Hybrid Efficient Genetic Algorithm for Big Data Feature Selection Problems
    Mohammed, Tareq Abed
    Bayat, Oguz
    Ucan, Osman N.
    Alhayali, Shaymaa
    [J]. FOUNDATIONS OF SCIENCE, 2020, 25 (04) : 1009 - 1025
  • [2] Hybrid Efficient Genetic Algorithm for Big Data Feature Selection Problems
    Tareq Abed Mohammed
    Oguz Bayat
    Osman N. Uçan
    Shaymaa Alhayali
    [J]. Foundations of Science, 2020, 25 : 1009 - 1025
  • [3] A hybrid feature selection algorithm for microarray data
    Zheng, Yuefeng
    Li, Ying
    Wang, Gang
    Chen, Yupeng
    Xu, Qian
    Fan, Jiahao
    Cui, Xueting
    [J]. JOURNAL OF SUPERCOMPUTING, 2020, 76 (05) : 3494 - 3526
  • [4] A hybrid feature selection algorithm for microarray data
    Yuefeng Zheng
    Ying Li
    Gang Wang
    Yupeng Chen
    Qian Xu
    Jiahao Fan
    Xueting Cui
    [J]. The Journal of Supercomputing, 2020, 76 : 3494 - 3526
  • [5] Hybrid genetic algorithm for feature selection with hyperspectral data
    Pal, Mahesh
    [J]. REMOTE SENSING LETTERS, 2013, 4 (07) : 619 - 628
  • [6] A Robust and Efficient Feature Selection Algorithm for Microarray Data
    Bari, Mehrab Ghanat
    Salekin, Sirajul
    Zhang, Jianqiu
    [J]. MOLECULAR INFORMATICS, 2017, 36 (04)
  • [7] A Hybrid Feature Selection Algorithm
    Yin, Chunyong
    Ma, Luyu
    Feng, Lu
    Wang, Jin
    Yin, Zhichao
    Kim, Jeong-Uk
    [J]. 2015 4TH INTERNATIONAL CONFERENCE ON ADVANCED INFORMATION TECHNOLOGY AND SENSOR APPLICATION (AITS), 2015, : 104 - 107
  • [8] A hybrid feature selection algorithm for gene expression data classification
    Lu, Huijuan
    Chen, Junying
    Yan, Ke
    Jin, Qun
    Xue, Yu
    Gao, Zhigang
    [J]. NEUROCOMPUTING, 2017, 256 : 56 - 62
  • [9] A Hybrid Feature Selection Algorithm For Classification Unbalanced Data Processing
    Zhang, Xue
    Shi, Zhiguo
    Liu, Xuan
    Li, Xueni
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON SMART INTERNET OF THINGS (SMARTIOT 2018), 2018, : 269 - 275
  • [10] A Novel Scalable and Data Efficient Feature Subset Selection Algorithm
    de Morais, Sergio Rodrigues
    Aussem, Alex
    [J]. MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, PART II, PROCEEDINGS, 2008, 5212 : 298 - +