An efficient feature selection algorithm for hybrid data

被引:27
|
作者
Wang, Feng [1 ,2 ]
Liang, Jiye [1 ,2 ]
机构
[1] Minist Educ, Key Lab Computat Intelligence & Chinese Informat, Taiyuan, Shanxi, Peoples R China
[2] Shanxi Univ, Sch Comp & Informat Technol, Taiyuan 030006, Shanxi, Peoples R China
关键词
Feature selection; Hybrid data; Rough set theory; Large-scale data sets; ROUGH SET; ATTRIBUTE REDUCTION; MUTUAL INFORMATION; DIMENSIONALITY REDUCTION; INCREMENTAL APPROACH; GRANULATION; KNOWLEDGE; RELEVANCE; ENTROPY; SYSTEMS;
D O I
10.1016/j.neucom.2016.01.056
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Feature selection for large-scale data sets has been conceived as a very important data preprocessing step in the area of machine learning. Data sets in real databases usually take on hybrid forms, i.e., the coexistence of categorical and numerical data. In this paper, based on the idea of decomposition and fusion, an efficient feature selection approach for large-scale hybrid data sets is studied. According to this approach, one can get an effective feature subset in a much shorter time. By employing two common classifiers as the evaluation function, experiments have been carried out on twelve UCI data sets. The experimental results show that the proposed approach is effective and efficient. (C) 2016 Elsevier B.V. All rights reserved.
引用
收藏
页码:33 / 41
页数:9
相关论文
共 50 条
  • [11] An Efficient Hybrid Feature Selection Method Using the Artificial Immune Algorithm for High-Dimensional Data
    Zhu, Yongbin
    Li, Tao
    Li, Wenshan
    [J]. COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2022, 2022
  • [12] An Efficient Parallel Hybrid Feature Selection Approach for Big Data Analysis
    Azaiz, Mohamed Amine
    Bensaber, Djamel Amar
    [J]. INTERNATIONAL JOURNAL OF SWARM INTELLIGENCE RESEARCH, 2022, 13 (01)
  • [13] A novel oversampling and feature selection hybrid algorithm for imbalanced data classification
    Feng, Fang
    Li, Kuan-Ching
    Yang, Erfu
    Zhou, Qingguo
    Han, Lihong
    Hussain, Amir
    Cai, Mingjiang
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (03) : 3231 - 3267
  • [14] A Hybrid Artificial Immune Algorithm for Feature Selection of Ovarian Cancer Data
    Fang, Tian
    Fu, Dongmei
    Zhao, Yunfeng
    [J]. 2008 INTERNATIONAL WORKSHOP ON EDUCATION TECHNOLOGY AND TRAINING AND 2008 INTERNATIONAL WORKSHOP ON GEOSCIENCE AND REMOTE SENSING, VOL 1, PROCEEDINGS, 2009, : 681 - 685
  • [15] RETRACTED: A hybrid feature selection algorithm for microarray data (Retracted Article)
    Zheng, Yuefeng
    Li, Ying
    Wang, Gang
    Chen, Yupeng
    Xu, Qian
    Fan, Jiahao
    Cui, Xueting
    [J]. CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2019, 31 (12):
  • [16] A novel oversampling and feature selection hybrid algorithm for imbalanced data classification
    Fang Feng
    Kuan-Ching Li
    Erfu Yang
    Qingguo Zhou
    Lihong Han
    Amir Hussain
    Mingjiang Cai
    [J]. Multimedia Tools and Applications, 2023, 82 : 3231 - 3267
  • [17] A novel hybrid algorithm for feature selection
    Zheng, Yuefeng
    Li, Ying
    Wang, Gang
    Chen, Yupeng
    Xu, Qian
    Fan, Jiahao
    Cui, Xueting
    [J]. PERSONAL AND UBIQUITOUS COMPUTING, 2018, 22 (5-6) : 971 - 985
  • [18] Exploration of a hybrid feature selection algorithm
    Osei-Bryson, KM
    Giles, K
    Kositanurit, B
    [J]. JOURNAL OF THE OPERATIONAL RESEARCH SOCIETY, 2003, 54 (07) : 790 - 797
  • [19] A novel hybrid algorithm for feature selection
    Yuefeng Zheng
    Ying Li
    Gang Wang
    Yupeng Chen
    Qian Xu
    Jiahao Fan
    Xueting Cui
    [J]. Personal and Ubiquitous Computing, 2018, 22 : 971 - 985
  • [20] A Hybrid Algorithm for Feature Selection and Classification
    Sathish, B. R.
    Senthilkumar, Radha
    [J]. JOURNAL OF INTERNET TECHNOLOGY, 2023, 24 (03): : 593 - 602