A New Approach for Wrapper Feature Selection Using Genetic Algorithm for Big Data

被引:10
|
作者
Bouaguel, Waad [1 ]
机构
[1] Univ Tunis, LARODEC, ISG, Tunis, Tunisia
关键词
Wrapper; Feature selection; Big data; CLASSIFICATION; PREDICTION;
D O I
10.1007/978-3-319-27000-5_6
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The increased dimensionality of genomic and proteomic data produced by microarray and mass spectrometry technology makes testing and training of general classification method difficult. Special data analysis is demanded in this case and one of the common ways to handle high dimensionality is identification of the most relevant features in the data. Wrapper feature selection is one of the most common and effective techniques for feature selection. Although efficient, wrapper methods have some limitations due to the fact that their result depends on the search strategy. In theory when a complex search is used, it may take much longer to choose the best subset of features and may be impractical in some cases. Hence we propose a new wrapper feature selection for big data based on a random search using genetic algorithm and prior information. The new approach was tested on 2 biological dataset and compared to two well known wrapper feature selection approaches and results illustrate that our approach gives the best performances.
引用
收藏
页码:75 / 83
页数:9
相关论文
共 50 条
  • [1] Feature Selection Using Genetic Algorithm for Big Data
    Saidi, Rania
    Ncir, Waad Bouaguel
    Essoussi, Nadia
    [J]. INTERNATIONAL CONFERENCE ON ADVANCED MACHINE LEARNING TECHNOLOGIES AND APPLICATIONS (AMLTA2018), 2018, 723 : 352 - 361
  • [2] A new wrapper feature selection approach using neural network
    Kabir, Md Monirul
    Islam, Md Monirul
    Murase, Kazuyuki
    [J]. NEUROCOMPUTING, 2010, 73 (16-18) : 3273 - 3283
  • [3] Experimental feature selection using the wrapper approach
    Baranauskas, JA
    Monard, MC
    [J]. DATA MINING, 1998, : 161 - 170
  • [4] Surrogate-Assisted Genetic Algorithm for Wrapper Feature Selection
    Altarabichi, Mohammed Ghaith
    Nowaczyk, Slawomir
    Pashami, Sepideh
    Mashhadi, Peyman Sheikholharam
    [J]. 2021 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC 2021), 2021, : 776 - 785
  • [5] A Wrapper Feature Selection Approach to Classification with Missing Data
    Cao Truong Tran
    Zhang, Mengjie
    Andreae, Peter
    Xue, Bing
    [J]. APPLICATIONS OF EVOLUTIONARY COMPUTATION, EVOAPPLICATIONS 2016, PT I, 2016, 9597 : 685 - 700
  • [6] Hybrid Efficient Genetic Algorithm for Big Data Feature Selection Problems
    Mohammed, Tareq Abed
    Bayat, Oguz
    Ucan, Osman N.
    Alhayali, Shaymaa
    [J]. FOUNDATIONS OF SCIENCE, 2020, 25 (04) : 1009 - 1025
  • [7] Hybrid Efficient Genetic Algorithm for Big Data Feature Selection Problems
    Tareq Abed Mohammed
    Oguz Bayat
    Osman N. Uçan
    Shaymaa Alhayali
    [J]. Foundations of Science, 2020, 25 : 1009 - 1025
  • [8] Wrapper approach for feature subset selection using GA
    Zhou, Huilin
    Wu, Jianbin
    Wang, Yuhao
    Tian, Mao
    [J]. 2007 INTERNATIONAL SYMPOSIUM ON INTELLIGENT SIGNAL PROCESSING AND COMMUNICATION SYSTEMS, VOLS 1 AND 2, 2007, : 224 - +
  • [9] Analysis of Feature Selection and Extraction Algorithm for Loan Data: A Big Data Approach
    Attigeri, Girija
    Pai, Manohara M. M.
    Pai, Radhika M.
    [J]. 2017 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI), 2017, : 2147 - 2151
  • [10] A Genetic Based Wrapper Feature Selection Approach Using Nearest Neighbour Distance Matrix
    Sainin, Mohd Shamrie
    Alfred, Rayner
    [J]. 2011 3RD CONFERENCE ON DATA MINING AND OPTIMIZATION (DMO), 2011, : 237 - 242