Hybrid feature selection using micro genetic algorithm on microarray gene expression data

Cited by: 11
Authors
Pragadeesh, C. [1]
Jeyaraj, Rohana [1]
Siranjeevi, K. [1]
Abishek, R. [1]
Jeyakumar, G. [1]
Affiliation
[1] Amrita Vishwa Vidyapeetham, Amrita Sch Engn, Dept Comp Sci & Engn, Coimbatore, Tamil Nadu, India
Keywords
Genetic algorithm; feature selection; microarray; hybrid methods; classification
DOI
10.3233/JIFS-169935
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Research has shown that DNA microarray data containing gene expression profiles are potentially excellent diagnostic tools in medicine. A persistent problem with available microarray datasets is that the number of samples is far smaller than the number of features. Extracting accurate information from such a dataset therefore requires a robust technique. Feature selection (FS) has proved to be an effective way to discard irrelevant and noisy data: only the relevant features are retained, which leads to good classification accuracy. This paper proposes a model that uses a hybrid feature selection technique (filter + wrapper) to classify microarray cancer data. First, a filter method, Information Gain (IG), eliminates redundant features that would not contribute significantly to the final classification. Next, an evolutionary computing technique, the micro Genetic Algorithm (mGA), finds the best minimal subset of the required features. The selected features are then classified with a traditional Support Vector Classifier and cross-validated to obtain high classification accuracy with a minimal number of features. Adding the mGA significantly reduces the complexity of the model compared with existing models that use other feature selection algorithms.
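The filter + wrapper pipeline described in the abstract can be illustrated with a minimal, standard-library-only sketch. It is not the authors' implementation, and several choices are assumptions made for illustration: features are binarised at their median before computing Information Gain, a leave-one-out nearest-centroid classifier stands in for the cross-validated Support Vector Classifier, the fitness subtracts a hypothetical per-feature `penalty`, and the micro GA uses a population of 5 with elitism, tournament selection, uniform crossover, no mutation, and a restart when the population converges. All function names and the toy data are hypothetical.

```python
import math
import random
from collections import Counter
from functools import lru_cache

def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels."""
    n = len(labels)
    return -sum((c / n) * math.log2(c / n) for c in Counter(labels).values())

def information_gain(column, labels):
    """IG of one feature, binarised at its median (an assumed discretisation)."""
    median = sorted(column)[len(column) // 2]
    low = [lab for x, lab in zip(column, labels) if x < median]
    high = [lab for x, lab in zip(column, labels) if x >= median]
    remainder = sum(len(part) / len(labels) * entropy(part)
                    for part in (low, high) if part)
    return entropy(labels) - remainder

def filter_top_k(X, y, k):
    """Filter stage: indices of the k features with the highest IG."""
    gains = [information_gain([row[j] for row in X], y) for j in range(len(X[0]))]
    return sorted(range(len(gains)), key=lambda j: -gains[j])[:k]

def loo_accuracy(X, y, features):
    """Stand-in fitness: leave-one-out nearest-centroid accuracy (not an SVC)."""
    if not features:
        return 0.0
    hits = 0
    for i in range(len(X)):
        centroids = {}
        for label in set(y):
            rows = [X[r] for r in range(len(X)) if r != i and y[r] == label]
            if rows:
                centroids[label] = [sum(r[j] for r in rows) / len(rows)
                                    for j in features]
        point = [X[i][j] for j in features]
        pred = min(centroids, key=lambda c: sum(
            (a - b) ** 2 for a, b in zip(point, centroids[c])))
        hits += pred == y[i]
    return hits / len(X)

def micro_ga(X, y, candidates, pop_size=5, generations=30, penalty=0.01, seed=0):
    """Wrapper stage: micro GA over bit masks of the candidate features."""
    rng = random.Random(seed)
    n = len(candidates)

    def decode(mask):
        return [candidates[i] for i in range(n) if mask[i]]

    @lru_cache(maxsize=None)
    def fitness(mask):
        # Accuracy minus a small cost per selected feature (assumed trade-off).
        return loo_accuracy(X, y, decode(mask)) - penalty * sum(mask)

    def random_mask():
        return tuple(rng.randint(0, 1) for _ in range(n))

    # Assumption: one individual starts as the full filtered set.
    population = [(1,) * n] + [random_mask() for _ in range(pop_size - 1)]
    best = max(population, key=fitness)
    for _ in range(generations):
        children = [best]  # elitism: the best mask always survives
        while len(children) < pop_size:
            p1 = max(rng.sample(population, 2), key=fitness)  # tournament
            p2 = max(rng.sample(population, 2), key=fitness)
            children.append(tuple(rng.choice(pair) for pair in zip(p1, p2)))
        population = children
        best = max(population, key=fitness)
        if len(set(population)) == 1:  # converged: keep the elite, restart the rest
            population = [best] + [random_mask() for _ in range(pop_size - 1)]
    return decode(best)

# Toy data: feature 0 separates the classes; features 1-4 carry no class signal.
y = [i % 2 for i in range(20)]
X = [[5 * y[i] + 0.1 * ((i // 2) % 3)]
     + [((i // 2) * (j + 3) % 10) / 10 for j in range(1, 5)]
     for i in range(20)]
selected = filter_top_k(X, y, 3)   # filter: keep the 3 highest-IG features
chosen = micro_ga(X, y, selected)  # wrapper: search subsets of those 3
print(selected, chosen)
```

In this sketch the filter stage shrinks the search space before the micro GA runs, so the wrapper only evaluates subsets of a few candidate features; the very small population with restarts is what distinguishes a micro GA from a conventional GA.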
Pages: 2241-2246
Number of pages: 6