Multiobjective feature selection for microarray data via distributed parallel algorithms

被引:29
|
作者
Cao, Bin [1 ,2 ]
Zhao, Jianwei [1 ,2 ]
Yang, Po [3 ]
Yang, Peng [2 ]
Liu, Xin [1 ]
Qi, Jun [3 ]
Simpson, Andrew [3 ]
Elhoseny, Mohamed [4 ]
Mehmoode, Irfan [5 ]
Muhammad, Khan [6 ]
机构
[1] Hebei Univ Technol, State Key Lab Reliabil & Intelligence Elect Equip, Tianjin, Peoples R China
[2] Hebei Univ Technol, Sch Artificial Intelligence, Tianjin, Peoples R China
[3] Liverpool John Moores Univ, Dept Comp Sci, Liverpool, Merseyside, England
[4] Mansoura Univ, Fac Computers & Informat, Mansoura, Egypt
[5] Univ Bradford, Dept Media Design & Technol, Fac Engn & Informat, Bradford BD7 1DP, W Yorkshire, England
[6] Sejong Univ, Dept Software, Seoul 143747, South Korea
关键词
Microarray dataset; High dimension; Multiobjective feature selection; Distributed parallelism; Feature redundancy; DIFFERENTIAL EVOLUTION; OPTIMIZATION; CLASSIFICATION;
D O I
10.1016/j.future.2019.02.030
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Many real-world problems are large in scale and hence difficult to address. Due to the large number of features in microarray datasets, feature selection and classification are even more challenging for such datasets. Not all of these numerous features contribute to the classification task, and some even impede performance. Through feature selection, a feature subset that contains only a small quantity of essential features can be generated to increase the classification accuracy and significantly reduce the time consumption. In this paper, we construct a multiobjective feature selection model that simultaneously considers the classification error, the feature number and the feature redundancy. For this model, we propose several distributed parallel algorithms based on different encodings and an adaptive strategy. Additionally, to reduce the time consumption, various tactics are employed, including a feature number constraint, distributed parallelism and sample-wise parallelism. For a batch of microarray datasets, the proposed algorithms are superior to several state-of-the-art multiobjective evolutionary algorithms in terms of both effectiveness and efficiency. (C) 2019 Published by Elsevier B.V.
引用
收藏
页码:952 / 981
页数:30
相关论文
共 50 条
  • [21] Efficient feature selection and classification for microarray data
    Li, Zifa
    Xie, Weibo
    Liu, Tao
    [J]. PLOS ONE, 2018, 13 (08):
  • [22] Spectral Clustering and Feature Selection for Microarray Data
    Garcia-Garcia, Dario
    Santos-Rodriguez, Raul
    [J]. EIGHTH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS, PROCEEDINGS, 2009, : 425 - 428
  • [23] A hybrid feature selection algorithm for microarray data
    Zheng, Yuefeng
    Li, Ying
    Wang, Gang
    Chen, Yupeng
    Xu, Qian
    Fan, Jiahao
    Cui, Xueting
    [J]. JOURNAL OF SUPERCOMPUTING, 2020, 76 (05): : 3494 - 3526
  • [24] Boosting for Feature Selection for Microarray Data Analysis
    Guile, Geoffrey R.
    Wang, Wenjia
    [J]. 2008 IEEE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-8, 2008, : 2559 - 2563
  • [25] MaskedPainter: Feature selection for microarray data analysis
    Apiletti, Daniele
    Baralis, Elena
    Bruno, Giulia
    Fiori, Alessandro
    [J]. INTELLIGENT DATA ANALYSIS, 2012, 16 (04) : 717 - 737
  • [26] A hybrid feature selection algorithm for microarray data
    Yuefeng Zheng
    Ying Li
    Gang Wang
    Yupeng Chen
    Qian Xu
    Jiahao Fan
    Xueting Cui
    [J]. The Journal of Supercomputing, 2020, 76 : 3494 - 3526
  • [27] A Composite Method for Feature Selection of Microarray Data
    Li, Zejun
    Yang, Ang
    Chen, Xia
    Zeng, Lijun
    Cao, Tao
    [J]. JOURNAL OF COMPUTATIONAL AND THEORETICAL NANOSCIENCE, 2014, 11 (02) : 472 - 476
  • [28] A novel parallel feature rank aggregation algorithm for gene selection applied to microarray data classification
    Longkumer, Imtisenla
    Mazumder, Dilwar Hussain
    [J]. COMPUTATIONAL BIOLOGY AND CHEMISTRY, 2024, 112
  • [29] Distributed feature selection (DFS) strategy for microarray gene expression data to improve the classification performance
    Potharaju, Sai Prasad
    Sreedevi, M.
    [J]. CLINICAL EPIDEMIOLOGY AND GLOBAL HEALTH, 2019, 7 (02): : 171 - 176
  • [30] Efficient and Intelligent Feature Selection via Maximum Conditional Mutual Information for Microarray Data
    Zhang, Jiangnan
    Li, Shaojing
    Yang, Huaichuan
    Jiang, Jingtao
    Shi, Hongtao
    [J]. APPLIED SCIENCES-BASEL, 2024, 14 (13):