Multiobjective feature selection for microarray data via distributed parallel algorithms

被引:29
|
作者
Cao, Bin [1 ,2 ]
Zhao, Jianwei [1 ,2 ]
Yang, Po [3 ]
Yang, Peng [2 ]
Liu, Xin [1 ]
Qi, Jun [3 ]
Simpson, Andrew [3 ]
Elhoseny, Mohamed [4 ]
Mehmoode, Irfan [5 ]
Muhammad, Khan [6 ]
机构
[1] Hebei Univ Technol, State Key Lab Reliabil & Intelligence Elect Equip, Tianjin, Peoples R China
[2] Hebei Univ Technol, Sch Artificial Intelligence, Tianjin, Peoples R China
[3] Liverpool John Moores Univ, Dept Comp Sci, Liverpool, Merseyside, England
[4] Mansoura Univ, Fac Computers & Informat, Mansoura, Egypt
[5] Univ Bradford, Dept Media Design & Technol, Fac Engn & Informat, Bradford BD7 1DP, W Yorkshire, England
[6] Sejong Univ, Dept Software, Seoul 143747, South Korea
关键词
Microarray dataset; High dimension; Multiobjective feature selection; Distributed parallelism; Feature redundancy; DIFFERENTIAL EVOLUTION; OPTIMIZATION; CLASSIFICATION;
D O I
10.1016/j.future.2019.02.030
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Many real-world problems are large in scale and hence difficult to address. Due to the large number of features in microarray datasets, feature selection and classification are even more challenging for such datasets. Not all of these numerous features contribute to the classification task, and some even impede performance. Through feature selection, a feature subset that contains only a small quantity of essential features can be generated to increase the classification accuracy and significantly reduce the time consumption. In this paper, we construct a multiobjective feature selection model that simultaneously considers the classification error, the feature number and the feature redundancy. For this model, we propose several distributed parallel algorithms based on different encodings and an adaptive strategy. Additionally, to reduce the time consumption, various tactics are employed, including a feature number constraint, distributed parallelism and sample-wise parallelism. For a batch of microarray datasets, the proposed algorithms are superior to several state-of-the-art multiobjective evolutionary algorithms in terms of both effectiveness and efficiency. (C) 2019 Published by Elsevier B.V.
引用
收藏
页码:952 / 981
页数:30
相关论文
共 50 条
  • [1] Comparing Multiobjective Evolutionary Algorithms for Cancer Data Microarray Feature Selection
    Sol Dussaut, Julieta
    Javier Vidal, Pablo
    Ponzoni, Ignacio
    Carolina Olivera, Ana
    [J]. 2018 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2018, : 149 - 156
  • [2] Memetic algorithms for feature selection on microarray data
    Zhu, Zexuan
    Ong, Yew-Soon
    [J]. ADVANCES IN NEURAL NETWORKS - ISNN 2007, PT 1, PROCEEDINGS, 2007, 4491 : 1327 - +
  • [3] Distributed feature selection: An application to microarray data classification
    Bolon-Canedo, V.
    Sanchez-Marono, N.
    Alonso-Betanzos, A.
    [J]. APPLIED SOFT COMPUTING, 2015, 30 : 136 - 150
  • [4] Stable feature selection and classification algorithms for multiclass microarray data
    Sebastian Student
    Krzysztof Fujarewicz
    [J]. Biology Direct, 7
  • [5] Stable feature selection and classification algorithms for multiclass microarray data
    Student, Sebastian
    Fujarewicz, Krzysztof
    [J]. BIOLOGY DIRECT, 2012, 7
  • [6] Parallel classification and feature selection in microarray data using SPRINT
    Mitchell, Lawrence
    Sloan, Terence M.
    Mewissen, Muriel
    Ghazal, Peter
    Forster, Thorsten
    Piotrowski, Michal
    Trew, Arthur
    [J]. CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2014, 26 (04): : 854 - 865
  • [7] Exploring the consequences of distributed feature selection in DNA microarray data
    Bolon-Canedo, Veronica
    Sechidis, Konstantinos
    Sanchez-Marono, Noelia
    Alonso-Betanzos, Amparo
    Brown, Gavin
    [J]. 2017 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2017, : 1665 - 1672
  • [8] Fast feature selection from microarray expression data via multiplicative large margin algorithms
    Gentile, C
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 16, 2004, 16 : 121 - 128
  • [9] Multiobjective Evolutionary Algorithms applied to Feature Selection in Microarrays Cancer Data
    Dussaut, J. S.
    Ponzoni, I
    Olivera, A. C.
    Vidal, P. J.
    [J]. ENTRE CIENCIA E INGENIERIA, 2020, 14 (28): : 40 - 45
  • [10] A Study of Metaheuristic Algorithms for High Dimensional Feature Selection on Microarray Data
    Dankolo, Muhammad Nasiru
    Radzi, Nor Haizan Mohamed
    Sallehuddin, Roselina
    Mustaffa, Noorfa Haszlinna
    [J]. 13TH IMT-GT INTERNATIONAL CONFERENCE ON MATHEMATICS, STATISTICS AND THEIR APPLICATIONS (ICMSA2017), 2017, 1905