Multiobjective feature selection for microarray data via distributed parallel algorithms

Cited by: 29
Authors
Cao, Bin [1, 2]
Zhao, Jianwei [1, 2]
Yang, Po [3]
Yang, Peng [2]
Liu, Xin [1]
Qi, Jun [3]
Simpson, Andrew [3]
Elhoseny, Mohamed [4]
Mehmoode, Irfan [5]
Muhammad, Khan [6]
Affiliations
[1] Hebei Univ Technol, State Key Lab Reliabil & Intelligence Elect Equip, Tianjin, Peoples R China
[2] Hebei Univ Technol, Sch Artificial Intelligence, Tianjin, Peoples R China
[3] Liverpool John Moores Univ, Dept Comp Sci, Liverpool, Merseyside, England
[4] Mansoura Univ, Fac Computers & Informat, Mansoura, Egypt
[5] Univ Bradford, Dept Media Design & Technol, Fac Engn & Informat, Bradford BD7 1DP, W Yorkshire, England
[6] Sejong Univ, Dept Software, Seoul 143747, South Korea
Keywords
Microarray dataset; High dimension; Multiobjective feature selection; Distributed parallelism; Feature redundancy; DIFFERENTIAL EVOLUTION; OPTIMIZATION; CLASSIFICATION
DOI
10.1016/j.future.2019.02.030
Chinese Library Classification number
TP301 [Theory and methods]
Discipline classification code
081202
Abstract
Many real-world problems are large in scale and hence difficult to address. Due to the large number of features in microarray datasets, feature selection and classification are even more challenging for such datasets. Not all of these numerous features contribute to the classification task, and some even impede performance. Through feature selection, a feature subset that contains only a small quantity of essential features can be generated to increase the classification accuracy and significantly reduce the time consumption. In this paper, we construct a multiobjective feature selection model that simultaneously considers the classification error, the feature number and the feature redundancy. For this model, we propose several distributed parallel algorithms based on different encodings and an adaptive strategy. Additionally, to reduce the time consumption, various tactics are employed, including a feature number constraint, distributed parallelism and sample-wise parallelism. For a batch of microarray datasets, the proposed algorithms are superior to several state-of-the-art multiobjective evolutionary algorithms in terms of both effectiveness and efficiency. (C) 2019 Published by Elsevier B.V.
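The three objectives named in the abstract (classification error, feature number, feature redundancy) can be illustrated with a minimal sketch. This is not the authors' formulation: the abstract does not specify the classifier or the redundancy measure, so the leave-one-out 1-nearest-neighbour error and the mean absolute Pearson correlation used here are assumptions, and all function names are hypothetical.

```python
from itertools import combinations
from math import sqrt


def pearson(a, b):
    """Pearson correlation of two equal-length sequences (0.0 if either is constant)."""
    n = len(a)
    ma, mb = sum(a) / n, sum(b) / n
    cov = sum((x - ma) * (y - mb) for x, y in zip(a, b))
    va = sum((x - ma) ** 2 for x in a)
    vb = sum((y - mb) ** 2 for y in b)
    return cov / sqrt(va * vb) if va > 0 and vb > 0 else 0.0


def loo_1nn_error(rows, labels):
    """Leave-one-out error of a 1-nearest-neighbour classifier (assumed stand-in)."""
    wrong = 0
    for i, r in enumerate(rows):
        # Nearest other sample by squared Euclidean distance.
        nearest = min(
            (j for j in range(len(rows)) if j != i),
            key=lambda j: sum((p - q) ** 2 for p, q in zip(r, rows[j])),
        )
        wrong += labels[nearest] != labels[i]
    return wrong / len(rows)


def objectives(mask, X, y):
    """Return (classification error, feature count, redundancy) for a 0/1 mask."""
    selected = [k for k, bit in enumerate(mask) if bit]
    if not selected:
        return (1.0, 0, 0.0)  # assumption: an empty subset gets worst-case error
    rows = [[row[k] for k in selected] for row in X]
    cols = [[row[k] for row in X] for k in selected]
    pairs = list(combinations(cols, 2))
    # Assumed redundancy measure: mean |Pearson r| over selected feature pairs.
    redundancy = sum(abs(pearson(a, b)) for a, b in pairs) / len(pairs) if pairs else 0.0
    return (loo_1nn_error(rows, y), len(selected), redundancy)


def dominates(f, g):
    """True if objective vector f Pareto-dominates g (all objectives minimised)."""
    return all(a <= b for a, b in zip(f, g)) and any(a < b for a, b in zip(f, g))
```

A multiobjective evolutionary algorithm would call `objectives` on each candidate mask and keep the masks that no other candidate dominates. Because each mask is evaluated independently, the evaluations can be spread across workers, which is the general idea behind the distributed parallelism the abstract mentions.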
Pages: 952-981
Page count: 30
Related papers
50 in total
  • [31] Best Feature Selection for Horizontally Distributed Private Biomedical Data Based on Genetic Algorithms
    Tarik, Boudheb
    Zakaria, Elberrichi
    INTERNATIONAL JOURNAL OF DISTRIBUTED SYSTEMS AND TECHNOLOGIES, 2019, 10 (03) : 37 - 57
  • [32] Parallel Asynchronous Strategies for the Execution of Feature Selection Algorithms
    Silva, Jorge
    Aguiar, Ana
    Silva, Fernando
    INTERNATIONAL JOURNAL OF PARALLEL PROGRAMMING, 2018, 46 (02) : 252 - 283
  • [33] Parallel Asynchronous Strategies for the Execution of Feature Selection Algorithms
    Jorge Silva
    Ana Aguiar
    Fernando Silva
    International Journal of Parallel Programming, 2018, 46 : 252 - 283
  • [34] Multiobjective recommendation optimization via utilizing distributed parallel algorithm
    Cao, Bin
    Zhao, Jianwei
    Liu, Xin
    Kang, Xinyuan
    Yang, Shan
    Kang, Kai
    Yu, Ming
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2018, 86 : 1259 - 1268
  • [35] Hadoop neural network for parallel and distributed feature selection
    Hodge, Victoria J.
    O'Keefe, Simon
    Austin, Jim
    NEURAL NETWORKS, 2016, 78 : 24 - 35
  • [36] Parallel feature selection for distributed-memory clusters
    Gonzalez-Dominguez, Jorge
    Bolon-Canedo, Veronica
    Freire, Borja
    Tourino, Juan
    INFORMATION SCIENCES, 2019, 496 : 399 - 409
  • [37] A hybrid feature selection method for DNA microarray data
    Chuang, Li-Yeh
    Yang, Cheng-Huei
    Wu, Kuo-Chuan
    Yang, Cheng-Hong
    COMPUTERS IN BIOLOGY AND MEDICINE, 2011, 41 (04) : 228 - 237
  • [38] Graph Based Unsupervised Feature Selection for Microarray Data
    Swarnkar, Tripti
    Mitra, Pabitra
    2012 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE WORKSHOPS (BIBMW), 2012,
  • [39] Feature Selection for Cancer Classification on Microarray Expression Data
    Hsu, Hui-Huang
    Lu, Ming-Da
    ISDA 2008: EIGHTH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS DESIGN AND APPLICATIONS, VOL 3, PROCEEDINGS, 2008, : 153 - 158
  • [40] Comparative study of feature selection methods on microarray data
    Miyamoto, T
    Uchimura, S
    Hamamoto, Y
    Iizuka, N
    Oka, M
    Yamada-Okabe, H
    IEEE EMBS APBME 2003, 2003, : 82 - 83