Distributed feature selection: An application to microarray data classification

被引:126
|
作者
Bolon-Canedo, V. [1 ]
Sanchez-Marono, N. [1 ]
Alonso-Betanzos, A. [1 ]
机构
[1] Univ A Coruna, Dept Comp Sci, Lab Res & Dev Artificial Intelligence LIDIA, La Coruna 15071, Spain
关键词
Feature selection; Distributed learning; Microarray data; ENSEMBLE;
D O I
10.1016/j.asoc.2015.01.035
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Feature selection is often required as a preliminary step for many pattern recognition problems. However, most of the existing algorithms only work in a centralized fashion, i.e. using the whole dataset at once. In this research a new method for distributing the feature selection process is proposed. It distributes the data by features, i.e. according to a vertical distribution, and then performs a merging procedure which updates the feature subset according to improvements in the classification accuracy. The effectiveness of our proposal is tested on microarray data, which has brought a difficult challenge for researchers due to the high number of gene expression contained and the small samples size. The results on eight microarray datasets show that the execution time is considerably shortened whereas the performance is maintained or even improved compared to the standard algorithms applied to the non-partitioned datasets. (C) 2015 Elsevier B.V. All rights reserved.
引用
收藏
页码:136 / 150
页数:15
相关论文
共 50 条
  • [1] Efficient feature selection and classification for microarray data
    Li, Zifa
    Xie, Weibo
    Liu, Tao
    [J]. PLOS ONE, 2018, 13 (08):
  • [2] Comparison of population based metaheuristics for feature selection:: Application to microarray data classification
    Talbi, E-G.
    Jourdan, L.
    Garcia-Nieto, J.
    Alba, E.
    [J]. 2008 IEEE/ACS INTERNATIONAL CONFERENCE ON COMPUTER SYSTEMS AND APPLICATIONS, VOLS 1-3, 2008, : 45 - +
  • [3] Feature Selection for Cancer Classification on Microarray Expression Data
    Hsu, Hui-Huang
    Lu, Ming-Da
    [J]. ISDA 2008: EIGHTH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS DESIGN AND APPLICATIONS, VOL 3, PROCEEDINGS, 2008, : 153 - 158
  • [4] Distributed feature selection (DFS) strategy for microarray gene expression data to improve the classification performance
    Potharaju, Sai Prasad
    Sreedevi, M.
    [J]. CLINICAL EPIDEMIOLOGY AND GLOBAL HEALTH, 2019, 7 (02): : 171 - 176
  • [5] Exploring the consequences of distributed feature selection in DNA microarray data
    Bolon-Canedo, Veronica
    Sechidis, Konstantinos
    Sanchez-Marono, Noelia
    Alonso-Betanzos, Amparo
    Brown, Gavin
    [J]. 2017 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2017, : 1665 - 1672
  • [6] Feature selection in independent component subspace for microarray data classification
    Zheng, Chun-Hou
    huang, De-S Huang
    Shang, Li
    [J]. NEUROCOMPUTING, 2006, 69 (16-18) : 2407 - 2410
  • [7] Feature selection using differential evolution for microarray data classification
    Prajapati S.
    Das H.
    Gourisaria M.K.
    [J]. Discover Internet of Things, 2023, 3 (01):
  • [8] Stable feature selection and classification algorithms for multiclass microarray data
    Student, Sebastian
    Fujarewicz, Krzysztof
    [J]. BIOLOGY DIRECT, 2012, 7
  • [9] Parallel classification and feature selection in microarray data using SPRINT
    Mitchell, Lawrence
    Sloan, Terence M.
    Mewissen, Muriel
    Ghazal, Peter
    Forster, Thorsten
    Piotrowski, Michal
    Trew, Arthur
    [J]. CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2014, 26 (04): : 854 - 865
  • [10] An enhanced feature selection filter for classification of microarray cancer data
    Mazumder, Dilwar Hussain
    Veilumuthu, Ramachandran
    [J]. ETRI JOURNAL, 2019, 41 (03) : 358 - 370