Distributed feature selection: An application to microarray data classification

被引:126
|
作者
Bolon-Canedo, V. [1 ]
Sanchez-Marono, N. [1 ]
Alonso-Betanzos, A. [1 ]
机构
[1] Univ A Coruna, Dept Comp Sci, Lab Res & Dev Artificial Intelligence LIDIA, La Coruna 15071, Spain
关键词
Feature selection; Distributed learning; Microarray data; ENSEMBLE;
D O I
10.1016/j.asoc.2015.01.035
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Feature selection is often required as a preliminary step for many pattern recognition problems. However, most of the existing algorithms only work in a centralized fashion, i.e. using the whole dataset at once. In this research a new method for distributing the feature selection process is proposed. It distributes the data by features, i.e. according to a vertical distribution, and then performs a merging procedure which updates the feature subset according to improvements in the classification accuracy. The effectiveness of our proposal is tested on microarray data, which has brought a difficult challenge for researchers due to the high number of gene expression contained and the small samples size. The results on eight microarray datasets show that the execution time is considerably shortened whereas the performance is maintained or even improved compared to the standard algorithms applied to the non-partitioned datasets. (C) 2015 Elsevier B.V. All rights reserved.
引用
下载
收藏
页码:136 / 150
页数:15
相关论文
共 50 条
  • [31] Microarray Lung Cancer Data Classification Using Similarity based Feature Selection
    Amrane, Meriem
    Oukid, Saliha
    Ensari, Tolga
    Benblidia, Nadjia
    Orman, Zeynep
    2019 SCIENTIFIC MEETING ON ELECTRICAL-ELECTRONICS & BIOMEDICAL ENGINEERING AND COMPUTER SCIENCE (EBBT), 2019,
  • [32] A Kernel-Based Multivariate Feature Selection Method for Microarray Data Classification
    Sun, Shiquan
    Peng, Qinke
    Shakoor, Adnan
    PLOS ONE, 2014, 9 (07):
  • [33] Feature Selection for Self-Supervised Classification With Applications to Microarray and Sequence Data
    Kung, Sun-Yuan
    Mak, Man-Wai
    IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2008, 2 (03) : 297 - 309
  • [34] Microarray Data Classification Using Feature Selection and Regularized Methods with Sampling Methods
    Jyothi, Saddi
    Reddy, Y. Sowmya
    Lavanya, K.
    UBIQUITOUS INTELLIGENT SYSTEMS, 2022, 302 : 351 - 358
  • [35] CFSES optimization Feature Selection with neural network classification for microarray data analysis
    Patra, Bichitrananda
    Bisoyi, Sudhansu Sekhar
    2ND INTERNATIONAL CONFERENCE ON DATA SCIENCE AND BUSINESS ANALYTICS (ICDSBA 2018), 2018, : 45 - 50
  • [36] DNA microarray data analysis: Effective feature selection for accurate cancer classification
    Patra, Jagdish C.
    Lim, Goh P.
    Meher, Pramod K.
    Ang, Ee Luang
    2007 IEEE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-6, 2007, : 260 - 265
  • [37] Combination of Feature Selection Methods for the Effective Classification of Microarray Gene Expression Data
    Sheela, T.
    Rangarajan, Lalitha
    RECENT TRENDS IN IMAGE PROCESSING AND PATTERN RECOGNITION (RTIP2R 2016), 2017, 709 : 137 - 145
  • [38] A Novel PSO-FLANN Framework of Feature Selection and Classification for Microarray Data
    Parhi, Pournamasi
    Mishra, Debahuti
    Mishra, Sashikala
    Shaw, Kailash
    INTERNATIONAL CONFERENCE ON MODELLING OPTIMIZATION AND COMPUTING, 2012, 38 : 1644 - 1649
  • [39] PRIVACY PRESERVING FEATURE SELECTION AND MULTICLASS CLASSIFICATION FOR HORIZONTALLY DISTRIBUTED DATA
    Lu, Yunmei
    Yan, Mingyuan
    Han, Meng
    Yang, Qingliang
    Zhang, Yanqing
    MATHEMATICAL FOUNDATIONS OF COMPUTING, 2018, 1 (04): : 331 - 348
  • [40] Distributed Fuzzy Cognitive Maps for Feature Selection in Big Data Classification
    Haritha, K.
    Judy, M., V
    Papageorgiou, Konstantinos
    Georgiannis, Vassilis C.
    Papageorgiou, Elpiniki
    ALGORITHMS, 2022, 15 (10)