Parallel classification and feature selection in microarray data using SPRINT

被引:10
|
作者
Mitchell, Lawrence [1 ]
Sloan, Terence M. [1 ]
Mewissen, Muriel [2 ]
Ghazal, Peter [2 ]
Forster, Thorsten [2 ]
Piotrowski, Michal [1 ]
Trew, Arthur [1 ]
机构
[1] Univ Edinburgh, Sch Phys & Astron, EPCC, Edinburgh EH9 3JZ, Midlothian, Scotland
[2] Univ Edinburgh, Sch Med, Div Pathway Med, Edinburgh EH16 4SB, Midlothian, Scotland
来源
基金
英国生物技术与生命科学研究理事会; 英国惠康基金; 英国工程与自然科学研究理事会;
关键词
HIGH-DIMENSIONAL DATA; GENE-EXPRESSION; BIOINFORMATICS; SPACES;
D O I
10.1002/cpe.2928
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
The statistical language R is favoured by many biostatisticians for processing microarray data. In recent times, the quantity of data that can be obtained in experiments has risen significantly, making previously fast analyses time consuming or even not possible at all with the existing software infrastructure. High performance computing (HPC) systems offer a solution to these problems but at the expense of increased complexity for the end user. The Simple Parallel R Interface is a library for R that aims to reduce the complexity of using HPC systems by providing biostatisticians with drop-in parallelised replacements of existing R functions. In this paper we describe parallel implementations of two popular techniques: exploratory clustering analyses using the random forest classifier and feature selection through identification of differentially expressed genes using the rank product method. Copyright © 2012 John Wiley & Sons, Ltd.
引用
收藏
页码:854 / 865
页数:12
相关论文
共 50 条
  • [41] A Kernel-Based Multivariate Feature Selection Method for Microarray Data Classification
    Sun, Shiquan
    Peng, Qinke
    Shakoor, Adnan
    [J]. PLOS ONE, 2014, 9 (07):
  • [42] Feature Selection for Self-Supervised Classification With Applications to Microarray and Sequence Data
    Kung, Sun-Yuan
    Mak, Man-Wai
    [J]. IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2008, 2 (03) : 297 - 309
  • [43] CFSES optimization Feature Selection with neural network classification for microarray data analysis
    Patra, Bichitrananda
    Bisoyi, Sudhansu Sekhar
    [J]. 2ND INTERNATIONAL CONFERENCE ON DATA SCIENCE AND BUSINESS ANALYTICS (ICDSBA 2018), 2018, : 45 - 50
  • [44] Feature Selection and Classification of Microarray Data for Cancer Prediction Using MapReduce Implementation of Random Forest Algorithm
    Dhanalakshmi, R.
    Khaire, Utkarsh M.
    [J]. JOURNAL OF SCIENTIFIC & INDUSTRIAL RESEARCH, 2019, 78 (03): : 158 - 161
  • [45] DNA microarray data analysis: Effective feature selection for accurate cancer classification
    Patra, Jagdish C.
    Lim, Goh P.
    Meher, Pramod K.
    Ang, Ee Luang
    [J]. 2007 IEEE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-6, 2007, : 260 - 265
  • [46] A Novel PSO-FLANN Framework of Feature Selection and Classification for Microarray Data
    Parhi, Pournamasi
    Mishra, Debahuti
    Mishra, Sashikala
    Shaw, Kailash
    [J]. INTERNATIONAL CONFERENCE ON MODELLING OPTIMIZATION AND COMPUTING, 2012, 38 : 1644 - 1649
  • [47] Combination of Feature Selection Methods for the Effective Classification of Microarray Gene Expression Data
    Sheela, T.
    Rangarajan, Lalitha
    [J]. RECENT TRENDS IN IMAGE PROCESSING AND PATTERN RECOGNITION (RTIP2R 2016), 2017, 709 : 137 - 145
  • [48] Support Vector Machine Ensembles Using Feature-Subset Selection for Enhancing Microarray Data Classification
    Ahmed, Eman
    El Gayar, Neamat
    El Azab, Iman A.
    [J]. INTERNATIONAL JOURNAL OF APPLIED MATHEMATICS & STATISTICS, 2012, 28 (04): : 1 - 11
  • [49] Feature Selection and Classification of Microarray Data using MapReduce based ANOVA and K-Nearest Neighbor
    Kumar, Mukesh
    Rath, Nitish Kumar
    Swain, Amitav
    Rath, Santanu Kumar
    [J]. ELEVENTH INTERNATIONAL CONFERENCE ON COMMUNICATION NETWORKS, ICCN 2015/INDIA ELEVENTH INTERNATIONAL CONFERENCE ON DATA MINING AND WAREHOUSING, ICDMW 2015/NDIA ELEVENTH INTERNATIONAL CONFERENCE ON IMAGE AND SIGNAL PROCESSING, ICISP 2015, 2015, 54 : 301 - 310
  • [50] Genetic algorithm-based feature selection with manifold learning for cancer classification using microarray data
    Zixuan Wang
    Yi Zhou
    Tatsuya Takagi
    Jiangning Song
    Yu-Shi Tian
    Tetsuo Shibuya
    [J]. BMC Bioinformatics, 24