coreSNP: Parallel Processing of Microarray Data

被引:21
|
作者
Guzzi, Pietro Hiram [1 ]
Agapito, Giuseppe [1 ]
Cannataro, Mario [1 ,2 ]
机构
[1] Magna Graecia Univ Catanzaro, Dept Med & Surg Sci, Catanzaro, Italy
[2] ICAR CNR, Arcavacata Di Rende, Italy
关键词
Bioinformatics (genome or protein) databases; medical information systems; health care; healthcare; distributed programming; statistical software; distributed systems; SINGLE NUCLEOTIDE POLYMORPHISMS; GENES; ASSOCIATION; PLATFORM;
D O I
10.1109/TC.2013.176
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The availability of high-throughput technologies, such as next generation sequencing and microarray, and the diffusion of genomics studies to large populations are producing an increasing amount of experimental data. In particular, pharmacogenomics studies the impact of genetic variation on drug response in patients and correlates gene expression or single nucleotide polymorphisms (SNPs) with the toxicity or efficacy of a drug, with the aim to improve drug therapy with respect to the patients' genotype ensuring maximum efficacy with minimal adverse effects. However, the storage, preprocessing, and analysis of experimental data are becoming a main bottleneck in the pharmacogenomics analysis pipeline, due to the increasing number of genes and patients investigated. This paper presents a new parallel software tool named coreSNP for the parallel preprocessing and statistical analysis of DMET (Drug Metabolism Enzymes and Transporters) SNP microarray data produced by Affymetrix for pharmacogenomics studies. The scalable multi-threaded implementation of coreSNP allows to handle the huge volumes of experimental pharmacogenomics data in a very efficient way, while its easy to use graphical user interface and its ability to annotate significant SNPs allow biologists to interpret the results easily. Performance evaluation conducted using real datasets shows good speed-up and scalability and effective response times.
引用
收藏
页码:2961 / 2974
页数:14
相关论文
共 50 条
  • [31] Parallel processing of continuous data streams
    Buza, A
    INES 2005: 9TH INTERNATIONAL CONFERENCE ON INTELLIGENT ENGINEERING SYSTEMS, 2005, : 225 - 227
  • [32] Pipeline synchronization in data parallel processing
    Zhang, GS
    Li, XM
    CHINESE SCIENCE BULLETIN, 1996, 41 (02): : 163 - 168
  • [33] Pipeline synchronization in data parallel processing
    张冠松
    李晓明
    Chinese Science Bulletin, 1996, (02) : 163 - 168
  • [34] Data Processing Algorithm for Parallel Computing
    Barabanov, Igor
    Barabanova, Elizaveta
    Maltseva, Natalia
    Kvyatkovskaya, Irina
    KNOWLEDGE-BASED SOFTWARE ENGINEERING, JCKBSE 2014, 2014, 466 : 61 - 69
  • [35] Cloud Parallel Genetic Algorithm for Gene Microarray Data Analysis
    Palomino, Rommel A. Benites
    Liang, Lily R.
    2011 23RD IEEE INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2011), 2011, : 932 - 933
  • [36] Parallel Point Symmetry Based Clustering for Gene Microarray Data
    Sarkar, Anasua
    Maulik, Ujjwa
    ICAPR 2009: SEVENTH INTERNATIONAL CONFERENCE ON ADVANCES IN PATTERN RECOGNITION, PROCEEDINGS, 2009, : 351 - 354
  • [37] Gene ordering in microarray data using parallel memetic algorithms
    Mendes, A
    Cotta, C
    Garcia, V
    França, P
    Moscato, P
    2005 INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING WORKSHOPS, PROCEEDINGS, 2005, : 604 - 611
  • [38] Parallel classification and feature selection in microarray data using SPRINT
    Mitchell, Lawrence
    Sloan, Terence M.
    Mewissen, Muriel
    Ghazal, Peter
    Forster, Thorsten
    Piotrowski, Michal
    Trew, Arthur
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2014, 26 (04): : 854 - 865
  • [39] Parallel Data Processing for Sparse Data Tomography Sensors
    Ceballos, Cantoral J. A.
    Ozanyan, K. B.
    2011 IEEE SENSORS, 2011, : 1656 - 1660
  • [40] Parallel processing of layout data with selective data distribution
    Pereira, Mark
    Bhat, Nitin
    Srinivas, Preethi
    PHOTOMASK TECHNOLOGY 2006, PTS 1 AND 2, 2006, 6349