coreSNP: Parallel Processing of Microarray Data

被引:21
|
作者
Guzzi, Pietro Hiram [1 ]
Agapito, Giuseppe [1 ]
Cannataro, Mario [1 ,2 ]
机构
[1] Magna Graecia Univ Catanzaro, Dept Med & Surg Sci, Catanzaro, Italy
[2] ICAR CNR, Arcavacata Di Rende, Italy
关键词
Bioinformatics (genome or protein) databases; medical information systems; health care; healthcare; distributed programming; statistical software; distributed systems; SINGLE NUCLEOTIDE POLYMORPHISMS; GENES; ASSOCIATION; PLATFORM;
D O I
10.1109/TC.2013.176
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The availability of high-throughput technologies, such as next generation sequencing and microarray, and the diffusion of genomics studies to large populations are producing an increasing amount of experimental data. In particular, pharmacogenomics studies the impact of genetic variation on drug response in patients and correlates gene expression or single nucleotide polymorphisms (SNPs) with the toxicity or efficacy of a drug, with the aim to improve drug therapy with respect to the patients' genotype ensuring maximum efficacy with minimal adverse effects. However, the storage, preprocessing, and analysis of experimental data are becoming a main bottleneck in the pharmacogenomics analysis pipeline, due to the increasing number of genes and patients investigated. This paper presents a new parallel software tool named coreSNP for the parallel preprocessing and statistical analysis of DMET (Drug Metabolism Enzymes and Transporters) SNP microarray data produced by Affymetrix for pharmacogenomics studies. The scalable multi-threaded implementation of coreSNP allows to handle the huge volumes of experimental pharmacogenomics data in a very efficient way, while its easy to use graphical user interface and its ability to annotate significant SNPs allow biologists to interpret the results easily. Performance evaluation conducted using real datasets shows good speed-up and scalability and effective response times.
引用
收藏
页码:2961 / 2974
页数:14
相关论文
共 50 条
  • [1] Parallel algorithm in microarray data processing
    Wang, XJ
    Zhou, GH
    Jia, S
    Hessner, MJ
    PDPTA '04: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED PROCESSING TECHNIQUES AND APPLICATIONS, VOLS 1-3, 2004, : 43 - 48
  • [2] Parallel Pre-processing of Affymetrix Microarray Data
    Guzzi, Pietro Hiram
    Cannataro, Mario
    EURO-PAR 2010 PARALLEL PROCESSING WORKSHOPS, 2011, 6586 : 225 - 232
  • [3] Microarray data analysis and processing
    Wang, YY
    Zhang, YY
    PROGRESS IN BIOCHEMISTRY AND BIOPHYSICS, 2003, 30 (02) : 321 - 323
  • [4] Microarray data processing and analysis
    Dopazo, J
    METHODS OF MICROARRAY DATA ANALYSIS II, 2002, : 43 - 63
  • [5] A parallel processing approach for information extraction from microarray images
    Monwar, Md. Maruf
    Rezaei, Siamak
    2006 CANADIAN CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING, VOLS 1-5, 2006, : 572 - +
  • [6] Parallel Processing of Genomics Data
    Agapito, Giuseppe
    Guzzi, Pietro Hiram
    Cannataro, Mario
    NUMERICAL COMPUTATIONS: THEORY AND ALGORITHMS (NUMTA-2016), 2016, 1776
  • [7] Workflows for microarray data processing in the Kepler environment
    Stropp, Thomas
    McPhillips, Timothy
    Ludaescher, Bertram
    Bieda, Mark
    BMC BIOINFORMATICS, 2012, 13
  • [8] Pre-processing Agilent microarray data
    Zahurak, Marianna
    Parmigiani, Giovanni
    Yu, Wayne
    Scharpf, Robert B.
    Berman, David
    Schaeffer, Edward
    Shabbeer, Shabana
    Cope, Leslie
    BMC BIOINFORMATICS, 2007, 8 (1)
  • [9] Automated determination of optimal microarray data processing
    Scheuermann, Richard
    Kong, Y. Megan
    Wang, Siling
    Qian, Yu
    McGee, Monnie
    JOURNAL OF IMMUNOLOGY, 2010, 184
  • [10] Pre-processing Agilent microarray data
    Marianna Zahurak
    Giovanni Parmigiani
    Wayne Yu
    Robert B Scharpf
    David Berman
    Edward Schaeffer
    Shabana Shabbeer
    Leslie Cope
    BMC Bioinformatics, 8