Large-scale genome-wide association studies on a GPU cluster using a CUDA-accelerated PGAS programming model

被引:10
|
作者
Gonzalez-Dominguez, Jorge [1 ]
Kaessens, Jan Christian [2 ]
Wienbrandt, Lars [2 ]
Schmidt, Bertil [1 ]
机构
[1] Johannes Gutenberg Univ Mainz, Parallel & Distributed Architectures Grp, D-55128 Mainz, Germany
[2] Univ Kiel, Dept Comp Sci, Kiel, Germany
基金
英国惠康基金;
关键词
PGAS; CUDA; GPU; UPC plus; bioinformatics;
D O I
10.1177/1094342015585846
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Detecting epistasis, such as 2-SNP interactions, in genome-wide association studies (GWAS) is an important but time consuming operation. Consequently, GPUs have already been used to accelerate these studies, reducing the runtime for moderately-sized datasets to less than 1 hour. However, single-GPU approaches cannot perform large-scale GWAS in reasonable time. In this work we present multiEpistSearch, a tool to detect epistasis that works on GPU clusters. While CUDA is used for parallelization within each GPU, the workload distribution among GPUs is performed with Unified Parallel C++ (UPC++), a novel extension of C++ that follows the Partitioned Global Address Space (PGAS) model. multiEpistSearch is able to analyze large-scale datasets with 5 million SNPs from 10,000 individuals in less than 3 hours using 24 NVIDIA GTX Titans.
引用
收藏
页码:506 / 510
页数:5
相关论文
共 50 条
  • [1] Genome-wide association studies and large-scale collaborations in epidemiology
    Psaty, Bruce M.
    Hofman, Albert
    [J]. EUROPEAN JOURNAL OF EPIDEMIOLOGY, 2010, 25 (08) : 525 - 529
  • [2] Genome-wide association studies and large-scale collaborations in epidemiology
    Bruce M. Psaty
    Albert Hofman
    [J]. European Journal of Epidemiology, 2010, 25 : 525 - 529
  • [3] Secure large-scale genome-wide association studies using homomorphic encryption
    Blatt, Marcelo
    Gusev, Alexander
    Polyakov, Yuriy
    Goldwasser, Shafi
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2020, 117 (21) : 11608 - 11613
  • [4] Large-scale Object Recognition with CUDA-accelerated Hierarchical Neural Networks
    Uetz, Rafael
    Behnke, Sven
    [J]. 2009 IEEE INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING AND INTELLIGENT SYSTEMS, PROCEEDINGS, VOL 1, 2009, : 536 - 541
  • [5] CUDA-LR : CUDA-accelerated Logistic Regression Analysis Tool for Gene-Gene Interaction for Genome-Wide Association Study
    Lee, Sungyoung
    Kwon, Min-Seok
    Huh, Ik-Soo
    Park, Taesung
    [J]. 2011 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE WORKSHOPS, 2011, : 691 - 695
  • [6] Large-Scale Genome-Wide Association Studies Consortia Blessing, Burden, or Necessity?
    Ingelsson, Erik
    [J]. CIRCULATION-CARDIOVASCULAR GENETICS, 2010, 3 (05) : 396 - 398
  • [7] A new way to protect privacy in large-scale genome-wide association studies
    Kamm, Liina
    Bogdanov, Dan
    Laur, Sven
    Vilo, Jaak
    [J]. BIOINFORMATICS, 2013, 29 (07) : 886 - 893
  • [8] Conditional Random Fields for Fast, Large-Scale Genome-Wide Association Studies
    Huang, Jim C.
    Meek, Christopher
    Kadie, Carl
    Heckerman, David
    [J]. PLOS ONE, 2011, 6 (07):
  • [9] Efficient mixed model approach for large-scale genome-wide association studies of ordinal categorical phenotypes
    Bi, Wenjian
    Zhou, Wei
    Dey, Rounak
    Mukherjee, Bhramar
    Sampson, Joshua N.
    Lee, Seunggeun
    [J]. AMERICAN JOURNAL OF HUMAN GENETICS, 2021, 108 (05) : 825 - 839
  • [10] GPU-Accelerated Large-Scale Genome Assembly
    Goswami, Sayan
    Lee, Kisung
    Shams, Shayan
    Park, Seung-Jong
    [J]. 2018 32ND IEEE INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM (IPDPS), 2018, : 814 - 824