Accelerating Phylogenetic Inference on GPUs: an OpenACC and CUDA comparison

被引:0
|
作者
Kuan, Lidia [1 ]
Neves, Joao [1 ]
Pratas, Frederico [2 ]
Tomas, Pedro [1 ]
Sousa, Leonel [1 ]
机构
[1] Univ Lisbon, IST, INESC ID, P-1699 Lisbon, Portugal
[2] Intel Barcelona Res Ctr, Intel Labs, Barcelona, Spain
关键词
MrBayes; CUDA; OpenACC; Phylogenetic Inference; MRBAYES; TREES; MODEL;
D O I
暂无
中图分类号
R318 [生物医学工程];
学科分类号
0831 ;
摘要
Phylogenetic inference is used to derive a "tree of life" for a collection of species whose DNA sequences are known. While several software packages have already been developed to take advantage of GPUs to accelerate phylogenetic inference, they typically require significant changes to the original code, constraining code maintenance. Recently, the OpenACC API was proposed to minimize the programming efforts on accelerator devices. In this work we evaluate the applicability of the OpenACC API for phylogenetic inference using the most recent MrBayes program (version 3.2.2). A new parallelization strategy is proposed that is specifically adapted to the latest version of MrBayes and minimizes the data transfers between the host (CPU) and the accelerating device (GPU). We further implement the proposed strategy using both the OpenACC and CUDA programming frameworks. Experimental results demonstrate that significant performance gains can be achieved using OpenACC with a reduced amount of programming effort. Comparing with state-of-art GPU's implementations, the proposed OpenACC and CUDA implementations achieve a performance gain of up to 5.2x and 5.7x, respectively. Experimental results indicate that with a reduced amount of programming effort, we achieve a performance that is only 10% inferior to one obtained with CUDA, which uses device specific optimizations.
引用
收藏
页码:589 / 600
页数:12
相关论文
共 50 条
  • [1] Accelerating Hydrocodes with OpenACC, OpenCL and CUDA
    Herdman, J. A.
    Gaudin, W. P.
    McIntosh-Smith, S.
    Boulton, M.
    Beckingsale, D. A.
    Mallinson, A. C.
    Jarvis, S. A.
    [J]. 2012 SC COMPANION: HIGH PERFORMANCE COMPUTING, NETWORKING, STORAGE AND ANALYSIS (SCC), 2012, : 465 - 471
  • [2] Nekbone performance on GPUs with OpenACC and CUDA Fortran implementations
    Jing Gong
    Stefano Markidis
    Erwin Laure
    Matthew Otten
    Paul Fischer
    Misun Min
    [J]. The Journal of Supercomputing, 2016, 72 : 4160 - 4180
  • [3] Nekbone performance on GPUs with OpenACC and CUDA Fortran implementations
    Gong, Jing
    Markidis, Stefano
    Laure, Erwin
    Otten, Matthew
    Fischer, Paul
    Min, Misun
    [J]. JOURNAL OF SUPERCOMPUTING, 2016, 72 (11): : 4160 - 4180
  • [4] An Early Performance Comparison of CUDA and OpenACC
    Li, Xuechao
    Shih, Po-Chou
    [J]. 2018 3RD INTERNATIONAL CONFERENCE ON MEASUREMENT INSTRUMENTATION AND ELECTRONICS (ICMIE 2018), 2018, 208
  • [5] Accelerating geospatial analysis on GPUs using CUDA
    Yingjie XIA Li KUANG Xiumei LI Hangzhou Institute of Service EngineeringHangzhou Normal UniversityHangzhou China Department of AutomationSchool of Electronic Information and Electrical EngineeringShanghai Jiao Tong UniversityShanghai China Provincial Key Laboratory for Computer Information Processing TechnologySoochow UniversitySoochow China
    [J]. Journal of Zhejiang University-Science C(Computers & Electronics)., 2011, 12 (12) - 999
  • [6] Accelerating geospatial analysis on GPUs using CUDA
    Ying-jie XIA 1
    [J]. Frontiers of Information Technology & Electronic Engineering, 2011, (12) : 990 - 999
  • [7] Accelerating geospatial analysis on GPUs using CUDA
    Xia, Ying-jie
    Kuang, Li
    Li, Xiu-mei
    [J]. JOURNAL OF ZHEJIANG UNIVERSITY-SCIENCE C-COMPUTERS & ELECTRONICS, 2011, 12 (12): : 990 - 999
  • [8] Accelerating geospatial analysis on GPUs using CUDA
    Ying-jie Xia
    Li Kuang
    Xiu-mei Li
    [J]. Journal of Zhejiang University SCIENCE C, 2011, 12 : 990 - 999
  • [9] Performance Comparison of CUDA and OpenACC Based on Optimizations
    Li, Xuechao
    Shih, Po-Chou
    [J]. PROCEEDINGS OF THE 2018 2ND HIGH PERFORMANCE COMPUTING AND CLUSTER TECHNOLOGIES CONFERENCE (HPCCT 2018), 2018, : 53 - 57
  • [10] Accelerating prediction of chemical shift of protein structures on GPUs: Using OpenACC
    Wright, Eric
    Ferrato, Mauricio H.
    Bryer, Alexander J.
    Searles, Robert
    Perilla, Juan R.
    Chandrasekaran, Sunita
    [J]. PLOS COMPUTATIONAL BIOLOGY, 2020, 16 (05)