Fine-grain parallelism using multi-core, Cell/BE, and GPU Systems

Cited by: 17
Authors
[1] Pratas, Frederico
[2] Trancoso, Pedro
[3] Sousa, Leonel
[4] Stamatakis, Alexandros
[5] Shi, Guochun
[6] Kindratenko, Volodymyr
Keywords
Multi-core processors; Multi-core accelerators; Performance evaluation; Fine-grain parallelism; Scientific workloads; Database workloads; DNA-SEQUENCES; GRAPHICS; PERFORMANCE; INFERENCE; DYNAMICS
DOI
10.1016/j.parco.2011.08.002
Chinese Library Classification (CLC) number
TP301 [Theory, methods]
Discipline classification code
081202
Abstract
Currently, we are facing a situation where applications exhibit increasing computational demands and where a large variety of parallel processor systems are available. In this paper we focus on exploiting fine-grain parallelism for three applications with distinct characteristics: a Bioinformatics application (MrBayes), a Molecular Dynamics application (NAMD), and a database application (TPC-H). We assess, side-by-side, the performance of the three applications on general-purpose multi-core processors, the Cell Broadband Engine (Cell/BE), and Graphics Processing Units (GPU). Our results indicate that application performance depends on the characteristics of the parallel architectures and on the computational requirements of the core functions of the respective applications. For MrBayes the best overall performance is achieved on general-purpose multi-core processors, for NAMD on the Cell/BE, and for TPC-H on GPUs. © 2011 Elsevier B.V. All rights reserved.
Pages: 365-390
Page count: 26