A taxonomy of accelerator architectures and their programming models

被引:17
|
作者
Cascaval, C. [1 ]
Chatterjee, S. [2 ,3 ]
Franke, H. [5 ]
Gildea, K. J. [4 ]
Pattnaik, P. [5 ]
机构
[1] Qualcomm Res, Santa Clara, CA 95051 USA
[2] IBM Syst & Technol Grp, Austin, TX USA
[3] RIACS, Mountain View, CA USA
[4] IBM Syst & Technol Grp, Yorktown Hts, NY 10598 USA
[5] IBM Res Div, Thomas J Watson Res Ctr, Yorktown Hts, NY 10598 USA
关键词
D O I
10.1147/JRD.2010.2059721
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
As the clock frequency of silicon chips is leveling off, the computer architecture community is looking for different solutions to continue application performance scaling. One such solution is the multicore approach, i.e., using multiple simple cores that enable higher performance than wide superscalar processors, provided that the workload can exploit the parallelism. Another emerging alternative is the use of customized designs (accelerators) at different levels within the system. These are specialized functional units integrated with the core, specialized cores, attached processors, or attached appliances. The design tradeoff is quite compelling because current processor chips have billions of transistors, but they cannot all be activated or switched at the same time at high frequencies. Specialized designs provide increased power efficiency but cannot be used as general-purpose compute engines. Therefore, architects trade area for power efficiency by placing in the design additional units that are known to be active at different times. The resulting system is a heterogeneous architecture, with the potential of specialized execution that accelerates different workloads. While designing and building such hardware systems is attractive, writing and porting software to a heterogeneous platform is even more challenging than parallelism for homogeneous multicore systems. In this paper, we propose a taxonomy that allows us to define classes of accelerators, with the goal of focusing on a small set of programming models for accelerators. We discuss several types of currently popular accelerators and identify challenges to exploiting such accelerators in current software stacks. This paper serves as a guide for both hardware designers by providing them with a view on how software best exploits specialization and software programmers by focusing research efforts to address parallelism and heterogeneity.
引用
收藏
页数:10
相关论文
共 50 条
  • [31] ARCHITECTURES, PROGRAMMING AND PERFORMANCE OF SUPERCOMPUTERS
    VOLKERT, J
    KERNTECHNIK, 1988, 52 (02) : 112 - 119
  • [32] Parallel Programming for Heterogeneous Architectures
    Krammer, Bettina
    Mix, Hartmut
    Geimer, Markus
    PARALLEL COMPUTING: ACCELERATING COMPUTATIONAL SCIENCE AND ENGINEERING (CSE), 2014, 25 : 731 - 732
  • [33] A TAXONOMY OF MODELS
    REDGRAVE, MJ
    OPERATIONS RESEARCH, 1961, 9 : B40 - B40
  • [34] A Comparative Study and Evaluation of Parallel Programming Models for Shared-Memory Parallel Architectures
    Miguel Sanchez, Luis
    Fernandez, Javier
    Sotomayor, Rafael
    Escolar, Soledad
    Daniel Garcia, J.
    NEW GENERATION COMPUTING, 2013, 31 (03) : 139 - 161
  • [35] Power Monitoring with PAPI for Extreme Scale Architectures and Dataflow-based Programming Models
    McCraw, Heike
    Ralph, James
    Danalis, Anthony
    Dongarra, Jack
    2014 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING (CLUSTER), 2014, : 385 - 391
  • [36] A Methodology Approach to Compare Performance of Parallel Programming Models for Shared-Memory Architectures
    Utrera, Gladys
    Gil, Marisa
    Martorell, Xavier
    NUMERICAL COMPUTATIONS: THEORY AND ALGORITHMS, PT I, 2020, 11973 : 318 - 325
  • [37] Evaluation of Directive-Based Programming Models for Stencil Computation on Current GPGPU Architectures
    Shan, Baodi
    Araya-Polo, Mauricio
    Chapman, Barbara
    ADVANCING OPENMP FOR FUTURE ACCELERATORS, IWOMP 2024, 2024, 15195 : 126 - 140
  • [38] A Comparative Study and Evaluation of Parallel Programming Models for Shared-Memory Parallel Architectures
    Luis Miguel Sanchez
    Javier Fernandez
    Rafael Sotomayor
    Soledad Escolar
    J. Daniel. Garcia
    New Generation Computing, 2013, 31 : 139 - 161
  • [39] HotTiles: Accelerating SpMM with Heterogeneous Accelerator Architectures
    Gerogiannis, Gerasimos
    Aananthakrishnan, Sriram
    Torrellas, Josep
    Hur, Ibrahim
    2024 IEEE INTERNATIONAL SYMPOSIUM ON HIGH-PERFORMANCE COMPUTER ARCHITECTURE, HPCA 2024, 2024, : 1012 - 1028
  • [40] MachSuite: Benchmarks for Accelerator Design and Customized Architectures
    Reagen, Brandon
    Adolf, Robert
    Shao, Yakun Sophia
    Wei, Gu-Yeon
    Brooks, David
    2014 IEEE INTERNATIONAL SYMPOSIUM ON WORKLOAD CHARACTERIZATION (IISWC), 2014, : 110 - 119