A taxonomy of accelerator architectures and their programming models

Cited by: 17

Authors:

Cascaval, C. ^{[1]}

Chatterjee, S. ^{[2,3]}

Franke, H. ^{[5]}

Gildea, K. J. ^{[4]}

Pattnaik, P. ^{[5]}

Affiliations:

[1] Qualcomm Res, Santa Clara, CA 95051 USA

[2] IBM Syst & Technol Grp, Austin, TX USA

[3] RIACS, Mountain View, CA USA

[4] IBM Syst & Technol Grp, Yorktown Hts, NY 10598 USA

[5] IBM Res Div, Thomas J Watson Res Ctr, Yorktown Hts, NY 10598 USA

Source:

IBM JOURNAL OF RESEARCH AND DEVELOPMENT | 2010 / Vol. 54 / No. 05

Keywords:

DOI:

10.1147/JRD.2010.2059721

Chinese Library Classification (CLC) number:

TP3 [Computing technology, computer technology];

Discipline classification code:

0812;

Abstract:

As the clock frequency of silicon chips is leveling off, the computer architecture community is looking for different solutions to continue application performance scaling. One such solution is the multicore approach, i.e., using multiple simple cores that enable higher performance than wide superscalar processors, provided that the workload can exploit the parallelism. Another emerging alternative is the use of customized designs (accelerators) at different levels within the system. These are specialized functional units integrated with the core, specialized cores, attached processors, or attached appliances. The design tradeoff is quite compelling because current processor chips have billions of transistors, but they cannot all be activated or switched at the same time at high frequencies. Specialized designs provide increased power efficiency but cannot be used as general-purpose compute engines. Therefore, architects trade area for power efficiency by placing in the design additional units that are known to be active at different times. The resulting system is a heterogeneous architecture, with the potential of specialized execution that accelerates different workloads. While designing and building such hardware systems is attractive, writing and porting software to a heterogeneous platform is even more challenging than parallelism for homogeneous multicore systems. In this paper, we propose a taxonomy that allows us to define classes of accelerators, with the goal of focusing on a small set of programming models for accelerators. We discuss several types of currently popular accelerators and identify challenges to exploiting such accelerators in current software stacks. This paper serves as a guide for both hardware designers by providing them with a view on how software best exploits specialization and software programmers by focusing research efforts to address parallelism and heterogeneity.


Pages: 10

50 items in total

[31] ARCHITECTURES, PROGRAMMING AND PERFORMANCE OF SUPERCOMPUTERS
VOLKERT, J
KERNTECHNIK, 1988, 52 (02) : 112 - 119
[32] Parallel Programming for Heterogeneous Architectures
Krammer, Bettina
Mix, Hartmut
Geimer, Markus
PARALLEL COMPUTING: ACCELERATING COMPUTATIONAL SCIENCE AND ENGINEERING (CSE), 2014, 25 : 731 - 732
[33] A TAXONOMY OF MODELS
REDGRAVE, MJ
OPERATIONS RESEARCH, 1961, 9 : B40 - B40
[34] A Comparative Study and Evaluation of Parallel Programming Models for Shared-Memory Parallel Architectures
Miguel Sanchez, Luis
Fernandez, Javier
Sotomayor, Rafael
Escolar, Soledad
Daniel Garcia, J.
NEW GENERATION COMPUTING, 2013, 31 (03) : 139 - 161
[35] Power Monitoring with PAPI for Extreme Scale Architectures and Dataflow-based Programming Models
McCraw, Heike
Ralph, James
Danalis, Anthony
Dongarra, Jack
2014 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING (CLUSTER), 2014, : 385 - 391
[36] A Methodology Approach to Compare Performance of Parallel Programming Models for Shared-Memory Architectures
Utrera, Gladys
Gil, Marisa
Martorell, Xavier
NUMERICAL COMPUTATIONS: THEORY AND ALGORITHMS, PT I, 2020, 11973 : 318 - 325
[37] Evaluation of Directive-Based Programming Models for Stencil Computation on Current GPGPU Architectures
Shan, Baodi
Araya-Polo, Mauricio
Chapman, Barbara
ADVANCING OPENMP FOR FUTURE ACCELERATORS, IWOMP 2024, 2024, 15195 : 126 - 140
[38] A Comparative Study and Evaluation of Parallel Programming Models for Shared-Memory Parallel Architectures
Luis Miguel Sanchez
Javier Fernandez
Rafael Sotomayor
Soledad Escolar
J. Daniel. Garcia
New Generation Computing, 2013, 31 : 139 - 161
[39] HotTiles: Accelerating SpMM with Heterogeneous Accelerator Architectures
Gerogiannis, Gerasimos
Aananthakrishnan, Sriram
Torrellas, Josep
Hur, Ibrahim
2024 IEEE INTERNATIONAL SYMPOSIUM ON HIGH-PERFORMANCE COMPUTER ARCHITECTURE, HPCA 2024, 2024, : 1012 - 1028
[40] MachSuite: Benchmarks for Accelerator Design and Customized Architectures
Reagen, Brandon
Adolf, Robert
Shao, Yakun Sophia
Wei, Gu-Yeon
Brooks, David
2014 IEEE INTERNATIONAL SYMPOSIUM ON WORKLOAD CHARACTERIZATION (IISWC), 2014, : 110 - 119
