OpenCL performance portability for general-purpose computation on graphics processor units: an exploration on cryptographic primitives

被引:11
|
作者
Agosta, Giovanni [1 ]
Barenghi, Alessandro [1 ]
Di Federico, Alessandro [1 ]
Pelosi, Gerardo [1 ]
机构
[1] Politecn Milan, DEIB, I-20133 Milan, Italy
来源
关键词
heterogeneous architectures; OpenCL; performance portability; implementation space exploration; IMPLEMENTATION;
D O I
10.1002/cpe.3358
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
The modern trend toward heterogeneous many-core architectures has led to high architectural diversity in both high performance and high-end embedded systems. To effectively exploit the computational resources of such a wide range of architectures, programming languages and APIs such as OpenCL have become increasingly popular. Although OpenCL provides functional code portability and the ability to fine tune the application to the target hardware, providing performance portability is still an open problem. Thus, many research works have investigated the optimization of specific combinations of application and target platform. In this paper, we aim at leveraging the experience obtained in the implementation of algorithms from the cryptography domain to provide a set of guidelines for modern many-core heterogeneous architecture performance portability and to establish a base on which domain-specific languages and compiler transformations could be built in the near future. We study algorithmic choices and the effect of compiler transformations on three representative applications in the chosen domain on a set of seven target platforms. To estimate how well the application fits the architecture, we define a metric of computational intensity both for the architecture and the application implementation. Besides being useful to compare either different implementation or algorithmic choices and their fitness to a specific architecture, it can also be useful to the compiler to guide the code optimization process. Copyright (C) 2014 John Wiley & Sons, Ltd.
引用
收藏
页码:3633 / 3660
页数:28
相关论文
共 50 条
  • [31] A Data-Traffic Aware Dynamic Power Management for General-Purpose Graphics Processing Units
    Chiou, Lih-Yih
    Yang, Chao-Kai
    Chang, Che-Pin
    [J]. 2019 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2019,
  • [32] Performance of H.26L Video Encoder on General-Purpose Processor
    Ville Lappalainen
    Antti Hallapuro
    Timo D. Hämäläinen
    [J]. Journal of VLSI signal processing systems for signal, image and video technology, 2003, 34 : 239 - 249
  • [33] Performance of H.26L video encoder on general-purpose processor
    Lappalainen, V
    Hallapuro, A
    Hämäläinen, TD
    [J]. ICCE: 2001 INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS, DIGEST OF TECHNICAL PAPERS, 2001, : 266 - 267
  • [34] Performance of an advanced video codec on a general-purpose processor with media ISA extensions
    Lappalainen, V
    Defee, P
    Hallapuro, A
    [J]. IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS - 2000 DIGEST OF TECHNICAL PAPERS, 2000, : 318 - 319
  • [35] Performance of H.26L video encoder on general-purpose processor
    Lappalainen, V
    Hallapuro, A
    Hämäläinen, TD
    [J]. JOURNAL OF VLSI SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2003, 34 (03): : 239 - 249
  • [36] Performance of an advanced video codec on a general-purpose processor with media ISA extensions
    Lappalainen, V
    [J]. IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 2000, 46 (03) : 706 - 716
  • [37] Optimization of Multiple-Rendezvous Low-Thrust Missions on General-Purpose Graphics Processing Units
    Massari, Mauro
    Wittig, Alexander
    [J]. JOURNAL OF AEROSPACE INFORMATION SYSTEMS, 2016, 13 (02): : 80 - 92
  • [38] Comprehensive regression-based model to predict performance of general-purpose graphics processing unit
    Shafiabadi, Mohammad Hossein
    Pedram, Hossein
    Reshadi, Midia
    Reza, Akram
    [J]. CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2020, 23 (02): : 1505 - 1516
  • [39] Comprehensive regression-based model to predict performance of general-purpose graphics processing unit
    Mohammad Hossein Shafiabadi
    Hossein Pedram
    Midia Reshadi
    Akram Reza
    [J]. Cluster Computing, 2020, 23 : 1505 - 1516
  • [40] THE INTEL I860 64-BIT PROCESSOR - A GENERAL-PURPOSE CPU WITH 3D GRAPHICS CAPABILITIES
    GRIMES, J
    KOHN, L
    BHARADHWAJ, R
    [J]. IEEE COMPUTER GRAPHICS AND APPLICATIONS, 1989, 9 (04) : 85 - 94