Influence of optimization techniques on software performance for subsequent generations of CUDA architecture

被引:1
|
作者
Gambrych, Jacek [1 ]
机构
[1] Warsaw Univ Technol, Inst Elect Syst, Warsaw, Poland
来源
19TH IEEE INTERNATIONAL SYMPOSIUM ON PARALLEL AND DISTRIBUTED PROCESSING WITH APPLICATIONS (ISPA/BDCLOUD/SOCIALCOM/SUSTAINCOM 2021) | 2021年
关键词
Parallel processing; GPGPU; CUDA; optimization techniques;
D O I
10.1109/ISPA-BDCloud-SocialCom-SustainCom52081.2021.00140
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
From the very beginning of the CUDA technology, it was essential to apply state-of-the-art optimization techniques. Only then was it possible to fully utilize the enormous computational power of graphic processing units. However, with the development of the CUDA architecture, the impact of typical optimization techniques on software performance has changed significantly. This article shows how the impact of several optimization techniques on the performance of the image filtering algorithm has changed for the subsequent generations of CUDA architecture. Then, based on the results obtained, it attempts to answer whether tedious and time-consuming optimization of the CUDA software is still necessary.
引用
收藏
页码:1002 / 1009
页数:8
相关论文
共 50 条
  • [31] Application of clustering techniques to software component architecture design
    Lo, SC
    Chang, JH
    INTERNATIONAL JOURNAL OF SOFTWARE ENGINEERING AND KNOWLEDGE ENGINEERING, 2004, 14 (04) : 429 - 439
  • [32] Software architecture recovery and restructuring through clustering techniques
    Nortel, Ottawa, Ont, Canada
    Int Software Archit Workshop Proc ISAW, (101-104):
  • [33] Optimization Principles and Application Performance Evaluation of a Multithreaded GPU Using CUDA
    Ryoo, Shane
    Rodrigues, Christopher I.
    Baghsorkhi, Sara S.
    Stone, Sam S.
    Kirk, David B.
    Hwu, Wen-mei W.
    PPOPP'08: PROCEEDINGS OF THE 2008 ACM SIGPLAN SYMPOSIUM ON PRINCIPLES AND PRACTICE OF PARALLEL PROGRAMMING, 2008, : 73 - 82
  • [34] Parallel data mining techniques on Graphics Processing Unit with Compute Unified Device Architecture (CUDA)
    Liheng Jian
    Cheng Wang
    Ying Liu
    Shenshen Liang
    Weidong Yi
    Yong Shi
    The Journal of Supercomputing, 2013, 64 : 942 - 967
  • [35] Parallel data mining techniques on Graphics Processing Unit with Compute Unified Device Architecture (CUDA)
    Jian, Liheng
    Wang, Cheng
    Liu, Ying
    Liang, Shenshen
    Yi, Weidong
    Shi, Yong
    JOURNAL OF SUPERCOMPUTING, 2013, 64 (03): : 942 - 967
  • [36] Mathematical techniques & software for stochastic design optimization
    Parks, JM
    Li, C
    PROBABILISTIC MECHANICS & STRUCTURAL RELIABILITY: PROCEEDINGS OF THE SEVENTH SPECIALTY CONFERENCE, 1996, : 118 - 121
  • [37] Influence of Program Architecture on Software Quality Attributes
    Mzyk, Rafal
    Paszkiel, Szczepan
    CONTROL, COMPUTER ENGINEERING AND NEUROSCIENCE, 2021, 1362 : 322 - 329
  • [38] Performance enhancement techniques for InfiniBand™ architecture
    Kim, EJ
    Yum, KH
    Das, CR
    Yousif, M
    Duato, J
    NINTH INTERNATIONAL SYMPOSIUM ON HIGH-PERFORMANCE COMPUTER ARCHITECTURE, PROCEEDINGS, 2003, : 253 - 262
  • [39] Performance Analysis of Protein Structure Clustering Techniques and CUDA Implementation of RMSD Computation
    Kunhi, Luibaiba Muhammad
    Raju, K.
    Chiplunkar, Niranjan N.
    PROCEEDINGS OF 2016 IEEE INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING, VLSI, ELECTRICAL CIRCUITS AND ROBOTICS (DISCOVER), 2016, : 18 - 21
  • [40] Optimization of HEP codes on GPUs: Applying NVIDIA's CUDA computing model to HEP software
    Al-Turany M.
    The European Physical Journal Plus, 126 (1) : 1 - 9