Exploiting Direct Memory Operands in GPU Instructions

被引:0
|
作者
Mohammadpur-Fard, Ali [1 ]
Darabi, Sina [2 ,3 ]
Falahati, Hajar [2 ,4 ]
Mahani, Negin [2 ,4 ,5 ]
Sarbazi-Azad, Hamid [1 ,2 ]
机构
[1] Sharif Univ Technol, Dept Comp Engn, Tehran 111559466, Iran
[2] Inst Res Fundamental Sci IPM, Sch Comp Sci, Tehran 195383351, Iran
[3] Univ Svizzera Italiana USI, Fac Informat, CH-6900 Lugano, Switzerland
[4] Barcelona Supercomp Ctr BSC, Barcelona 08034, Spain
[5] Shahid Bahonar Univ, Dept Comp Engn, Higher Educ Complex Zarand, Kerman 761691411, Iran
关键词
Registers; Graphics processing units; Computer architecture; Reduced instruction set computing; Arithmetic; Hardware; Standards; CISC; GPGPU; RISC; register file;
D O I
10.1109/LCA.2024.3371062
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
GPUs are widely used for diverse applications, particularly data-parallel tasks like machine learning and scientific computing. However, their efficiency is hindered by architectural limitations, inherited from historical RISC processors, in handling memory loads causing high register file contention. We observe that a significant number (around 26%) of values present in the register file are typically used only once, contributing to more than 25% of the total register file bank conflicts, on average. This paper addresses the challenge of single-use memory values in the GPU register file (i.e. data values used only once) which wastes space and increases latency. To this end, we introduce a novel mechanism inspired by CISC architectures. It replaces single-use loads with direct memory operands in arithmetic operations. Our approach improves performance by 20% and reduces energy consumption by 18%, on average, with negligible (<1%) hardware overhead.
引用
收藏
页码:162 / 165
页数:4
相关论文
共 50 条
  • [1] GPU Register Packing: Dynamically Exploiting Narrow-Width Operands to Improve Performance
    Wang, Xin
    Zhang, Wei
    2017 16TH IEEE INTERNATIONAL CONFERENCE ON TRUST, SECURITY AND PRIVACY IN COMPUTING AND COMMUNICATIONS / 11TH IEEE INTERNATIONAL CONFERENCE ON BIG DATA SCIENCE AND ENGINEERING / 14TH IEEE INTERNATIONAL CONFERENCE ON EMBEDDED SOFTWARE AND SYSTEMS, 2017, : 745 - 752
  • [2] Exploiting GPU Direct Access to Non-Volatile Memory to Accelerate Big Data Processing
    Bayati, Mahsa
    Leeser, Miriam
    Mi, Ningfang
    2020 IEEE HIGH PERFORMANCE EXTREME COMPUTING CONFERENCE (HPEC), 2020,
  • [3] Characterizing and Exploiting Small-Value Memory Instructions
    Islam, Mafijul Md
    Stenstrom, Per
    IEEE TRANSACTIONS ON COMPUTERS, 2014, 63 (07) : 1640 - 1655
  • [4] Thorough Evaluation of GPU Shared Memory Load and Store Instructions
    Okamoto, Satoshi
    Ito, Yasuaki
    Nakano, Koji
    Bordim, Jacir L.
    2014 SECOND INTERNATIONAL SYMPOSIUM ON COMPUTING AND NETWORKING (CANDAR), 2014, : 614 - 616
  • [5] Tetris: Accelerating Sparse Convolution by Exploiting Memory Reuse on GPU
    Liu, Xiaoyan
    Zheng, Xuegui
    Yang, Hailong
    Luan, Zhongzhi
    Qian, Depei
    PROCEEDINGS OF THE 29TH ACM SIGPLAN ANNUAL SYMPOSIUM ON PRINCIPLES AND PRACTICE OF PARALLEL PROGRAMMING, PPOPP 2024, 2024, : 229 - 242
  • [6] Exploiting GPU memory hierarchy for accelerating a specialized stencil computation
    Balaiah, Thanasekhar
    Parthasarathi, Ranjani
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2017, 29 (21):
  • [7] Methodology for the efficient use of operands in the design of compound instructions in ASIP
    Lee, Jongwon
    Ahn, Minwook
    Youn, Jonghee M.
    Paek, Yunheung
    2009 INTERNATIONAL SOC DESIGN CONFERENCE (ISOCC 2009), 2009, : 349 - 352
  • [8] Reducing the Delay for Decoding Instructions by Predicting Their Source Register Operands
    Park, Sanghyun
    Jun, Jaeyung
    Kim, Changhyun
    Min, Gyeong Il
    Lee, Hun Jae
    Kim, Seon Wook
    ELECTRONICS, 2020, 9 (05)
  • [9] EraseMe: A Defense Mechanism against Information Leakage exploiting GPU Memory
    Fang, Hongyu
    Doroslovacki, Milos
    Venkataramani, Guru
    GLSVLSI '19 - PROCEEDINGS OF THE 2019 ON GREAT LAKES SYMPOSIUM ON VLSI, 2019, : 319 - 322
  • [10] Intentional Memory Instructions Direct Attention But Do Not Enhance Visual Memory
    Varakin, D. Alexander
    Hale, Jamie
    SAGE OPEN, 2014, 4 (04):