An efficient sequential consistency implementation with dynamic race detection for GPUs

Authors
Tabbakh, Abdulaziz [1]
Annavaram, Murali [2]
Affiliations
[1] King Fahd Univ Petr & Minerals, Comp Engn Dept, POB 5065, Dhahran 31261, Saudi Arabia
[2] Univ Southern Calif, Elect Engn Dept, 3740 McClintock Ave, Los Angeles, CA 90089 USA
Keywords
Computer architecture; GPU; Memory coherence; Sequential consistency; COHERENCE; OVERHEAD
DOI
10.1016/j.jpdc.2023.104836
Chinese Library Classification
TP301 [Theory, methods]
Discipline classification code
081202
Abstract
As GPUs are increasingly used for general-purpose computation, applications with diverse memory access requirements have emerged. Despite this growing demand, only a few GPU coherence protocols and memory models have been explored in research, and even fewer have been implemented in products. In the CPU domain, by contrast, a diverse range of memory models for parallel programming has been proposed, exploring the interplay between performance and programmability. Sequential consistency (SC) is one of the strict memory models. It provides the most programmer-intuitive execution of memory operations, but it imposes strict ordering restrictions on those operations, which cause performance overhead. Hence, implementing and supporting SC is one of the most challenging tasks on any computing platform, and GPUs are no exception. In this paper, we propose a GPU architecture that implements the SC memory model with minimal performance and power overhead. We achieve this goal by designing a mechanism that detects races between different streaming multiprocessors (SMs) dynamically at runtime. Races are detected using a signature-based mechanism that keeps track of the set of unseen updates for each SM, which significantly reduces the hardware implementation cost at the price of a small increase in invalidation traffic. Our experiments show that dynamic race detection can be used to implement sequential consistency with 5% performance overhead.
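The signature idea in the abstract can be illustrated with a small software model. The sketch below is only an assumption-laden illustration, not the paper's hardware design: the class names `Signature` and `RaceDetector`, the 64-bit signature size, the 128-byte line size, and the two hash functions are all hypothetical choices. Each SM gets a Bloom-filter-style bit vector summarizing addresses written by other SMs that it has not yet observed. A read that hits in the signature is treated as a potential race and triggers an invalidation. Because a signature can report false positives, this compact representation may cause some unnecessary invalidations, which mirrors the trade-off the abstract describes.

```python
class Signature:
    """Fixed-size bit-vector summary of a set of addresses (Bloom-filter style)."""

    def __init__(self, bits=64):
        self.bits = bits      # hypothetical signature width
        self.vector = 0

    def _positions(self, addr):
        # Two simple hash functions over the cache-line index.
        # Both the 128-byte line size and the hashes are illustrative assumptions.
        line = addr >> 7
        return (line % self.bits, ((line * 2654435761) >> 8) % self.bits)

    def insert(self, addr):
        for p in self._positions(addr):
            self.vector |= 1 << p

    def may_contain(self, addr):
        # True means "possibly present" (false positives are allowed);
        # False means "definitely not present".
        return all((self.vector >> p) & 1 for p in self._positions(addr))

    def clear(self):
        self.vector = 0


class RaceDetector:
    """Tracks, per SM, a signature of updates made by other SMs it has not seen."""

    def __init__(self, num_sms, bits=64):
        self.unseen = [Signature(bits) for _ in range(num_sms)]
        self.invalidations = 0

    def write(self, sm, addr):
        # A write by one SM becomes an unseen update for every other SM.
        for other, sig in enumerate(self.unseen):
            if other != sm:
                sig.insert(addr)

    def read(self, sm, addr):
        # A hit in the reader's signature signals a potential race:
        # invalidate stale data and reset the set of unseen updates.
        sig = self.unseen[sm]
        if sig.may_contain(addr):
            sig.clear()
            self.invalidations += 1
            return True
        return False


detector = RaceDetector(num_sms=2)
detector.write(sm=0, addr=0x1000)            # SM 0 updates a line
raced = detector.read(sm=1, addr=0x1000)     # SM 1 reads it before seeing the update
```

In this model a wider signature lowers the false-positive rate at the cost of more storage per SM, which is the kind of cost versus traffic trade-off a hardware design would need to tune.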
Pages: 14