LD: Low-Overhead GPU Race Detection Without Access Monitoring

被引:10
|
作者
Li, Pengcheng [1 ]
Hu, Xiaoyu [1 ]
Chen, Dong [1 ]
Brock, Jacob [1 ]
Luo, Hao [1 ]
Zhang, Eddy Z. [2 ]
Ding, Chen [1 ]
机构
[1] Univ Rochester, POB 270226,CSB Bldg, Rochester, NY 14627 USA
[2] Rutgers State Univ, Dept Comp Sci, 110 Frelinghuysen Rd, Piscataway, NJ 08854 USA
基金
美国国家科学基金会;
关键词
GPU race detection; low overhead; value-based checking; instrumentation-free;
D O I
10.1145/3046678
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Data race detection has become an important problem in GPU programming. Previous designs of CPU racechecking tools are mainly task parallel and incur high overhead on GPUs due to access instrumentation, especially when monitoring many thousands of threads routinely used by GPU programs. This article presents a novel data-parallel solution designed and optimized for the GPU architecture. It includes compiler support and a set of runtime techniques. It uses value-based checking, which detects the races reported in previous work, finds new races, and supports race-free deterministic GPU execution. More important, race checking is massively data parallel and does not introduce divergent branching or atomic synchronization. Its slowdown is less than 5x for over half of the tests and 10x on average, which is orders of magnitude more efficient than the cuda-memcheck tool by Nvidia and the methods that use fine-grained access instrumentation.
引用
收藏
页数:25
相关论文
共 50 条
  • [1] Low-Overhead Detection of Memory Access Patterns and Their Time Evolution
    Servat, Harald
    Llort, German
    Gonzalez, Juan
    Gimenez, Judit
    Labarta, Jesus
    [J]. EURO-PAR 2015: PARALLEL PROCESSING, 2015, 9233 : 57 - 69
  • [2] LoGA: Low-overhead GPU accounting using events
    Kehne, Jens
    Spassov, Stanislav
    Hillenbrand, Marius
    Rittinghaus, Marc
    Bellosa, Frank
    [J]. SYSTOR'17: PROCEEDINGS OF THE 10TH ACM INTERNATIONAL SYSTEMS AND STORAGE CONFERENCE, 2017,
  • [3] Towards Providing Low-Overhead Data Race Detection for Large OpenMP Applications
    Protze, Joachim
    Atzeni, Simone
    Ahn, Dong H.
    Schulz, Martin
    Gopalakrishnan, Ganesh
    Mueller, Matthias S.
    Laguna, Ignacio
    Rakamaric, Zvonimir
    Lee, Greg L.
    [J]. PROCEEDINGS OF LLVM-HPC 14 2014 LLVM COMPILER INFRASTRUCTURE IN HPC, 2014, : 40 - 47
  • [4] Low-Overhead Trace Collection and Profiling on GPU Compute Kernels
    Darche, Sebastien
    Dagenais, Michel R.
    [J]. ACM TRANSACTIONS ON PARALLEL COMPUTING, 2024, 11 (02)
  • [5] Distop: A low-overhead cluster monitoring system
    Andresen, D
    Schopf, N
    Bowker, E
    Bower, T
    [J]. PDPTA'03: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED PROCESSING TECHNIQUES AND APPLICATIONS, VOLS 1-4, 2003, : 1832 - 1836
  • [6] ObfusMem: A Low-Overhead Access Obfuscation for Trusted Memories
    Awad, Amro
    Wang, Yipeng
    Shands, Deborah
    Solihin, Yan
    [J]. 44TH ANNUAL INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE (ISCA 2017), 2017, : 107 - 119
  • [7] TripleID: A Low-Overhead Representation and Querying Using GPU for Large RDFs
    Chantrapornchai, Chantana
    Choksuchat, Chidchanok
    Haidl, Michael
    Gorlatch, Sergei
    [J]. BEYOND DATABASES, ARCHITECTURES AND STRUCTURES, BDAS 2016, 2016, 613 : 400 - 415
  • [8] GRace: A Low-Overhead Mechanism for Detecting Data Races in GPU Programs
    Zheng, Mai
    Ravi, Vignesh T.
    Qin, Feng
    Agrawal, Gagan
    [J]. ACM SIGPLAN NOTICES, 2011, 46 (08) : 135 - 145
  • [9] TAUoverSupermon; Low-overhead Online parallel performance monitoring
    Nataraj, Aroon
    Sottile, Matthew
    Morris, Alan
    Malony, Allen D.
    Shende, Sameer
    [J]. EURO-PAR 2007 PARALLEL PROCESSING, PROCEEDINGS, 2007, 4641 : 85 - +
  • [10] Low-Overhead Evaluation of Multiuser Detection Performance for Physical-Layer Multiple Access Systems
    Han, Fengxia
    Zhao, Shengjie
    Jiang, Hao
    Chen, Hong
    Zhang, Chenxi
    [J]. IEEE ACCESS, 2020, 8 : 20537 - 20545