Detecting Data Races in OpenMP with Deep Learning and Large Language Models

被引:0
|
作者
Alsofyani, May [1 ]
Wang, Liqiang [1 ]
机构
[1] Univ Cent Florida, Dept Comp Sci, Orlando, FL 32816 USA
关键词
data race; race condition; bug detection; OpenMP; transformer encoder; large language model; CodeBERTa; GPT-4; Turbo;
D O I
10.1145/3677333.3678160
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Transformer-based neural network models are increasingly employed to handle software engineering issues, such as bug localization and program repair. These models, equipped with a self-attention mechanism, excel at understanding source code context and semantics. Recently, large language models (LLMs) have emerged as a promising alternative for analyzing and understanding code structure. In this paper, we propose two novel methods for detecting data race bugs in OpenMP programs. The first method is based on a transformer encoder trained from scratch. The second method leverages LLMs, specifically extending GPT-4 Turbo through the use of prompt engineering and fine-tuning techniques. For training and testing our approach, we utilized two datasets comprising different OpenMP directives. Our experiments show that the transformer encoder achieves competitive accuracy compared to LLMs, whether through fine-tuning or prompt engineering techniques. This performance may be attributed to the complexity of many OpenMP directives and the limited availability of labeled datasets.
引用
收藏
页码:96 / 103
页数:8
相关论文
共 50 条
  • [1] ARCHER: Effectively Spotting Data Races in Large OpenMP Applications
    Atzeni, Simone
    Gopalakrishnan, Ganesh
    Rakamaric, Zvonimir
    Ahn, Dong H.
    Laguna, Ignacio
    Schulz, Martin
    Lee, Gregory L.
    Protze, Joachim
    Mueller, Matthias S.
    Mueller, Matthias S.
    2016 IEEE 30TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM (IPDPS 2016), 2016, : 53 - 62
  • [2] A practical tool for detecting races in OpenMP programs
    Kim, YJ
    Park, MY
    Park, SH
    Jun, YK
    PARALLEL COMPUTING TECHNOLOGIES, 2005, 3606 : 321 - 330
  • [3] A Tool for Detecting First Races in OpenMP Programs
    Kang, Mun-Hye
    Ha, Ok-Kyoon
    Jun, Sang-Woo
    Jun, Yong-Kee
    PARALLEL COMPUTING TECHNOLOGIES, PROCEEDINGS, 2009, 5698 : 299 - +
  • [4] A Petri Nets Based Approach for Detecting the Data Races and Deadlocks in OpenMP Program
    Xian, Yulong
    Ding, Zhijun
    2012 THIRD INTERNATIONAL CONFERENCE ON THEORETICAL AND MATHEMATICAL FOUNDATIONS OF COMPUTER SCIENCE (ICTMF 2012), 2013, 38 : 229 - 237
  • [5] A comparison of scalable labeling schemes for detecting races in OpenMP programs
    Park, SH
    Park, MY
    Jun, YK
    OPENMP SHARED MEMORY PARALLEL PROGRAMMING, PROCEEDINGS, 2001, 2104 : 68 - 80
  • [6] Natural Language Interface for Data Visualization with Deep Learning Based Language Models
    Stoeckl, Andreas
    2022 26TH INTERNATIONAL CONFERENCE INFORMATION VISUALISATION (IV), 2022, : 142 - 148
  • [7] Neural Data Augmentation for Legal Overruling Task: Small Deep Learning Models vs. Large Language Models
    Sheik, Reshma
    Sundara, K. P. Siva
    Nirmala, S. Jaya
    NEURAL PROCESSING LETTERS, 2024, 56 (02)
  • [8] Neural Data Augmentation for Legal Overruling Task: Small Deep Learning Models vs. Large Language Models
    Reshma Sheik
    K. P. Siva Sundara
    S. Jaya Nirmala
    Neural Processing Letters, 56
  • [9] Automatic Kernel Generation for Large Language Models on Deep Learning Accelerators
    Wang, Fuyu
    Shen, Minghua
    2023 IEEE/ACM INTERNATIONAL CONFERENCE ON COMPUTER AIDED DESIGN, ICCAD, 2023,
  • [10] An empirical analysis of Intel Thread Checker for detecting races in OpenMP programs
    Kim, Young-Joo
    Kim, Daeyoung
    Jun, Yong-Kee
    7TH IEEE/ACIS INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION SCIENCE IN CONJUNCTION WITH 2ND IEEE/ACIS INTERNATIONAL WORKSHOP ON E-ACTIVITY, PROCEEDINGS, 2008, : 409 - +