LineVul: A Transformer-based Line-Level Vulnerability Prediction

被引:135
|
作者
Fu, Michael [1 ]
Tantithamthavorn, Chakkrit [1 ]
机构
[1] Monash Univ, Clayton, Vic, Australia
基金
澳大利亚研究理事会;
关键词
D O I
10.1145/3524842.3528452
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Software vulnerabilities are prevalent in software systems, causing a variety of problems including deadlock, information loss, or system failures. Thus, early predictions of software vulnerabilities are critically important in safety-critical software systems. Various ML/DL-based approaches have been proposed to predict vulnerabilities at the file/function/method level. Recently, IVDetect (a graph-based neural network) is proposed to predict vulnerabilities at the function level. Yet, the IVDetect approach is still inaccurate and coarse-grained. In this paper, we propose LINEVUL, a Transformer-based line-level vulnerability prediction approach in order to address several limitations of the state-of-the-art IVDetect approach. Through an empirical evaluation of a large-scale real-world dataset with 188k+ C/C++ functions, we show that LINEVUL achieves (1) 160%-379% higher F1-measure for function-level predictions; (2) 12%-25% higher Top-10 Accuracy for line-level predictions; and (3) 29%-53% less Effort@20%Recall than the baseline approaches, highlighting the significant advancement of LINEVUL towards more accurate and more cost-effective line-level vulnerability predictions. Our additional analysis also shows that our LINEVUL is also very accurate (75%-100%) for predicting vulnerable functions affected by the Top-25 most dangerous CWEs, highlighting the potential impact of our LINEVUL in real-world usage scenarios.
引用
收藏
页码:608 / 620
页数:13
相关论文
共 50 条
  • [1] A Line-Level Explainable Vulnerability Detection Approach for Java
    Mosolygó, Balázs
    Vándor, Norbert
    Hegedűs, Péter
    Ferenc, Rudolf
    Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2022, 13380 LNCS : 106 - 122
  • [2] LINEJLOCREPAIR: A line-level method for Automated Vulnerability Repair based on joint training
    Hou, Jing
    Han, Jiaxuan
    Huang, Cheng
    Wang, Nannan
    Li, Lerong
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2025, 166
  • [3] A Line-Level Explainable Vulnerability Detection Approach for Java']Java
    Mosolygo, Balazs
    Vandor, Norbert
    Hegedus, Peter
    Ferenc, Rudolf
    COMPUTATIONAL SCIENCE AND ITS APPLICATIONS - ICCSA 2022 WORKSHOPS, PART IV, 2022, 13380 : 106 - 122
  • [4] ActiveClean: Generating Line-Level Vulnerability Data via Active Learning
    Joshy, Ashwin Kallingal
    Alam, Mirza Sanjida
    Sharmin, Shaila
    Li, Qi
    Le, Wei
    arXiv, 2023,
  • [5] A transformer-based IDE plugin for vulnerability detection
    Mamede, Claudia
    Pinconschi, Eduard
    Abreu, Rui
    PROCEEDINGS OF THE 37TH IEEE/ACM INTERNATIONAL CONFERENCE ON AUTOMATED SOFTWARE ENGINEERING, ASE 2022, 2022,
  • [6] Transformer-Based Language Models for Software Vulnerability Detection
    Thapa, Chandra
    Jang, Seung Ick
    Ahmed, Muhammad Ejaz
    Camtepe, Seyit
    Pieprzyk, Josef
    Nepal, Surya
    PROCEEDINGS OF THE 38TH ANNUAL COMPUTER SECURITY APPLICATIONS CONFERENCE, ACSAC 2022, 2022, : 481 - 496
  • [7] DeepLineDP: Towards a Deep Learning Approach for Line-Level Defect Prediction
    Pornprasit, Chanathip
    Tantithamthavorn, Chakkrit
    IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 2023, 49 (01) : 84 - 98
  • [8] Transformer-based structural seismic response prediction
    Zhang, Qingyu
    Guo, Maozi
    Zhao, Lingling
    Li, Yang
    Zhang, Xinxin
    Han, Miao
    STRUCTURES, 2024, 61
  • [9] Temporal fusion transformer-based prediction in aquaponics
    Metin, Ahmet
    Kasif, Ahmet
    Catal, Cagatay
    JOURNAL OF SUPERCOMPUTING, 2023, 79 (17): : 19934 - 19958
  • [10] A Transformer-Based Framework for Geomagnetic Activity Prediction
    Abduallah, Yasser
    Wang, Jason T. L.
    Xu, Chunhui
    Wang, Haimin
    FOUNDATIONS OF INTELLIGENT SYSTEMS (ISMIS 2022), 2022, 13515 : 325 - 335