LineVul: A Transformer-based Line-Level Vulnerability Prediction

被引：135

作者：

Fu, Michael ^{[1
]}

Tantithamthavorn, Chakkrit ^{[1
]}

机构：

[1] Monash Univ, Clayton, Vic, Australia

来源：

2022 MINING SOFTWARE REPOSITORIES CONFERENCE (MSR 2022) | 2022年

基金：

澳大利亚研究理事会;

关键词：

D O I：

10.1145/3524842.3528452

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Software vulnerabilities are prevalent in software systems, causing a variety of problems including deadlock, information loss, or system failures. Thus, early predictions of software vulnerabilities are critically important in safety-critical software systems. Various ML/DL-based approaches have been proposed to predict vulnerabilities at the file/function/method level. Recently, IVDetect (a graph-based neural network) is proposed to predict vulnerabilities at the function level. Yet, the IVDetect approach is still inaccurate and coarse-grained. In this paper, we propose LINEVUL, a Transformer-based line-level vulnerability prediction approach in order to address several limitations of the state-of-the-art IVDetect approach. Through an empirical evaluation of a large-scale real-world dataset with 188k+ C/C++ functions, we show that LINEVUL achieves (1) 160%-379% higher F1-measure for function-level predictions; (2) 12%-25% higher Top-10 Accuracy for line-level predictions; and (3) 29%-53% less Effort@20%Recall than the baseline approaches, highlighting the significant advancement of LINEVUL towards more accurate and more cost-effective line-level vulnerability predictions. Our additional analysis also shows that our LINEVUL is also very accurate (75%-100%) for predicting vulnerable functions affected by the Top-25 most dangerous CWEs, highlighting the potential impact of our LINEVUL in real-world usage scenarios.

引用

页码：608 / 620

页数：13

共 50 条

[1] A Line-Level Explainable Vulnerability Detection Approach for Java
Mosolygó, Balázs
Vándor, Norbert
Hegedűs, Péter
Ferenc, Rudolf
Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2022, 13380 LNCS : 106 - 122
[2] LINEJLOCREPAIR: A line-level method for Automated Vulnerability Repair based on joint training
Hou, Jing
Han, Jiaxuan
Huang, Cheng
Wang, Nannan
Li, Lerong
FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2025, 166
[3] A Line-Level Explainable Vulnerability Detection Approach for Java']Java
Mosolygo, Balazs
Vandor, Norbert
Hegedus, Peter
Ferenc, Rudolf
COMPUTATIONAL SCIENCE AND ITS APPLICATIONS - ICCSA 2022 WORKSHOPS, PART IV, 2022, 13380 : 106 - 122
[4] ActiveClean: Generating Line-Level Vulnerability Data via Active Learning
Joshy, Ashwin Kallingal
Alam, Mirza Sanjida
Sharmin, Shaila
Li, Qi
Le, Wei
arXiv, 2023,
[5] A transformer-based IDE plugin for vulnerability detection
Mamede, Claudia
Pinconschi, Eduard
Abreu, Rui
PROCEEDINGS OF THE 37TH IEEE/ACM INTERNATIONAL CONFERENCE ON AUTOMATED SOFTWARE ENGINEERING, ASE 2022, 2022,
[6] Transformer-Based Language Models for Software Vulnerability Detection
Thapa, Chandra
Jang, Seung Ick
Ahmed, Muhammad Ejaz
Camtepe, Seyit
Pieprzyk, Josef
Nepal, Surya
PROCEEDINGS OF THE 38TH ANNUAL COMPUTER SECURITY APPLICATIONS CONFERENCE, ACSAC 2022, 2022, : 481 - 496
[7] DeepLineDP: Towards a Deep Learning Approach for Line-Level Defect Prediction
Pornprasit, Chanathip
Tantithamthavorn, Chakkrit
IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 2023, 49 (01) : 84 - 98
[8] Transformer-based structural seismic response prediction
Zhang, Qingyu
Guo, Maozi
Zhao, Lingling
Li, Yang
Zhang, Xinxin
Han, Miao
STRUCTURES, 2024, 61
[9] Temporal fusion transformer-based prediction in aquaponics
Metin, Ahmet
Kasif, Ahmet
Catal, Cagatay
JOURNAL OF SUPERCOMPUTING, 2023, 79 (17): : 19934 - 19958
[10] A Transformer-Based Framework for Geomagnetic Activity Prediction
Abduallah, Yasser
Wang, Jason T. L.
Xu, Chunhui
Wang, Haimin
FOUNDATIONS OF INTELLIGENT SYSTEMS (ISMIS 2022), 2022, 13515 : 325 - 335

← 1 2 3 4 5 →