VulExplainer: A Transformer-Based Hierarchical Distillation for Explaining Vulnerability Types

Cited by: 12
|
Authors
Fu, Michael [1 ]
Nguyen, Van [1 ]
Tantithamthavorn, Chakkrit [1 ]
Le, Trung [1 ]
Phung, Dinh [1 ]
Affiliations
[1] Monash University, Faculty of Information Technology, Melbourne, Australia
Keywords
Software vulnerability; software security; classification
DOI
10.1109/TSE.2023.3305244
Chinese Library Classification (CLC)
TP31 [Computer Software]
Subject Classification Codes
081202; 0835
Abstract
Deep learning-based vulnerability prediction approaches have been proposed to help under-resourced security practitioners detect vulnerable functions. However, security practitioners still do not know which type of vulnerability (i.e., which CWE-ID) a given prediction corresponds to. Thus, a novel approach that explains the type of vulnerability for a given prediction is imperative. In this paper, we propose VulExplainer, an approach to explain the type of vulnerabilities. We formulate VulExplainer as a vulnerability classification task. However, vulnerabilities have diverse characteristics (i.e., CWE-IDs) and the number of labeled samples per CWE-ID is highly imbalanced (a highly imbalanced multi-class classification problem), which often leads to inaccurate predictions. Thus, we introduce a Transformer-based hierarchical distillation for software vulnerability classification that addresses the highly imbalanced types of software vulnerabilities. Specifically, we split the complex label distribution into sub-distributions based on CWE abstract types (i.e., categorizations that group similar CWE-IDs), so that similar CWE-IDs are grouped together and each group has a more balanced label distribution. We train a TextCNN teacher on each of the simplified distributions; however, each teacher performs well only within its own group. Thus, we build a Transformer student model that generalizes the performance of the TextCNN teachers through our hierarchical knowledge distillation framework. In an extensive evaluation on 8,636 real-world vulnerabilities, our approach outperforms all of the baselines by 5%-29%. The results also demonstrate that our approach can be applied to Transformer-based architectures such as CodeBERT, GraphCodeBERT, and CodeGPT. Moreover, our method remains compatible with any Transformer-based model without requiring architectural modifications; it only adds a special distillation token to the input. These results highlight our significant contributions towards the fundamental and practical problem of explaining software vulnerability types.
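To make the distillation mechanism described above concrete, below is a minimal sketch of group-wise teacher-student distillation, assuming PyTorch. Every name in it (distillation_loss, training_step, the teachers dict keyed by CWE abstract-type group, the temperature T, and the mixing weight alpha) is a hypothetical illustration rather than the paper's implementation; for simplicity it assumes each TextCNN teacher outputs logits over the full CWE-ID label set, and it omits the special distillation token that the paper prepends to the student's input.

```python
# Hypothetical sketch of group-wise teacher-student distillation (PyTorch).
# Not the paper's implementation; names and hyperparameters are illustrative.
import torch
import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits, labels, T=2.0, alpha=0.5):
    """Mix hard-label cross-entropy with soft-label KL divergence to a teacher."""
    hard = F.cross_entropy(student_logits, labels)
    soft = F.kl_div(
        F.log_softmax(student_logits / T, dim=-1),
        F.softmax(teacher_logits / T, dim=-1),
        reduction="batchmean",
    ) * (T * T)  # scale by T^2 to keep gradient magnitudes comparable
    return alpha * hard + (1 - alpha) * soft

def training_step(student, teachers, batch):
    """One step: the student sees all CWE-IDs, while each TextCNN teacher is
    consulted only for samples from its own CWE abstract-type group."""
    logits = student(batch["input_ids"])
    loss = 0.0
    for group_id, teacher in teachers.items():
        mask = batch["group"] == group_id  # samples belonging to this group
        if mask.any():
            with torch.no_grad():  # teachers are frozen during distillation
                t_logits = teacher(batch["input_ids"][mask])
            loss = loss + distillation_loss(
                logits[mask], t_logits, batch["label"][mask]
            )
    return loss
```

The design point mirrored here is that each teacher supervises only samples from its own, more balanced sub-distribution, while the single Transformer student accumulates supervision across all groups.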
Pages: 4550-4565 (16 pages)
Related Papers (50 in total)
  • [1] Mamede, Claudia; Pinconschi, Eduard; Abreu, Rui. A transformer-based IDE plugin for vulnerability detection. Proceedings of the 37th IEEE/ACM International Conference on Automated Software Engineering (ASE 2022), 2022.
  • [2] Sakhrani, Harsh; Parekh, Saloni; Ratadiya, Pratik. Transformer-based Hierarchical Encoder for Document Classification. 21st IEEE International Conference on Data Mining Workshops (ICDMW 2021), 2021, pp. 852-858.
  • [3] Huang, Zhiqi; Naseri, Shahrzad; Bonab, Hamed; Sarwar, Sheikh Muhammad; Allan, James. Hierarchical Transformer-based Query by Multiple Documents. Proceedings of the 2023 ACM SIGIR International Conference on the Theory of Information Retrieval (ICTIR 2023), 2023, pp. 105-115.
  • [4] Lv, Yuanhai; Wang, Chongyan; Yuan, Wanteng; Qian, Xiaohao; Yang, Wujun; Zhao, Wanqing. Transformer-Based Distillation Hash Learning for Image Retrieval. Electronics, 2022, 11(18).
  • [5] Thapa, Chandra; Jang, Seung Ick; Ahmed, Muhammad Ejaz; Camtepe, Seyit; Pieprzyk, Josef; Nepal, Surya. Transformer-Based Language Models for Software Vulnerability Detection. Proceedings of the 38th Annual Computer Security Applications Conference (ACSAC 2022), 2022, pp. 481-496.
  • [6] Poulton, Andrew; Eliens, Sebas. Explaining transformer-based models for automatic short answer grading. 5th International Conference on Digital Technology in Education (ICDTE 2021), 2021, pp. 110-116.
  • [7] Cornia, Marcella; Baraldi, Lorenzo; Cucchiara, Rita. Explaining transformer-based image captioning models: An empirical analysis. AI Communications, 2022, 35(2), pp. 111-129.
  • [8] Wang, Jinhong; Xu, Zhe; Zheng, Wenhao; Ying, Haochao; Chen, Tingting; Liu, Zuozhu; Chen, Danny Z.; Yao, Ke; Wu, Jian. A Transformer-Based Knowledge Distillation Network for Cortical Cataract Grading. IEEE Transactions on Medical Imaging, 2024, 43(3), pp. 1089-1101.
  • [9] AL-Nomasy, Nadiah; Alamri, Abdulelah; Aljuhani, Ahamed; Kumar, Prabhat. Transformer-based knowledge distillation for explainable intrusion detection system. Computers & Security, 2025, 154.
  • [10] Dai, Wei; Cui, Hejie; Kan, Xuan; Guo, Ying; Van Rooij, Sanne; Yang, Carl. Transformer-based Hierarchical Clustering for Brain Network Analysis. 2023 IEEE 20th International Symposium on Biomedical Imaging (ISBI 2023), 2023.