Vulnerability Name Prediction Based on Enhanced Multi-Source Domain Adaptation

被引：0

作者：

Xing, Ying ^{[1
,2
,3
]}

Zhao, Mengci ^{[1
,2
,3
]}

Yang, Bin ^{[4
]}

Zhang, Yuwei ^{[5
]}

Li, Wenjin ^{[6
]}

Gu, Jiawei ^{[6
]}

Yuan, Jun ^{[6
]}

Xu, Lexi ^{[4
]}

机构：

[1] Beijing Univ Posts & Telecommun, Sch Artificial Intelligence, Beijing 100876, Peoples R China

[2] Nanjing Univ Aeronaut & Astronaut, Minist Ind & Informat Technol, Key Lab Safety Crit Software Dev & Verificat, Nanjing 211106, Peoples R China

[3] Yunnan Key Lab Software Engn, Kunming 650091, Yunnan, Peoples R China

[4] China Unicom Res Inst, Beijing 100048, Peoples R China

[5] Peking Univ, Sch Comp Sci, Beijing 100871, Peoples R China

[6] NSFOCUS Technol Grp Co Ltd, Beijing 100089, Peoples R China

来源：

2023 IEEE 22ND INTERNATIONAL CONFERENCE ON TRUST, SECURITY AND PRIVACY IN COMPUTING AND COMMUNICATIONS, TRUSTCOM, BIGDATASE, CSE, EUC, ISCI 2023 | 2024年

关键词：

vulnerability name prediction; multi-source domain adaptation; data augmentation; adversarial training; attention mechanism;

D O I：

10.1109/TrustCom60117.2023.00294

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Software products have brought convenience to modern society but also pose significant security risks due to various types of vulnerabilities. Identifying vulnerability names is vital for program repair and software maintenance, but the lack of training data presents a challenge. Big data analytics and machine learning can help overcome this challenge by processing large amounts of data and improving the accuracy of vulnerability name prediction. Considering that the data is often from datasets composed of multiple sources, a feature-based or attention-based multi-source domain adaptation (MSDA) approach is required. In this paper, we propose an MSDA method based on both feature and attention to accomplish the task of predicting vulnerability names, called Multi-Source Domain Adaptation for Vulnerability Name Prediction (MSDA-VNP). First, MSDA-VNP reduces domain divergence by adversarial training and then uses domain-invariant features to obtain feature correlations between individual source and target domains. In combination with the obtained domain correlations, Weighted multi-kernel Maximum Mean Discrepancy (WMK-MMD) is proposed as the attention mechanism. Second, a data augmentation strategy is employed to enhance MSDA-VNP to identify privacy-related vulnerabilities. To evaluate our approach, we conducted experiments on eight Java real-world projects in the Software Assurance Reference Dataset (SARD). The experimental results show that the proposed method MSDA-VNP performed efficiently and stably for the 44 types of vulnerabilities involved. The data augmentation strategy has also been proved to be effective as an enhancement for the proposed method MSDA-VNP.

引用

页码：2115 / 2121

页数：7

共 50 条

[41] Multi-EPL: Accurate multi-source domain adaptation
Lee, Seongmin
Jeon, Hyunsik
Kang, U.
PLOS ONE, 2021, 16 (08):
[42] Multi-source unsupervised domain adaptation for object detection
Zhang, Dan
Ye, Mao
Liu, Yiguang
Xiong, Lin
Zhou, Lihua
INFORMATION FUSION, 2022, 78 : 138 - 148
[43] STEM: An approach to Multi-source Domain Adaptation with Guarantees
Nguyen, Van-Anh
Nguyen, Tuan
Le, Trung
Tran, Quan Hung
Phung, Dinh
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 9332 - 9343
[44] Weighted progressive alignment for multi-source domain adaptation
Kunhong Wu
Liang Li
Yahong Han
Multimedia Systems, 2023, 29 : 117 - 128
[45] Riemannian representation learning for multi-source domain adaptation
Chen, Sentao
Zheng, Lin
Wu, Hanrui
PATTERN RECOGNITION, 2023, 137
[46] Multi-Source Unsupervised Domain Adaptation with Prototype Aggregation
Huang, Min
Xie, Zifeng
Sun, Bo
Wang, Ning
MATHEMATICS, 2025, 13 (04)
[47] Multi-Source Domain Adaptation for Visual Sentiment Classification
Lin, Chuang
Zhao, Sicheng
Meng, Lei
Chua, Tat-Seng
THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 2661 - 2668
[48] Improved multi-source domain adaptation by preservation of factors
Schrom, Sebastian
Hasler, Stephan
Adamy, Juergen
IMAGE AND VISION COMPUTING, 2021, 112
[49] Universal multi-Source domain adaptation for image classification
Yin, Yueming
Yang, Zhen
Hu, Haifeng
Wu, Xiaofu
PATTERN RECOGNITION, 2022, 121
[50] Leveraging Mixture Alignment for Multi-Source Domain Adaptation
Dayal, Aveen
Shrusti, S.
Cenkeramaddi, Linga Reddy
Mohan, C. Krishna
Kumar, Abhinav
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2025, 34 : 885 - 898

← 1 2 3 4 5 →