Vulnerability Name Prediction Based on Enhanced Multi-Source Domain Adaptation

被引:0
|
作者
Xing, Ying [1 ,2 ,3 ]
Zhao, Mengci [1 ,2 ,3 ]
Yang, Bin [4 ]
Zhang, Yuwei [5 ]
Li, Wenjin [6 ]
Gu, Jiawei [6 ]
Yuan, Jun [6 ]
Xu, Lexi [4 ]
机构
[1] Beijing Univ Posts & Telecommun, Sch Artificial Intelligence, Beijing 100876, Peoples R China
[2] Nanjing Univ Aeronaut & Astronaut, Minist Ind & Informat Technol, Key Lab Safety Crit Software Dev & Verificat, Nanjing 211106, Peoples R China
[3] Yunnan Key Lab Software Engn, Kunming 650091, Yunnan, Peoples R China
[4] China Unicom Res Inst, Beijing 100048, Peoples R China
[5] Peking Univ, Sch Comp Sci, Beijing 100871, Peoples R China
[6] NSFOCUS Technol Grp Co Ltd, Beijing 100089, Peoples R China
关键词
vulnerability name prediction; multi-source domain adaptation; data augmentation; adversarial training; attention mechanism;
D O I
10.1109/TrustCom60117.2023.00294
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Software products have brought convenience to modern society but also pose significant security risks due to various types of vulnerabilities. Identifying vulnerability names is vital for program repair and software maintenance, but the lack of training data presents a challenge. Big data analytics and machine learning can help overcome this challenge by processing large amounts of data and improving the accuracy of vulnerability name prediction. Considering that the data is often from datasets composed of multiple sources, a feature-based or attention-based multi-source domain adaptation (MSDA) approach is required. In this paper, we propose an MSDA method based on both feature and attention to accomplish the task of predicting vulnerability names, called Multi-Source Domain Adaptation for Vulnerability Name Prediction (MSDA-VNP). First, MSDA-VNP reduces domain divergence by adversarial training and then uses domain-invariant features to obtain feature correlations between individual source and target domains. In combination with the obtained domain correlations, Weighted multi-kernel Maximum Mean Discrepancy (WMK-MMD) is proposed as the attention mechanism. Second, a data augmentation strategy is employed to enhance MSDA-VNP to identify privacy-related vulnerabilities. To evaluate our approach, we conducted experiments on eight Java real-world projects in the Software Assurance Reference Dataset (SARD). The experimental results show that the proposed method MSDA-VNP performed efficiently and stably for the 44 types of vulnerabilities involved. The data augmentation strategy has also been proved to be effective as an enhancement for the proposed method MSDA-VNP.
引用
收藏
页码:2115 / 2121
页数:7
相关论文
共 50 条
  • [11] MixtureWeight Estimation and Model Prediction in Multi-source Multi-target Domain Adaptation
    Deng, Yuyang
    Kuzborskij, Ilja
    Mahdavi, Mehrdad
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [12] A multi-source transfer learning model based on LSTM and domain adaptation for building energy prediction
    Lu, Huiming
    Wu, Jiazheng
    Ruan, Yingjun
    Qian, Fanyue
    Meng, Hua
    Gao, Yuan
    Xu, Tingting
    INTERNATIONAL JOURNAL OF ELECTRICAL POWER & ENERGY SYSTEMS, 2023, 149
  • [13] Multi-source multi-modal domain adaptation
    Zhao, Sicheng
    Jiang, Jing
    Tang, Wenbo
    Zhu, Jiankun
    Chen, Hui
    Xu, Pengfei
    Schuller, Bjorn W.
    Tao, Jianhua
    Yao, Hongxun
    Ding, Guiguang
    INFORMATION FUSION, 2025, 117
  • [14] Wasserstein Barycenter for Multi-Source Domain Adaptation
    Montesuma, Eduardo Fernandes
    Mboula, Fred Maurice Ngole
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 16780 - 16788
  • [15] Unsupervised Multi-source Domain Adaptation for Regression
    Richard, Guillaume
    de Mathelin, Antoine
    Hebrail, Georges
    Mougeot, Mathilde
    Vayatis, Nicolas
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2020, PT I, 2021, 12457 : 395 - 411
  • [16] On the analysis of adaptability in multi-source domain adaptation
    Redko, Ievgen
    Habrard, Amaury
    Sebban, Marc
    MACHINE LEARNING, 2019, 108 (8-9) : 1635 - 1652
  • [17] Multi-source Domain Adaptation for Semantic Segmentation
    Zhao, Sicheng
    Li, Bo
    Yue, Xiangyu
    Gu, Yang
    Xu, Pengfei
    Hu, Runbo
    Chai, Hua
    Keutzer, Kurt
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [18] Multi-Source Contribution Learning for Domain Adaptation
    Li, Keqiuyin
    Lu, Jie
    Zuo, Hua
    Zhang, Guangquan
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 33 (10) : 5293 - 5307
  • [19] Multi-Source Domain Adaptation: A Causal View
    Zhang, Kun
    Gong, Mingming
    Schoelkopf, Bernhard
    PROCEEDINGS OF THE TWENTY-NINTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2015, : 3150 - 3157
  • [20] Coupled Training for Multi-Source Domain Adaptation
    Amosy, Ohad
    Chechik, Gal
    2022 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2022), 2022, : 1071 - 1080