Vulnerability Name Prediction Based on Enhanced Multi-Source Domain Adaptation

被引:0
|
作者
Xing, Ying [1 ,2 ,3 ]
Zhao, Mengci [1 ,2 ,3 ]
Yang, Bin [4 ]
Zhang, Yuwei [5 ]
Li, Wenjin [6 ]
Gu, Jiawei [6 ]
Yuan, Jun [6 ]
Xu, Lexi [4 ]
机构
[1] Beijing Univ Posts & Telecommun, Sch Artificial Intelligence, Beijing 100876, Peoples R China
[2] Nanjing Univ Aeronaut & Astronaut, Minist Ind & Informat Technol, Key Lab Safety Crit Software Dev & Verificat, Nanjing 211106, Peoples R China
[3] Yunnan Key Lab Software Engn, Kunming 650091, Yunnan, Peoples R China
[4] China Unicom Res Inst, Beijing 100048, Peoples R China
[5] Peking Univ, Sch Comp Sci, Beijing 100871, Peoples R China
[6] NSFOCUS Technol Grp Co Ltd, Beijing 100089, Peoples R China
关键词
vulnerability name prediction; multi-source domain adaptation; data augmentation; adversarial training; attention mechanism;
D O I
10.1109/TrustCom60117.2023.00294
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Software products have brought convenience to modern society but also pose significant security risks due to various types of vulnerabilities. Identifying vulnerability names is vital for program repair and software maintenance, but the lack of training data presents a challenge. Big data analytics and machine learning can help overcome this challenge by processing large amounts of data and improving the accuracy of vulnerability name prediction. Considering that the data is often from datasets composed of multiple sources, a feature-based or attention-based multi-source domain adaptation (MSDA) approach is required. In this paper, we propose an MSDA method based on both feature and attention to accomplish the task of predicting vulnerability names, called Multi-Source Domain Adaptation for Vulnerability Name Prediction (MSDA-VNP). First, MSDA-VNP reduces domain divergence by adversarial training and then uses domain-invariant features to obtain feature correlations between individual source and target domains. In combination with the obtained domain correlations, Weighted multi-kernel Maximum Mean Discrepancy (WMK-MMD) is proposed as the attention mechanism. Second, a data augmentation strategy is employed to enhance MSDA-VNP to identify privacy-related vulnerabilities. To evaluate our approach, we conducted experiments on eight Java real-world projects in the Software Assurance Reference Dataset (SARD). The experimental results show that the proposed method MSDA-VNP performed efficiently and stably for the 44 types of vulnerabilities involved. The data augmentation strategy has also been proved to be effective as an enhancement for the proposed method MSDA-VNP.
引用
收藏
页码:2115 / 2121
页数:7
相关论文
共 50 条
  • [31] Moment Matching for Multi-Source Domain Adaptation
    Peng, Xingchao
    Bai, Qinxun
    Xia, Xide
    Huang, Zijun
    Saenko, Kate
    Wang, Bo
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 1406 - 1415
  • [32] Subspace Identification for Multi-Source Domain Adaptation
    Li, Zijian
    Cai, Ruichu
    Chen, Guangyi
    Sun, Boyang
    Hao, Zhifeng
    Zhang, Kun
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [33] Multi-source domain adaptation for image classification
    Morvarid Karimpour
    Shiva Noori Saray
    Jafar Tahmoresnezhad
    Mohammad Pourmahmood Aghababa
    Machine Vision and Applications, 2020, 31
  • [34] Transformer-Based Multi-Source Domain Adaptation Without Source Data
    Li, Gang
    Wu, Chao
    2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN, 2023,
  • [35] Evidential combination of augmented multi-source of information based on domain adaptation
    Linqing Huang
    Zhunga Liu
    Quan Pan
    Jean Dezert
    Science China Information Sciences, 2020, 63
  • [36] Evidential combination of augmented multi-source of information based on domain adaptation
    Huang, Linqing
    Liu, Zhunga
    Pan, Quan
    Dezert, Jean
    SCIENCE CHINA-INFORMATION SCIENCES, 2020, 63 (11)
  • [37] Multi-Source Domain Adaptation via Latent Domain Reconstruction
    Zhou, Jun
    Fu, Chilin
    Zhang, Xiaolu
    COMPANION OF THE WORLD WIDE WEB CONFERENCE, WWW 2023, 2023, : 523 - 527
  • [38] Building damage detection based on multi-source adversarial domain adaptation
    Wang, Xiang
    Li, Yundong
    Lin, Chen
    Liu, Yi
    Geng, Shuo
    JOURNAL OF APPLIED REMOTE SENSING, 2021, 15 (03)
  • [39] Evidential combination of augmented multi-source of information based on domain adaptation
    Linqing HUANG
    Zhunga LIU
    Quan PAN
    Jean DEZERT
    ScienceChina(InformationSciences), 2020, 63 (11) : 38 - 55
  • [40] Unsupervised multi-source domain adaptation with no observable source data
    Jeon, Hyunsik
    Lee, Seongmin
    Kang, U.
    PLOS ONE, 2021, 16 (07):