Vulnerability Name Prediction Based on Enhanced Multi-Source Domain Adaptation

被引:0
|
作者
Xing, Ying [1 ,2 ,3 ]
Zhao, Mengci [1 ,2 ,3 ]
Yang, Bin [4 ]
Zhang, Yuwei [5 ]
Li, Wenjin [6 ]
Gu, Jiawei [6 ]
Yuan, Jun [6 ]
Xu, Lexi [4 ]
机构
[1] Beijing Univ Posts & Telecommun, Sch Artificial Intelligence, Beijing 100876, Peoples R China
[2] Nanjing Univ Aeronaut & Astronaut, Minist Ind & Informat Technol, Key Lab Safety Crit Software Dev & Verificat, Nanjing 211106, Peoples R China
[3] Yunnan Key Lab Software Engn, Kunming 650091, Yunnan, Peoples R China
[4] China Unicom Res Inst, Beijing 100048, Peoples R China
[5] Peking Univ, Sch Comp Sci, Beijing 100871, Peoples R China
[6] NSFOCUS Technol Grp Co Ltd, Beijing 100089, Peoples R China
关键词
vulnerability name prediction; multi-source domain adaptation; data augmentation; adversarial training; attention mechanism;
D O I
10.1109/TrustCom60117.2023.00294
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Software products have brought convenience to modern society but also pose significant security risks due to various types of vulnerabilities. Identifying vulnerability names is vital for program repair and software maintenance, but the lack of training data presents a challenge. Big data analytics and machine learning can help overcome this challenge by processing large amounts of data and improving the accuracy of vulnerability name prediction. Considering that the data is often from datasets composed of multiple sources, a feature-based or attention-based multi-source domain adaptation (MSDA) approach is required. In this paper, we propose an MSDA method based on both feature and attention to accomplish the task of predicting vulnerability names, called Multi-Source Domain Adaptation for Vulnerability Name Prediction (MSDA-VNP). First, MSDA-VNP reduces domain divergence by adversarial training and then uses domain-invariant features to obtain feature correlations between individual source and target domains. In combination with the obtained domain correlations, Weighted multi-kernel Maximum Mean Discrepancy (WMK-MMD) is proposed as the attention mechanism. Second, a data augmentation strategy is employed to enhance MSDA-VNP to identify privacy-related vulnerabilities. To evaluate our approach, we conducted experiments on eight Java real-world projects in the Software Assurance Reference Dataset (SARD). The experimental results show that the proposed method MSDA-VNP performed efficiently and stably for the 44 types of vulnerabilities involved. The data augmentation strategy has also been proved to be effective as an enhancement for the proposed method MSDA-VNP.
引用
收藏
页码:2115 / 2121
页数:7
相关论文
共 50 条
  • [21] Dynamic Transfer for Multi-Source Domain Adaptation
    Li, Yunsheng
    Yuan, Lu
    Chen, Yinpeng
    Wang, Pei
    Vasconcelos, Nuno
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 10993 - 11002
  • [22] Multi-Source Domain Adaptation for Object Detection
    Yao, Xingxu
    Zhao, Sicheng
    Xu, Pengfei
    Yang, Jufeng
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 3253 - 3262
  • [23] On the analysis of adaptability in multi-source domain adaptation
    Ievgen Redko
    Amaury Habrard
    Marc Sebban
    Machine Learning, 2019, 108 : 1635 - 1652
  • [24] Multi-Source Domain Adaptation with Sinkhorn Barycenter
    Komatsu, Tatsuya
    Matsui, Tomoko
    Gao, Junbin
    29TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2021), 2021, : 1371 - 1375
  • [25] Graphical Modeling for Multi-Source Domain Adaptation
    Xu, Minghao
    Wang, Hang
    Ni, Bingbing
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (03) : 1727 - 1741
  • [26] Multi-Source Attention for Unsupervised Domain Adaptation
    Cui, Xia
    Bollegala, Danushka
    1ST CONFERENCE OF THE ASIA-PACIFIC CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 10TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (AACL-IJCNLP 2020), 2020, : 873 - 883
  • [27] Multi-Source Domain Adaptation with Mixture of Experts
    Guo, Jiang
    Shah, Darsh J.
    Barzilay, Regina
    2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018), 2018, : 4694 - 4703
  • [28] Multi-source Domain Adaptation for Face Recognition
    Yi, Haiyang
    Xu, Zhi
    Wen, Yimin
    Fan, Zhigang
    2018 24TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2018, : 1349 - 1354
  • [29] Automatic online multi-source domain adaptation
    Renchunzi, Xie
    Pratama, Mahardhika
    INFORMATION SCIENCES, 2022, 582 : 480 - 494
  • [30] Multi-source domain adaptation for image classification
    Karimpour, Morvarid
    Noori Saray, Shiva
    Tahmoresnezhad, Jafar
    Pourmahmood Aghababa, Mohammad
    MACHINE VISION AND APPLICATIONS, 2020, 31 (06)