Vulnerability Name Prediction Based on Enhanced Multi-Source Domain Adaptation

Authors
Xing, Ying [1 ,2 ,3 ]
Zhao, Mengci [1 ,2 ,3 ]
Yang, Bin [4 ]
Zhang, Yuwei [5 ]
Li, Wenjin [6 ]
Gu, Jiawei [6 ]
Yuan, Jun [6 ]
Xu, Lexi [4 ]
Affiliations
[1] Beijing Univ Posts & Telecommun, Sch Artificial Intelligence, Beijing 100876, Peoples R China
[2] Nanjing Univ Aeronaut & Astronaut, Minist Ind & Informat Technol, Key Lab Safety Crit Software Dev & Verificat, Nanjing 211106, Peoples R China
[3] Yunnan Key Lab Software Engn, Kunming 650091, Yunnan, Peoples R China
[4] China Unicom Res Inst, Beijing 100048, Peoples R China
[5] Peking Univ, Sch Comp Sci, Beijing 100871, Peoples R China
[6] NSFOCUS Technol Grp Co Ltd, Beijing 100089, Peoples R China
Keywords
vulnerability name prediction; multi-source domain adaptation; data augmentation; adversarial training; attention mechanism
DOI
10.1109/TrustCom60117.2023.00294
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Software products have brought convenience to modern society, but they also pose significant security risks because of the many types of vulnerabilities they contain. Identifying vulnerability names is vital for program repair and software maintenance, but the lack of training data presents a challenge. Big data analytics and machine learning can help overcome this challenge by processing large amounts of data and improving the accuracy of vulnerability name prediction. Because the data often comes from datasets composed of multiple sources, a feature-based or attention-based multi-source domain adaptation (MSDA) approach is required. In this paper, we propose an MSDA method that is both feature-based and attention-based for the task of predicting vulnerability names, called Multi-Source Domain Adaptation for Vulnerability Name Prediction (MSDA-VNP). First, MSDA-VNP reduces domain divergence through adversarial training and then uses the resulting domain-invariant features to obtain the feature correlations between each source domain and the target domain. Using these domain correlations, Weighted Multi-Kernel Maximum Mean Discrepancy (WMK-MMD) is proposed as the attention mechanism. Second, a data augmentation strategy is employed to enhance the ability of MSDA-VNP to identify privacy-related vulnerabilities. To evaluate our approach, we conducted experiments on eight real-world Java projects in the Software Assurance Reference Dataset (SARD). The experimental results show that MSDA-VNP performed efficiently and stably on the 44 types of vulnerabilities involved. The data augmentation strategy also proved effective as an enhancement to MSDA-VNP.
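The abstract does not give the formula for WMK-MMD, so the following is only a minimal sketch of the general idea, under stated assumptions: a biased multi-kernel MMD estimate with Gaussian kernels, and per-source weights obtained from a softmax over negative distances so that sources closer to the target receive more attention. The function names, the bandwidths, and the softmax weighting are illustrative choices, not the authors' implementation.

```python
import math


def gaussian_kernel(x, y, bandwidth):
    """Gaussian (RBF) kernel between two feature vectors."""
    sq_dist = sum((a - b) ** 2 for a, b in zip(x, y))
    return math.exp(-sq_dist / (2.0 * bandwidth ** 2))


def mean_kernel(xs, ys, bandwidth):
    """Average kernel value over all pairs drawn from xs and ys."""
    total = sum(gaussian_kernel(x, y, bandwidth) for x in xs for y in ys)
    return total / (len(xs) * len(ys))


def mk_mmd2(source, target, bandwidths=(0.5, 1.0, 2.0)):
    """Biased estimate of squared MMD, averaged over several bandwidths.

    Assumption: equal kernel weights; the paper may weight kernels differently.
    """
    total = 0.0
    for bw in bandwidths:
        total += (mean_kernel(source, source, bw)
                  + mean_kernel(target, target, bw)
                  - 2.0 * mean_kernel(source, target, bw))
    return total / len(bandwidths)


def source_weights(distances, temperature=1.0):
    """Softmax over negative distances: closer sources get larger weights.

    Assumption: this stands in for the attention weights; it is hypothetical.
    """
    logits = [-d / temperature for d in distances]
    top = max(logits)  # subtract the maximum for numerical stability
    exps = [math.exp(l - top) for l in logits]
    norm = sum(exps)
    return [e / norm for e in exps]


def weighted_mk_mmd(sources, target, temperature=1.0):
    """Weighted sum of per-source MK-MMD values, plus the weights used."""
    distances = [mk_mmd2(s, target) for s in sources]
    weights = source_weights(distances, temperature)
    loss = sum(w * d for w, d in zip(weights, distances))
    return loss, weights
```

Calling `weighted_mk_mmd([source_a, source_b], target)` with lists of equal-length feature vectors returns the weighted discrepancy and one weight per source domain. In a training loop the features would come from the adversarially trained extractor rather than from raw inputs.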
Pages: 2115 - 2121
Page count: 7
Related papers
50 in total
  • [1] Multi-Source Domain Adaptation Enhanced Warehouse Dwell Time Prediction
    Zhao, Wei
    Mao, Jiali
    Lv, Xingyi
    Jin, Cheqing
    Zhou, Aoying
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2024, 36 (06) : 2533 - 2547
  • [2] Defect Category Prediction Method Based on Multi-source Domain Adaptation
    Xing Y.
    Zhao M.-C.
    Yang B.
    Zhang Y.-W.
    Li W.-J.
    Gu J.-W.
    Yuan J.
    Ruan Jian Xue Bao/Journal of Software, 2024, 35 (07) : 3227 - 3244
  • [3] Transformer Based Multi-Source Domain Adaptation
    Wright, Dustin
    Augenstein, Isabelle
    PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP), 2020 : 7963 - 7974
  • [4] Attention-Based Multi-Source Domain Adaptation
    Zuo, Yukun
    Yao, Hantao
    Xu, Changsheng
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 3793 - 3803
  • [5] Multi-source based approach for Visual Domain Adaptation
    Tiwari, Mrinalini
    Sanodiya, Rakesh Kumar
    Mathew, Jimson
    Saha, Sriparna
    2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021
  • [6] A survey of multi-source domain adaptation
    Sun, Shiliang
    Shi, Honglei
    Wu, Yuanbin
    INFORMATION FUSION, 2015, 24 : 84 - 92
  • [7] Multi-Source Distilling Domain Adaptation
    Zhao, Sicheng
    Wang, Guangzhi
    Zhang, Shanghang
    Gu, Yang
    Li, Yaxian
    Song, Zhichao
    Xu, Pengfei
    Hu, Runbo
    Chai, Hua
    Keutzer, Kurt
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 12975 - 12983
  • [8] BAYESIAN MULTI-SOURCE DOMAIN ADAPTATION
    Sun, Shi-Liang
    Shi, Hong-Lei
    PROCEEDINGS OF 2013 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS (ICMLC), VOLS 1-4, 2013 : 24 - 28
  • [9] Multi-Source Survival Domain Adaptation
    Shaker, Ammar
    Lawrence, Carolin
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 8, 2023 : 9752 - 9762
  • [10] Multi-source domain adaptation with graph embedding and adaptive label prediction
    Ma, Ao
    You, Fuming
    Jing, Mengmeng
    Li, Jingjing
    Lu, Ke
    INFORMATION PROCESSING & MANAGEMENT, 2020, 57 (06)