Vulnerability Name Prediction Based on Enhanced Multi-Source Domain Adaptation

Cited by: 0
Authors
Xing, Ying [1 ,2 ,3 ]
Zhao, Mengci [1 ,2 ,3 ]
Yang, Bin [4 ]
Zhang, Yuwei [5 ]
Li, Wenjin [6 ]
Gu, Jiawei [6 ]
Yuan, Jun [6 ]
Xu, Lexi [4 ]
Affiliations
[1] Beijing Univ Posts & Telecommun, Sch Artificial Intelligence, Beijing 100876, Peoples R China
[2] Nanjing Univ Aeronaut & Astronaut, Minist Ind & Informat Technol, Key Lab Safety Crit Software Dev & Verificat, Nanjing 211106, Peoples R China
[3] Yunnan Key Lab Software Engn, Kunming 650091, Yunnan, Peoples R China
[4] China Unicom Res Inst, Beijing 100048, Peoples R China
[5] Peking Univ, Sch Comp Sci, Beijing 100871, Peoples R China
[6] NSFOCUS Technol Grp Co Ltd, Beijing 100089, Peoples R China
Keywords
vulnerability name prediction; multi-source domain adaptation; data augmentation; adversarial training; attention mechanism
DOI
10.1109/TrustCom60117.2023.00294
Chinese Library Classification
TP18 [Artificial Intelligence Theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Software products have brought convenience to modern society, but they also pose significant security risks because of the many types of vulnerabilities they contain. Identifying vulnerability names is vital for program repair and software maintenance, yet the lack of training data makes this difficult. Big data analytics and machine learning can help overcome this challenge by processing large amounts of data and improving the accuracy of vulnerability name prediction. Because the data often come from datasets composed of multiple sources, a feature-based or attention-based multi-source domain adaptation (MSDA) approach is required. In this paper, we propose an MSDA method that is both feature-based and attention-based for predicting vulnerability names, called Multi-Source Domain Adaptation for Vulnerability Name Prediction (MSDA-VNP). First, MSDA-VNP reduces domain divergence through adversarial training and then uses the resulting domain-invariant features to obtain the feature correlations between each source domain and the target domain. Combined with these domain correlations, Weighted Multi-Kernel Maximum Mean Discrepancy (WMK-MMD) is proposed as the attention mechanism. Second, a data augmentation strategy is employed to strengthen the ability of MSDA-VNP to identify privacy-related vulnerabilities. To evaluate our approach, we conducted experiments on eight real-world Java projects in the Software Assurance Reference Dataset (SARD). The experimental results show that MSDA-VNP performed efficiently and stably on the 44 types of vulnerabilities involved. The data augmentation strategy also proved effective as an enhancement to MSDA-VNP.
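The abstract does not give the formula for WMK-MMD, so the following is only a minimal sketch of the general idea it names: a multi-kernel MMD computed between each source domain and the target, combined using per-source weights. Everything specific here is an assumption made for illustration and is not taken from the paper: the Gaussian kernel family, the bandwidth values, the biased MMD estimator, the softmax weighting over negative discrepancies (standing in for the learned domain correlations), and all function names.

```python
import math
from typing import List, Sequence, Tuple

Vector = Sequence[float]


def gaussian_kernel(x: Vector, y: Vector, bandwidth: float) -> float:
    """Gaussian (RBF) kernel between two feature vectors."""
    sq_dist = sum((a - b) ** 2 for a, b in zip(x, y))
    return math.exp(-sq_dist / (2.0 * bandwidth ** 2))


def multi_kernel(x: Vector, y: Vector, bandwidths: Sequence[float]) -> float:
    """Uniform average of Gaussian kernels (assumed kernel family)."""
    return sum(gaussian_kernel(x, y, b) for b in bandwidths) / len(bandwidths)


def mk_mmd2(source: List[Vector], target: List[Vector],
            bandwidths: Sequence[float]) -> float:
    """Biased estimate of the squared multi-kernel MMD between two samples."""
    def mean_k(a: List[Vector], b: List[Vector]) -> float:
        total = sum(multi_kernel(x, y, bandwidths) for x in a for y in b)
        return total / (len(a) * len(b))

    return (mean_k(source, source) + mean_k(target, target)
            - 2.0 * mean_k(source, target))


def source_weights(mmds: Sequence[float], temperature: float = 1.0) -> List[float]:
    """Hypothetical weighting: softmax over negative discrepancies,
    so sources closer to the target receive larger weights."""
    scores = [math.exp(-m / temperature) for m in mmds]
    total = sum(scores)
    return [s / total for s in scores]


def weighted_mk_mmd(sources: List[List[Vector]], target: List[Vector],
                    bandwidths: Sequence[float] = (0.5, 1.0, 2.0)
                    ) -> Tuple[float, List[float], List[float]]:
    """Weighted sum of per-source MK-MMD values (illustrative only)."""
    mmds = [mk_mmd2(s, target, bandwidths) for s in sources]
    weights = source_weights(mmds)
    loss = sum(w * m for w, m in zip(weights, mmds))
    return loss, weights, mmds


# Toy features: one source domain near the target, one far from it.
target = [[0.0, 0.0], [0.1, -0.1], [-0.1, 0.1]]
near = [[0.05, 0.0], [0.0, 0.1], [-0.1, 0.0]]
far = [[3.0, 3.0], [3.1, 2.9], [2.9, 3.1]]
loss, weights, mmds = weighted_mk_mmd([near, far], target)
print(weights, mmds, loss)
```

In this toy setup the source that lies closer to the target in feature space has the smaller discrepancy and therefore the larger weight. In a trained model the weights would presumably come from the learned domain correlations rather than from a fixed softmax.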
Pages: 2115-2121
Page count: 7