Semantic Data Augmentation for Deep Learning Testing using Generative AI

被引:1
|
作者
Missaoui, Sondess [1 ]
Gerasimou, Simos [1 ]
Matragkas, Nicholas [2 ]
机构
[1] Univ York, Dept Comp Sci, York, N Yorkshire, England
[2] Univ Paris Saclay, CEA, List, Paris, France
关键词
Generative AI; Deep Learning Testing; Coverage Guided Fuzzing; Data Augmentation; Safe AI;
D O I
10.1109/ASE56229.2023.00194
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The performance of state-of-the-art Deep Learning models heavily depends on the availability of well-curated training and testing datasets that sufficiently capture the operational domain. Data augmentation is an effective technique in alleviating data scarcity, reducing the time-consuming and expensive data collection and labelling processes. Despite their potential, existing data augmentation techniques primarily focus on simple geometric and colour space transformations, like noise, flipping and resizing, producing datasets with limited diversity. When the augmented dataset is used for testing the Deep Learning models, the derived results are typically uninformative about the robustness of the models. We address this gap by introducing GENFUZZER, a novel coverage-guided data augmentation fuzzing technique for Deep Learning models underpinned by generative AI. We demonstrate our approach using widely-adopted datasets and models employed for image classification, illustrating its effectiveness in generating informative datasets leading up to a 26% increase in widely-used coverage criteria.
引用
收藏
页码:1694 / 1698
页数:5
相关论文
共 50 条
  • [31] A deep data augmentation framework based on generative adversarial networks
    Qiping Wang
    Ling Luo
    Haoran Xie
    Yanghui Rao
    Raymond Y.K. Lau
    Detian Zhang
    Multimedia Tools and Applications, 2022, 81 : 42871 - 42887
  • [32] Text Data Augmentation for Deep Learning
    Shorten, Connor
    Khoshgoftaar, Taghi M.
    Furht, Borko
    JOURNAL OF BIG DATA, 2021, 8 (01)
  • [33] Data Augmentation for Bayesian Deep Learning
    Wang, Yuexi
    Polson, Nicholas
    Sokolov, Vadim O.
    BAYESIAN ANALYSIS, 2023, 18 (04): : 1041 - 1069
  • [34] Text Data Augmentation for Deep Learning
    Connor Shorten
    Taghi M. Khoshgoftaar
    Borko Furht
    Journal of Big Data, 8
  • [35] Panel: Using Generative AI in Teaching and Learning
    Sumner, Mary
    Van Slyke, Craig
    Galletta, Dennis F.
    Niederman, Fred
    PROCEEDINGS OF THE 2024 COMPUTERS AND PEOPLE RESEARCH CONFERENCE, SIGMIS-CPR 2024, 2024,
  • [36] A deep data augmentation framework based on generative adversarial networks
    Wang, Qiping
    Luo, Ling
    Xie, Haoran
    Rao, Yanghui
    Lau, Raymond Y. K.
    Zhang, Detian
    MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (29) : 42871 - 42887
  • [38] A Domain Adaptive Semantic Segmentation Method Using Contrastive Learning and Data Augmentation
    Xiang, Yixiao
    Tian, Lihua
    Li, Chen
    NEURAL PROCESSING LETTERS, 2024, 56 (02)
  • [39] A Domain Adaptive Semantic Segmentation Method Using Contrastive Learning and Data Augmentation
    Yixiao Xiang
    Lihua Tian
    Chen Li
    Neural Processing Letters, 56
  • [40] Using Deep Learning in Semantic Classification for Point Cloud Data
    Yao, Xuanxia
    Guo, Jia
    Hu, Juan
    Cao, Qixuan
    IEEE ACCESS, 2019, 7 : 37121 - 37130