Semantic Data Augmentation for Deep Learning Testing using Generative AI

Cited by: 1
Authors
Missaoui, Sondess [1]
Gerasimou, Simos [1]
Matragkas, Nicholas [2]
Affiliations
[1] Univ York, Dept Comp Sci, York, N Yorkshire, England
[2] Univ Paris Saclay, CEA, List, Paris, France
Keywords
Generative AI; Deep Learning Testing; Coverage Guided Fuzzing; Data Augmentation; Safe AI
DOI
10.1109/ASE56229.2023.00194
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
The performance of state-of-the-art Deep Learning models depends heavily on the availability of well-curated training and testing datasets that sufficiently capture the operational domain. Data augmentation is an effective technique for alleviating data scarcity, reducing the need for time-consuming and expensive data collection and labelling. Despite their potential, existing data augmentation techniques focus primarily on simple geometric and colour space transformations, such as noise, flipping and resizing, and so produce datasets with limited diversity. When such an augmented dataset is used to test Deep Learning models, the results are typically uninformative about the robustness of the models. We address this gap by introducing GENFUZZER, a novel coverage-guided data augmentation fuzzing technique for Deep Learning models underpinned by generative AI. We demonstrate our approach using widely adopted datasets and models for image classification, illustrating its effectiveness in generating informative datasets that yield an increase of up to 26% in widely used coverage criteria.
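The idea of coverage-guided selection can be illustrated with a minimal Python sketch. This is not GENFUZZER itself: the abstract does not say which coverage criteria or generative model are used, so the sketch assumes plain neuron coverage with a fixed activation threshold. The function names (`activated`, `neuron_coverage`, `select_by_coverage`), the threshold of 0.5, and the toy activation values are all hypothetical, and the step that would generate candidate inputs with a generative model is left out.

```python
from typing import List, Sequence, Set, Tuple


def activated(activations: Sequence[float], threshold: float) -> Set[int]:
    """Indices of neurons whose activation exceeds the threshold for one input."""
    return {i for i, a in enumerate(activations) if a > threshold}


def neuron_coverage(batch: Sequence[Sequence[float]], threshold: float = 0.5) -> float:
    """Fraction of neurons activated by at least one input in the batch."""
    covered: Set[int] = set()
    for acts in batch:
        covered |= activated(acts, threshold)
    return len(covered) / len(batch[0])


def select_by_coverage(
    seeds: Sequence[Sequence[float]],
    candidates: Sequence[Sequence[float]],
    threshold: float = 0.5,
) -> Tuple[List[int], float]:
    """Keep only candidates that activate a neuron not yet covered.

    Returns the indices of the kept candidates and the final coverage.
    """
    covered: Set[int] = set()
    for acts in seeds:
        covered |= activated(acts, threshold)

    kept: List[int] = []
    for idx, acts in enumerate(candidates):
        newly_covered = activated(acts, threshold) - covered
        if newly_covered:  # candidate is informative: it raises coverage
            kept.append(idx)
            covered |= newly_covered
    return kept, len(covered) / len(seeds[0])


# Toy activations for a 4-neuron layer (assumed values, one row per input).
seeds = [[0.9, 0.1, 0.2, 0.0]]
candidates = [
    [0.8, 0.0, 0.0, 0.0],  # activates only an already-covered neuron
    [0.0, 0.7, 0.0, 0.0],  # covers neuron 1
    [0.0, 0.6, 0.9, 0.0],  # covers neuron 2
]
kept, final_coverage = select_by_coverage(seeds, candidates)
```

In this toy run the first candidate is discarded because it adds no coverage, while the other two are kept. A real fuzzer would obtain the activations from the model under test and would also check that each generated input preserves its label.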
Pages: 1694-1698
Page count: 5