Semantic Data Augmentation for Deep Learning Testing using Generative AI

被引:1
|
作者
Missaoui, Sondess [1 ]
Gerasimou, Simos [1 ]
Matragkas, Nicholas [2 ]
机构
[1] Univ York, Dept Comp Sci, York, N Yorkshire, England
[2] Univ Paris Saclay, CEA, List, Paris, France
关键词
Generative AI; Deep Learning Testing; Coverage Guided Fuzzing; Data Augmentation; Safe AI;
D O I
10.1109/ASE56229.2023.00194
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The performance of state-of-the-art Deep Learning models heavily depends on the availability of well-curated training and testing datasets that sufficiently capture the operational domain. Data augmentation is an effective technique in alleviating data scarcity, reducing the time-consuming and expensive data collection and labelling processes. Despite their potential, existing data augmentation techniques primarily focus on simple geometric and colour space transformations, like noise, flipping and resizing, producing datasets with limited diversity. When the augmented dataset is used for testing the Deep Learning models, the derived results are typically uninformative about the robustness of the models. We address this gap by introducing GENFUZZER, a novel coverage-guided data augmentation fuzzing technique for Deep Learning models underpinned by generative AI. We demonstrate our approach using widely-adopted datasets and models employed for image classification, illustrating its effectiveness in generating informative datasets leading up to a 26% increase in widely-used coverage criteria.
引用
收藏
页码:1694 / 1698
页数:5
相关论文
共 50 条
  • [41] Enhancing Text Classification Models with Generative AI-aided Data Augmentation
    Zhao, Huanhuan
    Chen, Haihua
    Yoon, Hong-Jun
    2023 IEEE INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE TESTING, AITEST, 2023, : 138 - 145
  • [42] Seismic data interpolation using deep learning with generative adversarial networks
    Kaur, Harpreet
    Pham, Nam
    Fomel, Sergey
    GEOPHYSICAL PROSPECTING, 2021, 69 (02) : 307 - 326
  • [43] Diversity in Deep Generative Models and Generative AI
    Turinici, Gabriel
    MACHINE LEARNING, OPTIMIZATION, AND DATA SCIENCE, LOD 2023, PT II, 2024, 14506 : 84 - 93
  • [44] Data augmentation using conditional generative adversarial network (cGAN): applications for sewer condition classification and testing using different machine learning techniques
    Woldesellasse, Haile
    Tesfamariam, Solomon
    JOURNAL OF HYDROINFORMATICS, 2024, 26 (07) : 1471 - 1489
  • [45] Prediction of Research Project Execution using Data Augmentation and Deep Learning
    Flores, Anibal
    Tito-Chura, Hugo
    Zea-Rospigliosi, Lissethe
    INTELIGENCIA ARTIFICIAL-IBEROAMERICAL JOURNAL OF ARTIFICIAL INTELLIGENCE, 2023, 26 (71): : 46 - 58
  • [46] Deep Learning for Topmost Roller Chain Detection Using Data Augmentation
    Wang, Yulin
    Zhou, Yijun
    Luo, Chen
    2019 4TH INTERNATIONAL CONFERENCE ON MECHANICAL, CONTROL AND COMPUTER ENGINEERING (ICMCCE 2019), 2019, : 443 - 446
  • [47] Research on Data Augmentation for Lithography Hotspot Detection Using Deep Learning
    Borisov, Vadim
    Scheible, Juergen
    34TH EUROPEAN MASK AND LITHOGRAPHY CONFERENCE, 2018, 10775
  • [48] The Effect of Data Augmentation on ADHD Diagnostic Model using Deep Learning
    Cicek, Gulay
    Ozmen, Atilla
    Akan, Aydin
    2019 MEDICAL TECHNOLOGIES CONGRESS (TIPTEKNO), 2019, : 165 - 168
  • [49] Fingerprint pattern classification using deep transfer learning and data augmentation
    Ametefe, Divine Senanu
    Sarnin, Suzi Seroja
    Ali, Darmawaty Mohd
    Muhammad, Zaigham Zaheer
    VISUAL COMPUTER, 2023, 39 (04): : 1703 - 1716
  • [50] A Study of Data Augmentation for Handwritten Character Recognition Using Deep Learning
    Hayashi, Taihei
    Gyohten, Keiji
    Ohki, Hidehiro
    Takami, Toshiya
    PROCEEDINGS 2018 16TH INTERNATIONAL CONFERENCE ON FRONTIERS IN HANDWRITING RECOGNITION (ICFHR), 2018, : 552 - 557