Semantic Data Augmentation for Deep Learning Testing using Generative AI

被引：1

作者：

Missaoui, Sondess ^{[1
]}

Gerasimou, Simos ^{[1
]}

Matragkas, Nicholas ^{[2
]}

机构：

[1] Univ York, Dept Comp Sci, York, N Yorkshire, England

[2] Univ Paris Saclay, CEA, List, Paris, France

来源：

2023 38TH IEEE/ACM INTERNATIONAL CONFERENCE ON AUTOMATED SOFTWARE ENGINEERING, ASE | 2023年

关键词：

Generative AI; Deep Learning Testing; Coverage Guided Fuzzing; Data Augmentation; Safe AI;

D O I：

10.1109/ASE56229.2023.00194

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

The performance of state-of-the-art Deep Learning models heavily depends on the availability of well-curated training and testing datasets that sufficiently capture the operational domain. Data augmentation is an effective technique in alleviating data scarcity, reducing the time-consuming and expensive data collection and labelling processes. Despite their potential, existing data augmentation techniques primarily focus on simple geometric and colour space transformations, like noise, flipping and resizing, producing datasets with limited diversity. When the augmented dataset is used for testing the Deep Learning models, the derived results are typically uninformative about the robustness of the models. We address this gap by introducing GENFUZZER, a novel coverage-guided data augmentation fuzzing technique for Deep Learning models underpinned by generative AI. We demonstrate our approach using widely-adopted datasets and models employed for image classification, illustrating its effectiveness in generating informative datasets leading up to a 26% increase in widely-used coverage criteria.

引用

页码：1694 / 1698

页数：5

共 50 条

[31] A deep data augmentation framework based on generative adversarial networks
Qiping Wang
Ling Luo
Haoran Xie
Yanghui Rao
Raymond Y.K. Lau
Detian Zhang
Multimedia Tools and Applications, 2022, 81 : 42871 - 42887
[32] Text Data Augmentation for Deep Learning
Shorten, Connor
Khoshgoftaar, Taghi M.
Furht, Borko
JOURNAL OF BIG DATA, 2021, 8 (01)
[33] Data Augmentation for Bayesian Deep Learning
Wang, Yuexi
Polson, Nicholas
Sokolov, Vadim O.
BAYESIAN ANALYSIS, 2023, 18 (04): : 1041 - 1069
[34] Text Data Augmentation for Deep Learning
Connor Shorten
Taghi M. Khoshgoftaar
Borko Furht
Journal of Big Data, 8
[35] Panel: Using Generative AI in Teaching and Learning
Sumner, Mary
Van Slyke, Craig
Galletta, Dennis F.
Niederman, Fred
PROCEEDINGS OF THE 2024 COMPUTERS AND PEOPLE RESEARCH CONFERENCE, SIGMIS-CPR 2024, 2024,
[36] A deep data augmentation framework based on generative adversarial networks
Wang, Qiping
Luo, Ling
Xie, Haoran
Rao, Yanghui
Lau, Raymond Y. K.
Zhang, Detian
MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (29) : 42871 - 42887
[37] Educational Data Analysis using Generative AI
1600, CEUR-WS (3667):
[38] A Domain Adaptive Semantic Segmentation Method Using Contrastive Learning and Data Augmentation
Xiang, Yixiao
Tian, Lihua
Li, Chen
NEURAL PROCESSING LETTERS, 2024, 56 (02)
[39] A Domain Adaptive Semantic Segmentation Method Using Contrastive Learning and Data Augmentation
Yixiao Xiang
Lihua Tian
Chen Li
Neural Processing Letters, 56
[40] Using Deep Learning in Semantic Classification for Point Cloud Data
Yao, Xuanxia
Guo, Jia
Hu, Juan
Cao, Qixuan
IEEE ACCESS, 2019, 7 : 37121 - 37130

← 1 2 3 4 5 →