A Systematic Review of Synthetic Data Generation Techniques Using Generative AI

被引:0
|
作者
Goyal, Mandeep [1 ]
Mahmoud, Qusay H. [1 ]
机构
[1] Ontario Tech Univ, Dept Elect Comp & Software Engn, Oshawa, ON L1G 0C5, Canada
关键词
synthetic data; LLMs; GANs; VAEs; generative AI; neural networks; machine learning;
D O I
10.3390/electronics13173509
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Synthetic data are increasingly being recognized for their potential to address serious real-world challenges in various domains. They provide innovative solutions to combat the data scarcity, privacy concerns, and algorithmic biases commonly used in machine learning applications. Synthetic data preserve all underlying patterns and behaviors of the original dataset while altering the actual content. The methods proposed in the literature to generate synthetic data vary from large language models (LLMs), which are pre-trained on gigantic datasets, to generative adversarial networks (GANs) and variational autoencoders (VAEs). This study provides a systematic review of the various techniques proposed in the literature that can be used to generate synthetic data to identify their limitations and suggest potential future research areas. The findings indicate that while these technologies generate synthetic data of specific data types, they still have some drawbacks, such as computational requirements, training stability, and privacy-preserving measures which limit their real-world usability. Addressing these issues will facilitate the broader adoption of synthetic data generation techniques across various disciplines, thereby advancing machine learning and data-driven solutions.
引用
收藏
页数:38
相关论文
共 50 条
  • [1] Generative AI: A systematic review using topic modelling techniques
    Gupta P.
    Ding B.
    Guan C.
    Ding D.
    Data and Information Management, 8 (02):
  • [2] Indoor Synthetic Data Generation: A Systematic Review
    Schieber, Hannah
    Demir, Kubilay Can
    Kleinbeck, Constantin
    Yang, Seung Hee
    Roth, Daniel
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2024, 240
  • [3] Synthetic Fingerprint Generation Using Generative Adversarial Networks: A Review
    Dhaneshwar, Ritika
    Taya, Arnav
    Kaur, Mandeep
    FOURTH CONGRESS ON INTELLIGENT SYSTEMS, VOL 1, CIS 2023, 2024, 868 : 375 - 387
  • [4] Generative Models for Synthetic Urban Mobility Data: A Systematic Literature Review
    Kapp, Alexandra
    Hansmeyer, Julia
    Mihaljevic, Helena
    ACM COMPUTING SURVEYS, 2024, 56 (04)
  • [5] Generation of Synthetic Tabular Healthcare Data Using Generative Adversarial Networks
    Nik, Alireza Hossein Zadeh
    Riegler, Michael A.
    Halvorsen, Pal
    Storas, Andrea M.
    MULTIMEDIA MODELING, MMM 2023, PT I, 2023, 13833 : 434 - 446
  • [6] A comprehensive review of synthetic data generation in smart farming by using variational autoencoder and generative adversarial network
    Akkem, Yaganteeswarudu
    Biswas, Saroj Kumar
    Varanasi, Aruna
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 131
  • [7] Generative AI in education and research: A systematic mapping review
    Yusuf, Abdullahi
    Pervin, Nasrin
    Roman-Gonzalez, Marcos
    Noor, Norah Md
    REVIEW OF EDUCATION, 2024, 12 (02):
  • [8] Revolutionizing personalized medicine with generative AI: a systematic review
    Ghebrehiwet, Isaias
    Zaki, Nazar
    Damseh, Rafat
    Mohamad, Mohd Saberi
    ARTIFICIAL INTELLIGENCE REVIEW, 2024, 57 (05)
  • [9] A Systematic Review for the Implication of Generative AI in Higher Education
    Al-Shabandar, Raghad
    Jaddoad, Ail
    Elwi, Taha A.
    Mohammed, A.H.
    Hussain, Abir Jaafar
    Infocommunications Journal, 2024, 16 (03): : 31 - 42
  • [10] A Systematic Review of Generative AI for Teaching and Learning Practice
    Ogunleye, Bayode
    Zakariyyah, Kudirat Ibilola
    Ajao, Oluwaseun
    Olayinka, Olakunle
    Sharma, Hemlata
    EDUCATION SCIENCES, 2024, 14 (06):