Data augmentation for numerical data from manufacturing processes: an overview of techniques and assessment of when which techniques work

Cited by: 0
Authors
Henry Ekwaro-Osire [1]
Sai Lalitha Ponugupati [2]
Abdullah Al Noman [3]
Dennis Bode [1]
Klaus-Dieter Thoben [2]
Affiliations
[1] BIBA - Bremer Institut für Produktion und Logistik GmbH, Faculty of Production Engineering, Institute for Integrated Product Development (BIK)
[2] University of Bremen, Faculty of Electrical Engineering
[3] University of Bremen
Keywords
Data augmentation; Manufacturing; Process modelling; Machine learning
DOI
10.1007/s44244-024-00021-x
Abstract
Over the past two decades, machine learning (ML) has transformed manufacturing, particularly in optimizing production and quality control. A significant challenge in ML applications is obtaining sufficient training data, which data augmentation aims to address. While widely applied to image, text, and sound data, data augmentation for numerical data in manufacturing has seen limited investigation. This paper empirically compares three data augmentation techniques (generative adversarial networks, variational auto-encoders combined with long short-term memory, and warping) on four manufacturing datasets. It also provides a literature review, highlighting that generative models are the most common technique for numerical manufacturing data. Preliminary findings suggest that generative adversarial networks are effective for non-time-series numerical data, especially with datasets featuring many correlated model features, multiple machines, and sufficient instances and labels. This research enhances the understanding of data augmentation in manufacturing ML applications, emphasizing the need for tailored strategies.