Data augmentation for numerical data from manufacturing processes: an overview of techniques and assessment of when which techniques work

被引:0
|
作者
Henry Ekwaro-Osire [1 ]
Sai Lalitha Ponugupati [2 ]
Abdullah Al Noman [3 ]
Dennis Bode [1 ]
Klaus-Dieter Thoben [2 ]
机构
[1] BIBA—Bremer Institut für Produktion und Logistik GmbH,Faculty of Production Engineering, Institute for Integrated Product Development (BIK)
[2] University of Bremen,Faculty of Electrical Engineering
[3] University of Bremen,undefined
来源
关键词
Data augmentation; Manufacturing; Process modelling; Machine learning;
D O I
10.1007/s44244-024-00021-x
中图分类号
学科分类号
摘要
Over the past two decades, machine learning (ML) has transformed manufacturing, particularly in optimizing production and quality control. A significant challenge in ML applications is obtaining sufficient training data, which data augmentation aims to address. While widely applied to image, text, and sound data, data augmentation for numerical data in manufacturing has seen limited investigation. This paper empirically compares three data augmentation techniques—generative adversarial networks, variational auto-encoders mixed with long-short-term memory, and warping—on four manufacturing datasets. It also provides a literature review, highlighting that generative models are the most common technique for numerical manufacturing data. Preliminary findings suggest that generative adversarial networks are effective for non-time-series numerical data, especially with datasets featuring many correlated model features, multiple machines, and sufficient instances and labels. This research enhances the understanding of data augmentation in manufacturing ML applications, emphasizing the need for tailored strategies.
引用
收藏
相关论文
共 50 条
  • [32] Medical image data augmentation: techniques, comparisons and interpretations
    Evgin Goceri
    Artificial Intelligence Review, 2023, 56 : 12561 - 12605
  • [33] NUMERICAL INTERPOLATION TECHNIQUES OF DIGITAL RADAR DATA FROM GATE
    PYTLOWANY, PJ
    SCHERER, WD
    BULLETIN OF THE AMERICAN METEOROLOGICAL SOCIETY, 1975, 56 (01) : 142 - 142
  • [34] Batch-Balancing Improvement with Data Augmentation Techniques for Clinical Electroencephalographic Data
    Fernandez-Madera Gonzalez, David
    Moncada Martins, Fernando
    Gonzalez, Victor M.
    Villar, Jose R.
    Garcia Lopez, Beatriz
    Isabel Gomez-Menendez, Ana
    HYBRID ARTIFICIAL INTELLIGENT SYSTEM, PT I, HAIS 2024, 2025, 14857 : 16 - 28
  • [35] Flow cytometry techniques and data assessment
    Nebe, CT
    INFUSIONSTHERAPIE UND TRANSFUSIONSMEDIZIN, 1996, 23 (02): : 111 - 113
  • [36] Data Mining: Web Data Mining Techniques, Tools and Algorithms: An Overview
    Mughal, Muhammd Jawad Hamid
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2018, 9 (06) : 208 - 215
  • [37] Some techniques for the analysis of work sampling data
    Miller, ME
    James, MK
    Langefeld, CD
    Espeland, MA
    Freedman, JA
    Martin, DK
    Smith, DM
    STATISTICS IN MEDICINE, 1996, 15 (06) : 607 - 618
  • [38] New techniques for empirical processes of dependent data
    Dehling, Herold
    Durieu, Olivier
    Volny, Dalibor
    STOCHASTIC PROCESSES AND THEIR APPLICATIONS, 2009, 119 (10) : 3699 - 3718
  • [39] Comparison of techniques for data reconciliation of multicomponent processes
    Rao, RR
    Narasimhan, S
    INDUSTRIAL & ENGINEERING CHEMISTRY RESEARCH, 1996, 35 (04) : 1362 - 1368
  • [40] Visual processing techniques for numerical simulation data
    Cossu, R
    MODELLING AND SIMULATION 1996, 1996, : 283 - 285