Systematic review of class imbalance problems in manufacturing

被引:5
|
作者
de Giorgio, Andrea [1 ]
Cola, Gabriele [2 ]
Wang, Lihui [3 ]
机构
[1] Artificial Engn, Via R Sirignano 10, I-80121 Naples, Italy
[2] Univ Cattolica Sacro Cuore, Largo Agostino Gemelli 1, I-20123 Milan, Italy
[3] KTH Royal Inst Technol, Dept Prod Engn, Brinellvagen 68, S-11428 Stockholm, Sweden
关键词
Manufacturing; Class imbalance; Data manipulation; Machine learning; Deep learning; NEAREST-NEIGHBOR RULE; GENERATIVE ADVERSARIAL NETWORK; CONVOLUTIONAL NEURAL-NETWORK; FAULT-DETECTION; MAHALANOBIS DISTANCE; LEARNING-METHOD; LOGISTIC-REGRESSION; CONCEPT DRIFT; FUZZY ARTMAP; CLASSIFICATION;
D O I
10.1016/j.jmsy.2023.10.014
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Class imbalance (CI) is a well-known problem in data science. Nowadays, it is affecting the data modeling of many of the real-world processes that are being digitized. The manufacturing industry turns out to be highly affected by this problem, especially in fault inspection, prediction or monitoring processes, and in all those processes where the production efficiency is high and the data samples of anomalous events are rare. In this work, we systematically review all the data manipulation, machine learning or deep learning solutions to the CI problem in the manufacturing domain. We also critically evaluate all the different metrics that researchers can compare in order to estimate the improvements carried by their proposed solutions, and we look at the availability of public source code and data-imbalanced datasets that can be used for benchmarking. Finally, we summarize the most applied solutions to the CI problem in manufacturing and we look at future challenges. While posing a reference for the best practices at the time of this review, we challenge researchers to standardize the use of data science algorithms for CI in the manufacturing domain.
引用
收藏
页码:620 / 644
页数:25
相关论文
共 50 条
  • [1] A Review of Fuzzy and Pattern-Based Approaches for Class Imbalance Problems
    Lin, Ismael
    Loyola-Gonzalez, Octavio
    Monroy, Raul
    Medina-Perez, Miguel Angel
    [J]. APPLIED SCIENCES-BASEL, 2021, 11 (14):
  • [2] A systematic review for class-imbalance in semi-supervised learning
    de Oliveira, Willian Dihanster Gomes
    Berton, Lilian
    [J]. ARTIFICIAL INTELLIGENCE REVIEW, 2023, 56 (SUPPL 2) : 2349 - 2382
  • [3] A systematic review for class-imbalance in semi-supervised learning
    Willian Dihanster Gomes de Oliveira
    Lilian Berton
    [J]. Artificial Intelligence Review, 2023, 56 : 2349 - 2382
  • [4] On the existence of a threshold in class imbalance problems
    Silva, Evandro J. R.
    Zanchettin, Cleber
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC 2015): BIG DATA ANALYTICS FOR HUMAN-CENTRIC SYSTEMS, 2015, : 2714 - 2719
  • [5] Strategies for learning in class imbalance problems
    Barandela, R
    Sánchez, JS
    García, V
    Rangel, E
    [J]. PATTERN RECOGNITION, 2003, 36 (03) : 849 - 851
  • [6] Online Class Imbalance Learning for Quality Estimation in Manufacturing
    Lee, Kee Jin
    [J]. 2018 IEEE 23RD INTERNATIONAL CONFERENCE ON EMERGING TECHNOLOGIES AND FACTORY AUTOMATION (ETFA), 2018, : 1007 - 1014
  • [7] An Evaluation of Classifier Ensembles for Class Imbalance Problems
    Krawczyk, Bartosz
    Schaefer, Gerald
    Wozniak, Michal
    [J]. 2013 INTERNATIONAL CONFERENCE ON INFORMATICS, ELECTRONICS & VISION (ICIEV), 2013,
  • [8] Unsupervised Ensemble Learning for Class Imbalance Problems
    Liu, Zihan
    Wu, Dongrui
    [J]. 2018 CHINESE AUTOMATION CONGRESS (CAC), 2018, : 3593 - 3600
  • [9] On the Performance of Oversampling Techniques for Class Imbalance Problems
    Kong, Jiawen
    Rios, Thiago
    Kowalczyk, Wojtek
    Menzel, Stefan
    Back, Thomas
    [J]. ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2020, PT II, 2020, 12085 : 84 - 96
  • [10] Model validation failure in class imbalance problems
    Kang, Seokho
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2020, 146