Systematic review of class imbalance problems in manufacturing

被引:5
|
作者
de Giorgio, Andrea [1 ]
Cola, Gabriele [2 ]
Wang, Lihui [3 ]
机构
[1] Artificial Engn, Via R Sirignano 10, I-80121 Naples, Italy
[2] Univ Cattolica Sacro Cuore, Largo Agostino Gemelli 1, I-20123 Milan, Italy
[3] KTH Royal Inst Technol, Dept Prod Engn, Brinellvagen 68, S-11428 Stockholm, Sweden
关键词
Manufacturing; Class imbalance; Data manipulation; Machine learning; Deep learning; NEAREST-NEIGHBOR RULE; GENERATIVE ADVERSARIAL NETWORK; CONVOLUTIONAL NEURAL-NETWORK; FAULT-DETECTION; MAHALANOBIS DISTANCE; LEARNING-METHOD; LOGISTIC-REGRESSION; CONCEPT DRIFT; FUZZY ARTMAP; CLASSIFICATION;
D O I
10.1016/j.jmsy.2023.10.014
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Class imbalance (CI) is a well-known problem in data science. Nowadays, it is affecting the data modeling of many of the real-world processes that are being digitized. The manufacturing industry turns out to be highly affected by this problem, especially in fault inspection, prediction or monitoring processes, and in all those processes where the production efficiency is high and the data samples of anomalous events are rare. In this work, we systematically review all the data manipulation, machine learning or deep learning solutions to the CI problem in the manufacturing domain. We also critically evaluate all the different metrics that researchers can compare in order to estimate the improvements carried by their proposed solutions, and we look at the availability of public source code and data-imbalanced datasets that can be used for benchmarking. Finally, we summarize the most applied solutions to the CI problem in manufacturing and we look at future challenges. While posing a reference for the best practices at the time of this review, we challenge researchers to standardize the use of data science algorithms for CI in the manufacturing domain.
引用
收藏
页码:620 / 644
页数:25
相关论文
共 50 条
  • [41] BOOTSTRAP-BASED SVM AGGREGATION FOR CLASS IMBALANCE PROBLEMS
    Sukhanov, S.
    Merentitis, A.
    Debes, C.
    Hahn, J.
    Zoubir, A. M.
    [J]. 2015 23RD EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2015, : 165 - 169
  • [42] A systematic study of the class imbalance problem in convolutional neural networks
    Buda, Mateusz
    Maki, Atsuto
    Mazurowski, Maciej A.
    [J]. NEURAL NETWORKS, 2018, 106 : 249 - 259
  • [43] A Systematic Study of Online Class Imbalance Learning With Concept Drift
    Wang, Shuo
    Minku, Leandro L.
    Yao, Xin
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2018, 29 (10) : 4802 - 4821
  • [44] Tackling class imbalance in computer vision: a contemporary review
    Manisha Saini
    Seba Susan
    [J]. Artificial Intelligence Review, 2023, 56 : 1279 - 1335
  • [45] Tackling class imbalance in computer vision: a contemporary review
    Saini, Manisha
    Susan, Seba
    [J]. ARTIFICIAL INTELLIGENCE REVIEW, 2023, 56 (SUPPL 1) : 1279 - 1335
  • [46] Oversampling Methods to Handle the Class Imbalance Problem: A Review
    Sharma, Harsh
    Gosain, Anushika
    [J]. SOFT COMPUTING AND ITS ENGINEERING APPLICATIONS, ICSOFTCOMP 2022, 2023, 1788 : 96 - 110
  • [47] A Review on Solution to Class Imbalance Problem: Undersampling Approaches
    Devi, Debashree
    Biswas, Saroj K.
    Purkayastha, Biswajit
    [J]. 2020 INTERNATIONAL CONFERENCE ON COMPUTATIONAL PERFORMANCE EVALUATION (COMPE-2020), 2020, : 626 - 631
  • [48] Class-overlap undersampling based on Schur decomposition for Class-imbalance problems
    Dai, Qi
    Liu, Jian-wei
    Shi, Yong-hui
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2023, 221
  • [49] Imbalance factor: a simple new scale for measuring inter-class imbalance extent in classification problems
    Pirizadeh, Mohsen
    Farahani, Hadi
    Kheradpisheh, Saeed Reza
    [J]. KNOWLEDGE AND INFORMATION SYSTEMS, 2023, 65 (10) : 4157 - 4183
  • [50] Imbalance factor: a simple new scale for measuring inter-class imbalance extent in classification problems
    Mohsen Pirizadeh
    Hadi Farahani
    Saeed Reza Kheradpisheh
    [J]. Knowledge and Information Systems, 2023, 65 : 4157 - 4183