Imbalanced classification of manufacturing quality conditions using cost-sensitive decision tree ensembles

被引:43
|
作者
Kim, Aekyung [1 ]
Oh, Kyuhyup [1 ]
Jung, Jae-Yoon [1 ]
Kim, Bohyun [2 ]
机构
[1] Kyung Hee Univ, Dept Ind & Management Syst Engn, Yongin, South Korea
[2] Korea Inst Ind Technol, IT Converged Proc R&D Grp, Ansan, South Korea
关键词
Imbalanced classification; manufacturing quality condition classification; decision tree ensemble; cost-sensitive ensemble classification; die-casting quality analysis; DIE-CASTING PROCESS; ARTIFICIAL NEURAL-NETWORK; PROCESS PARAMETERS; SURFACE-ROUGHNESS; GENETIC ALGORITHM; PREDICTION; OPTIMIZATION; SYSTEM; DEFECT; MACHINE;
D O I
10.1080/0951192X.2017.1407447
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Data-driven quality control techniques are being actively developed for implementation in smart factories. Quality prediction during manufacturing processes is a good example of how big data analytics can influence advanced manufacturing environments. In this paper, the problem of classifying manufacturing process conditions into normal and defective products according to defect types is dealt with. Such a quality analysis data set is generally unbalanced because the defective rate is quite low in practice. To solve this imbalanced classification problem, a cost-sensitive decision tree ensemble algorithm is adopted to boost the small number of defective cases and assign a higher cost to the misclassification of defective products than that of normal products. C4.5 decision trees are used as base classifiers, and three cost-sensitive ensembles, AdaC1, AdaC2 and AdaC3, are tried to address the imbalanced classification. A few types of defect conditions in a real-world die-casting data set were predicted through the proposed methods. In these experiments, the cost-sensitive ensembles were able to classify the imbalanced data and detect the defect conditions more precisely and more exactly than 19 algorithms in other classification categories such as classic classifiers and ensembles, cost-sensitive single classifiers and sampling-based ensembles. Especially, the AdaC2-based method mainly outperformed all other classification algorithms in terms of performance measures such as F-measure, G-means and AUC for the die-casting quality condition classification problem.
引用
收藏
页码:701 / 717
页数:17
相关论文
共 50 条
  • [31] Cost-sensitive Decision Tree Induction on Dirty Data
    Qi Z.-X.
    Wang H.-Z.
    Zhou X.
    Li J.-Z.
    Gao H.
    Ruan Jian Xue Bao/Journal of Software, 2019, 30 (03): : 604 - 619
  • [32] Missing or absent? A question in cost-sensitive decision tree
    Qin, Zhenxing
    Zhang, Shichao
    Zhang, Chengqi
    ADVANCES IN INTELLIGENT IT: ACTIVE MEDIA TECHNOLOGY 2006, 2006, 138 : 118 - +
  • [33] Cost-Sensitive Classification: Empirical Evaluation of a Hybrid Genetic Decision Tree Induction Algorithm
    Turney, Peter D.
    JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 1994, 2 : 369 - 409
  • [34] Cost-Sensitive Large margin Distribution Machine for classification of imbalanced data
    Cheng, Fanyong
    Zhang, Jing
    Wen, Cuihong
    PATTERN RECOGNITION LETTERS, 2016, 80 : 107 - 112
  • [35] Large cost-sensitive margin distribution machine for imbalanced data classification
    Cheng, Fanyong
    Zhang, Jing
    Wen, Cuihong
    Liu, Zhaohua
    Li, Zuoyong
    NEUROCOMPUTING, 2017, 224 : 45 - 57
  • [36] Cost-Sensitive Dual-Stream Residual Networks for Imbalanced Classification
    Ma, Congcong
    Mi, Jiaqi
    Gao, Wanlin
    Tao, Sha
    CMC-COMPUTERS MATERIALS & CONTINUA, 2024, 80 (03): : 4243 - 4261
  • [37] Cost-Sensitive Perceptron Decision Trees for Imbalanced Drifting Data Streams
    Krawczyk, Bartosz
    Skryjomski, Przemyslaw
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2017, PT II, 2017, 10535 : 512 - 527
  • [38] A Cost-Sensitive Based Approach for Improving Associative Classification on Imbalanced Datasets
    Waiyamai, Kitsana
    Suwannarattaphoom, Phoonperm
    MACHINE LEARNING AND DATA MINING IN PATTERN RECOGNITION, MLDM 2014, 2014, 8556 : 31 - 42
  • [39] Cost-sensitive convolutional neural networks for imbalanced time series classification
    Geng, Yue
    Luo, Xinyu
    INTELLIGENT DATA ANALYSIS, 2019, 23 (02) : 357 - 370
  • [40] Cost-Sensitive Latent Space Learning for Imbalanced PolSAR Image Classification
    Wu, Qian
    Hou, Biao
    Wen, Zaidao
    Ren, Zhongle
    Jiao, Licheng
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2021, 59 (06): : 4802 - 4817