Imbalanced classification of manufacturing quality conditions using cost-sensitive decision tree ensembles

被引:43
|
作者
Kim, Aekyung [1 ]
Oh, Kyuhyup [1 ]
Jung, Jae-Yoon [1 ]
Kim, Bohyun [2 ]
机构
[1] Kyung Hee Univ, Dept Ind & Management Syst Engn, Yongin, South Korea
[2] Korea Inst Ind Technol, IT Converged Proc R&D Grp, Ansan, South Korea
关键词
Imbalanced classification; manufacturing quality condition classification; decision tree ensemble; cost-sensitive ensemble classification; die-casting quality analysis; DIE-CASTING PROCESS; ARTIFICIAL NEURAL-NETWORK; PROCESS PARAMETERS; SURFACE-ROUGHNESS; GENETIC ALGORITHM; PREDICTION; OPTIMIZATION; SYSTEM; DEFECT; MACHINE;
D O I
10.1080/0951192X.2017.1407447
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Data-driven quality control techniques are being actively developed for implementation in smart factories. Quality prediction during manufacturing processes is a good example of how big data analytics can influence advanced manufacturing environments. In this paper, the problem of classifying manufacturing process conditions into normal and defective products according to defect types is dealt with. Such a quality analysis data set is generally unbalanced because the defective rate is quite low in practice. To solve this imbalanced classification problem, a cost-sensitive decision tree ensemble algorithm is adopted to boost the small number of defective cases and assign a higher cost to the misclassification of defective products than that of normal products. C4.5 decision trees are used as base classifiers, and three cost-sensitive ensembles, AdaC1, AdaC2 and AdaC3, are tried to address the imbalanced classification. A few types of defect conditions in a real-world die-casting data set were predicted through the proposed methods. In these experiments, the cost-sensitive ensembles were able to classify the imbalanced data and detect the defect conditions more precisely and more exactly than 19 algorithms in other classification categories such as classic classifiers and ensembles, cost-sensitive single classifiers and sampling-based ensembles. Especially, the AdaC2-based method mainly outperformed all other classification algorithms in terms of performance measures such as F-measure, G-means and AUC for the die-casting quality condition classification problem.
引用
收藏
页码:701 / 717
页数:17
相关论文
共 50 条
  • [1] Cost-sensitive decision tree ensembles for effective imbalanced classification
    Krawczyk, Bartosz
    Wozniak, Michal
    Schaefer, Gerald
    APPLIED SOFT COMPUTING, 2014, 14 : 554 - 562
  • [2] Swarm-based Cost-sensitive Decision Tree Using Optimized Rules for Imbalanced Data Classification
    Mansouri, Mehdi
    Nadimi-Shahraki, Mohammad H.
    Beheshti, Zahra
    JOURNAL OF BIONIC ENGINEERING, 2025,
  • [3] Cost-sensitive decision tree learning for forensic classification
    Davis, Jason V.
    Ha, Jungwoo
    Rossbach, Christopher J.
    Ramadan, Hany E.
    Witchel, Emmett
    MACHINE LEARNING: ECML 2006, PROCEEDINGS, 2006, 4212 : 622 - 629
  • [4] Cost-sensitive boosting for classification of imbalanced data
    Sun, Yamnin
    Kamel, Mohamed S.
    Wong, Andrew K. C.
    Wang, Yang
    PATTERN RECOGNITION, 2007, 40 (12) : 3358 - 3378
  • [5] Cost-Sensitive Decision Tree Learning
    Vadera, Sunil
    PROCEEDINGS 2019 AMITY INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE (AICAI), 2019, : 4 - 5
  • [6] Hybrid cost-sensitive decision tree
    Sheng, SL
    Ling, CX
    KNOWLEDGE DISCOVERY IN DATABASES: PKDD 2005, 2005, 3721 : 274 - 284
  • [7] Improving Imbalanced Dialogue Act Classification Using Cost-Sensitive Learning
    Miyagi, Takaaki
    Endo, Satoshi
    2022 JOINT 12TH INTERNATIONAL CONFERENCE ON SOFT COMPUTING AND INTELLIGENT SYSTEMS AND 23RD INTERNATIONAL SYMPOSIUM ON ADVANCED INTELLIGENT SYSTEMS (SCIS&ISIS), 2022,
  • [8] AdaCC: cumulative cost-sensitive boosting for imbalanced classification
    Iosifidis, Vasileios
    Papadopoulos, Symeon
    Rosenhahn, Bodo
    Ntoutsi, Eirini
    KNOWLEDGE AND INFORMATION SYSTEMS, 2023, 65 (02) : 789 - 826
  • [9] COST-SENSITIVE SPFCNN MINER FOR CLASSIFICATION OF IMBALANCED DATA
    Zhao, Linchang
    Shang, Zhaowei
    Zhao, Ling
    Wei, Yu
    Tang, Yuan Yan
    PROCEEDINGS OF 2019 INTERNATIONAL CONFERENCE ON WAVELET ANALYSIS AND PATTERN RECOGNITION (ICWAPR), 2019, : 51 - 57
  • [10] Cost-Sensitive Ensemble Learning for Highly Imbalanced Classification
    Johnson, Justin M.
    Khoshgoftaar, Taghi M.
    2022 21ST IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS, ICMLA, 2022, : 1427 - 1434