Bayesian Optimization Cost-Sensitive XGBoost Learning Algorithm for Imbalanced Data in Semiconductor Industry

Cited by: 0
Authors
Shamsudin, Haziqah [1]
Yusof, Umi Kalsom [1]
Kashif, Fizza [1]
Isa, Iza Sazanita [1, 2]
Affiliations
[1] Univ Sains Malaysia, Sch Comp Sci, George Town, Malaysia
[2] Univ Teknol MARA, Coll Engn, Ctr Elect Engn Studies, George Town, Malaysia
Keywords
XGBoost learning algorithm; Cost-sensitivity; Imbalanced data; Semiconductor classification; Ensembled model; CLASSIFICATION;
DOI
10.5455/jjee.204-1671971895
Chinese Library Classification
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
This paper proposes an improved ensemble learning model that combines extreme gradient boosting (XGBoost) with a Bayesian optimization cost-sensitive learning algorithm to handle highly imbalanced data from the semiconductor process. The goal is to achieve the highest possible accuracy (recall) on both the pass and fail classes. Most existing models are biased toward the majority class and neglect the minority class. The proposed Bayesian optimization cost-sensitive XGBoost model is configured for a semiconductor dataset. Experimental results on a benchmark semiconductor industry dataset show accuracies of 91.46% for the pass class and 23.08% for the fail class. This confirms that the proposed model is effective for imbalanced cases in semiconductor applications. The investigation also shows that the proposed model not only maintains performance on the majority class but also classifies the minority class well.
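The two quantities the abstract relies on, per-class accuracy (recall) and a cost that favors the minority class, can be illustrated with a minimal standard-library sketch. The labels below are hypothetical toy data, and the helper names `per_class_recall` and `imbalance_weight` are invented for illustration. The majority-to-minority ratio is one common starting value for XGBoost's `scale_pos_weight` parameter; this record does not describe the cost scheme or the Bayesian search space the authors actually used.

```python
from collections import Counter


def per_class_recall(y_true, y_pred):
    """Return {label: recall} for every label present in y_true."""
    totals = Counter(y_true)
    # Count correct predictions per true label.
    hits = Counter(t for t, p in zip(y_true, y_pred) if t == p)
    return {label: hits[label] / totals[label] for label in totals}


def imbalance_weight(y_true, minority_label):
    """Ratio of majority-class count to minority-class count (binary case)."""
    counts = Counter(y_true)
    minority = counts[minority_label]
    majority = sum(counts.values()) - minority
    return majority / minority


# Hypothetical toy labels (not the paper's data): 0 = pass, 1 = fail.
y_true = [0] * 8 + [1] * 2
y_pred = [0] * 7 + [1] + [1, 0]

recalls = per_class_recall(y_true, y_pred)
weight = imbalance_weight(y_true, minority_label=1)

print("pass recall:", recalls[0])
print("fail recall:", recalls[1])
print("majority/minority ratio:", weight)
```

In a cost-sensitive setup, a ratio like `weight` would typically be treated as one tunable hyperparameter among several, with the optimizer scoring each candidate on the per-class recalls rather than on overall accuracy.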
Pages: 552-565
Page count: 14