An Iterative Undersampling of Extremely Imbalanced Data Using CSVM

被引:0
|
作者
Lee, Jong Bum [1 ,2 ]
Lee, Jee-Hyong [2 ]
机构
[1] Samsung Elect, Semicond Div, Seoul, South Korea
[2] Sungkyunkwan Univ, Coll Informat & Commun Engn, Suwon, South Korea
关键词
Heuristic undersampling method; semiconductor; imbalanced data; Cost-sensitive Support Vector Machine; SMOTE;
D O I
10.1117/12.2181517
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Semiconductor is a major component of electronic devices and is required very high reliability and productivity. If defective chip predict in advance, the product quality will be improved and productivity will increases by reduction of test cost. However, the performance of the classifiers about defective chips is very poor due to semiconductor data is extremely imbalance, as roughly 1: 1000. In this paper, the iterative undersampling method using CSVM is employed to deal with the class imbalanced. The main idea is to select the informative majority class samples around the decision boundary determined by classify. Our experimental results are reported to demonstrate that our method outperforms the other sampling methods in regard with the accuracy of defective chip in highly imbalanced data.
引用
收藏
页数:5
相关论文
共 50 条
  • [1] Evolutionary Undersampling for Extremely Imbalanced Big Data Classification under Apache Spark
    Triguero, I.
    Galar, M.
    Merino, D.
    Maillo, J.
    Bustince, H.
    Herrera, F.
    [J]. 2016 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2016, : 640 - 647
  • [2] Clustering Based Undersampling for Effective Learning from Imbalanced Data: An Iterative Approach
    Bhattacharya R.
    De R.
    Chakraborty A.
    Sarkar R.
    [J]. SN Computer Science, 5 (4)
  • [3] An approach for classification of highly imbalanced data using weighting and undersampling
    Ashish Anand
    Ganesan Pugalenthi
    Gary B. Fogel
    P. N. Suganthan
    [J]. Amino Acids, 2010, 39 : 1385 - 1391
  • [4] An approach for classification of highly imbalanced data using weighting and undersampling
    Anand, Ashish
    Pugalenthi, Ganesan
    Fogel, Gary B.
    Suganthan, P. N.
    [J]. AMINO ACIDS, 2010, 39 (05) : 1385 - 1391
  • [5] Evolutionary Undersampling for Imbalanced Big Data Classification
    Triguero, I.
    Galar, M.
    Vluymans, S.
    Cornelis, C.
    Bustince, H.
    Herrera, F.
    Saeys, Y.
    [J]. 2015 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2015, : 715 - 722
  • [6] Partial Undersampling of Imbalanced Data for Cyber Threats Detection
    Moniruzzaman, Md
    Bagirov, A. M.
    Gondal, Iqbal
    [J]. PROCEEDINGS OF THE AUSTRALASIAN COMPUTER SCIENCE WEEK MULTICONFERENCE (ACSW 2020), 2020,
  • [7] Undersampling Instance Selection for Hybrid and Incomplete Imbalanced Data
    Camacho-Nieto, Oscar
    Yanez-Marquez, Cornelio
    Villuendas-Rey, Yenny
    [J]. JOURNAL OF UNIVERSAL COMPUTER SCIENCE, 2020, 26 (06) : 698 - 719
  • [8] Radial-Based Undersampling for imbalanced data classification
    Koziarski, Michal
    [J]. PATTERN RECOGNITION, 2020, 102
  • [9] A Membership Probability–Based Undersampling Algorithm for Imbalanced Data
    Gilseung Ahn
    You-Jin Park
    Sun Hur
    [J]. Journal of Classification, 2021, 38 : 2 - 15
  • [10] On Properties of Undersampling Bagging and Its Extensions for Imbalanced Data
    Stefanowski, Jerzy
    [J]. PROCEEDINGS OF THE 9TH INTERNATIONAL CONFERENCE ON COMPUTER RECOGNITION SYSTEMS, CORES 2015, 2016, 403 : 407 - 417