Experimental Comparison of Sampling Techniques for Imbalanced Datasets Using Various Classification Models

Cited by: 5
Authors
Pattanayak, Sanjibani Sudha [1 ]
Rout, Minakhi [1 ]
Affiliations
[1] Siksha O Anusandhan Univ, ITER, Bhubaneswar 751030, Odisha, India
Keywords
Sampling techniques; SMOTE; MWMOTE; SVM; RBF; MLP
DOI
10.1007/978-981-10-6875-1_2
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
An imbalanced dataset is one in which the number of samples per class is highly uneven. This makes classification difficult, because the result may be biased toward the dominant (majority) class. Misclassifying minority-class samples, which are usually the samples of interest, is however far more costly. Among the various approaches studied to address this problem, sampling techniques have been successfully adopted to preprocess imbalanced datasets. This paper presents an experimental comparison of two pioneering sampling techniques, SMOTE and MWMOTE, using the classification models SVM, RBF, and MLP.
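For readers unfamiliar with the oversampling idea named in the abstract, the following is a minimal sketch of SMOTE-style interpolation, not the authors' implementation. It assumes plain NumPy, Euclidean distance, and a made-up toy minority class; the function name `smote_sketch` and its parameters are hypothetical. MWMOTE's additional weighting and clustering steps are not shown.

```python
import numpy as np


def smote_sketch(X_min, n_new, k=5, seed=0):
    """Create n_new synthetic minority samples by SMOTE-style interpolation.

    Hypothetical helper for illustration only: each synthetic point lies on
    the line segment between a minority sample and one of its k nearest
    minority-class neighbours.
    """
    rng = np.random.default_rng(seed)
    X_min = np.asarray(X_min, dtype=float)
    n = len(X_min)
    if n < 2:
        raise ValueError("need at least two minority samples to interpolate")
    k = min(k, n - 1)  # cannot use more neighbours than there are other points

    # Pairwise Euclidean distances between minority samples.
    dists = np.linalg.norm(X_min[:, None, :] - X_min[None, :, :], axis=2)
    np.fill_diagonal(dists, np.inf)  # a sample is not its own neighbour
    neighbours = np.argsort(dists, axis=1)[:, :k]

    base = rng.integers(0, n, size=n_new)   # index of the starting sample
    pick = rng.integers(0, k, size=n_new)   # which of its k neighbours to use
    nb = neighbours[base, pick]
    gap = rng.random((n_new, 1))            # position along the segment, in [0, 1)
    return X_min[base] + gap * (X_min[nb] - X_min[base])


# Toy data (made up for illustration): four minority points in 2-D.
X_minority = np.array([[1.0, 1.0], [1.2, 0.9], [0.8, 1.1], [1.1, 1.3]])
synthetic = smote_sketch(X_minority, n_new=6, k=2)
print(synthetic.shape)
```

In an experiment like the one described, the synthetic rows would be appended to the training set with the minority label before fitting a classifier; the test set is normally left unresampled.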
Pages: 13-22
Page count: 10