Experimental Comparison of Sampling Techniques for Imbalanced Datasets Using Various Classification Models

Cited by: 5
Authors
Pattanayak, Sanjibani Sudha [1]
Rout, Minakhi [1]
Affiliation
[1] Siksha O Anusandhan Univ, ITER, Bhubaneswar 751030, Odisha, India
Keywords
Sampling techniques; SMOTE; MWMOTE; SVM; RBF; MLP
DOI
10.1007/978-981-10-6875-1_2
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
An imbalanced dataset is one in which the number of samples differs greatly across classes. This makes classification difficult, because the result may be biased toward the dominant (majority) class, while misclassifying minority-class samples, which are usually the samples of interest, is far more costly. Among the many approaches studied for this problem, sampling techniques have been successfully adopted to preprocess imbalanced datasets. This paper presents an experimental comparison of two pioneering sampling techniques, SMOTE and MWMOTE, using the classification models SVM, RBF, and MLP.
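The record does not describe how the authors implemented their sampling step, so the following is only a minimal sketch of the general SMOTE idea: a synthetic minority sample is created by interpolating between a minority sample and one of its nearest minority-class neighbours. The function name `smote_sketch`, its parameters, and the toy data are assumptions made for illustration; MWMOTE, which additionally weights and clusters minority samples, is not shown.

```python
import random


def smote_sketch(minority, n_new, k=3, seed=0):
    """Create n_new synthetic minority samples by SMOTE-style interpolation.

    minority: list of feature vectors (lists of floats), at least two samples.
    k: number of nearest minority neighbours to choose from (assumed default).
    """
    rng = random.Random(seed)
    synthetic = []
    for _ in range(n_new):
        # Pick a random minority sample as the starting point.
        base = rng.choice(minority)
        # Rank the other minority samples by squared Euclidean distance.
        others = [p for p in minority if p is not base]
        others.sort(key=lambda p: sum((a - b) ** 2 for a, b in zip(base, p)))
        # Choose one of the k nearest neighbours at random.
        neighbour = rng.choice(others[:k])
        # Place the new sample at a random position on the connecting segment.
        gap = rng.random()
        synthetic.append([a + gap * (b - a) for a, b in zip(base, neighbour)])
    return synthetic


# Hypothetical toy minority class: four points on the unit square.
minority_points = [[0.0, 0.0], [1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
new_points = smote_sketch(minority_points, n_new=6)
print(len(new_points), "synthetic samples generated")
```

Because each synthetic sample lies on a segment between two existing minority samples, the oversampled class stays inside the region the minority class already occupies. The balanced data would then be passed to a classifier such as SVM, RBF, or MLP.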
Pages: 13-22
Number of pages: 10