Analysis of SMOTE: Modified for Diverse Imbalanced Datasets Under the IoT Environment

被引:3
|
作者
Bansal, Ankita [1 ]
Saini, Makul [1 ]
Singh, Rakshit [1 ]
Yadav, Jai Kumar [1 ]
机构
[1] Netaji Subhas Univ Technol, Delhi, India
关键词
Class Imbalance Problem; Confusion Matrix; Data Sampling; Fraud Detection; Machine Learning Classifiers; Oversampling; Random Oversampling; Undersampling; CLASSIFICATION;
D O I
10.4018/IJIRR.2021040102
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The tremendous amount of data generated through IoT can be imbalanced causing class imbalance problem (CIP). CIP is one of the major issues in machine learning where most of the samples belong to one of the classes, thus producing biased classifiers. The authors in this paper are working on four imbalanced datasets belonging to diverse domains. The objective of this study is to deal with CIP using oversampling techniques. One of the commonly used oversampling approaches is synthetic minority oversampling technique (SMOTE). In this paper, the authors have suggested modifications in SMOTE and proposed their own algorithm, SMOTE-modified (SMOTE-M). To provide a fair evaluation, it is compared with three oversampling approaches, SMOTE, adaptive synthetic oversampling (ADASYN), and SMOTE-Adaboost. To evaluate the performances of sampling approaches, models are constructed using four classifiers (K-nearest neighbour, decision tree, naive Bayes, logistic regression) on balanced and imbalanced datasets. The study shows that the results of SMOTE-M are comparable to that of ADASYN and SMOTE-Adaboost.
引用
收藏
页码:15 / 37
页数:23
相关论文
共 50 条
  • [31] Clustering the Imbalanced Datasets using Modified Kohonen Self-Organizing Map (KSOM)
    Ahmad, Azlin
    Yusoff, Rubiyah
    Ismail, Mohd Najib
    Rosli, Nenny Ruthfalydia
    [J]. 2017 COMPUTING CONFERENCE, 2017, : 751 - 755
  • [32] EWT-SMOTE to improve default prediction performance in imbalanced data: Analysis of Chinese data
    Zhou, Ying
    Lin, Xia
    Chi, Guotai
    Jin, Peng
    Li, Mengtong
    [J]. JOURNAL OF FORECASTING, 2024, 43 (03) : 615 - 643
  • [33] Prediction of Emergency Mobility Under Diverse IoT Availability
    Sun, Bin
    Geng, Renkang
    Xu, Yuan
    Shen, Tao
    [J]. EAI Endorsed Transactions on Pervasive Health and Technology, 2022, 8 (04)
  • [34] A Comparative Analysis of Convergence Rate for Imbalanced Datasets of Active Learning Models
    Zhang, Haoke
    Wu, Wanqing
    Pirbhulal, Sandeep
    Li, Guanglin
    Zhang, Hongyi
    [J]. 2018 IEEE 23RD INTERNATIONAL CONFERENCE ON DIGITAL SIGNAL PROCESSING (DSP), 2018,
  • [35] Trajectory Prediction for Heterogeneous Agents: A Performance Analysis on Small and Imbalanced Datasets
    de Almeida, Tiago Rodrigues
    Zhu, Yufei
    Rudenko, Andrey
    Kucner, Tomasz P.
    Stork, Johannes A.
    Magnusson, Martin
    Lilienthal, Achim J.
    [J]. IEEE ROBOTICS AND AUTOMATION LETTERS, 2024, 9 (07): : 6576 - 6583
  • [36] A data complexity analysis on imbalanced datasets and an alternative imbalance recovering strategy
    Weng, Cheng G.
    Poon, Josiah
    [J]. 2006 IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE, (WI 2006 MAIN CONFERENCE PROCEEDINGS), 2006, : 270 - +
  • [37] IoT Nodes Behavior Analysis Under Constrained Environment Using RPL Protocol
    Ibrahimy, Saloua
    Lamaazi, Hanane
    Benamar, Nabil
    [J]. 2021 3RD IEEE MIDDLE EAST AND NORTH AFRICA COMMUNICATIONS CONFERENCE (MENACOMM), 2021, : 74 - 79
  • [38] SEMANTIC CONCEPT DETECTION IN IMBALANCED DATASETS BASED ON DIFFERENT UNDER-SAMPLING STRATEGIES
    Guo, Jinlin
    Foley, Colum
    Gurrin, Cathal
    Lao, Songyang
    [J]. 2011 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2011,
  • [39] Addressing data complexity for imbalanced data sets: analysis of SMOTE-based oversampling and evolutionary undersampling
    Julián Luengo
    Alberto Fernández
    Salvador García
    Francisco Herrera
    [J]. Soft Computing, 2011, 15 : 1909 - 1936
  • [40] Effect of Synthetic Minority Oversampling Technique (SMOTE), Feature Representation, and Classification Algorithm on Imbalanced Sentiment Analysis
    Satriaji, Widi
    Kusumaningrum, Retno
    [J]. 2018 2ND INTERNATIONAL CONFERENCE ON INFORMATICS AND COMPUTATIONAL SCIENCES (ICICOS), 2018, : 99 - 103