Neural Network-Based Undersampling Techniques

被引:26
|
作者
Arefeen, Md Adnan [1 ,2 ]
Nimi, Sumaiya Tabassum [1 ,2 ]
Rahman, M. Sohel [3 ]
机构
[1] Univ Missouri, Dept Comp Sci Elect Engn, Kansas City, MO 64110 USA
[2] United Int Univ, Dept Comp Sci & Engn, Dhaka 1209, Bangladesh
[3] Bangladesh Univ Engn & Technol, Dept Comp Sci & Engn, Dhaka 1205, Bangladesh
关键词
Task analysis; Noise measurement; Neurons; Machine learning algorithms; Computer science; Genetic algorithms; Autoencoder; class imbalance; classification; neural network; undersampling; CLASSIFICATION; IMBALANCE; FRAUD; SMOTE;
D O I
10.1109/TSMC.2020.3016283
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Machine learning models have gained popularity nowadays for their potential to solve real-life issues when trained on pertinent data. In many cases, the real-life data are class imbalanced and hence the corresponding machine learning models trained on the data tend to perform poorly on metrics like precision, recall, AUC, F1, and G-mean score. Since class imbalance issue poses serious challenges to the performance of trained models, a multitude of research works have addressed this issue. Two common data-based sampling techniques have mostly been proposed-undersampling the data of the majority class and oversampling the data of the minority class. In this article, we focus on the former approach. We propose two novel algorithms that employ neural network-based approaches to remove majority samples that are found to reside in the vicinity of the minority samples, thereby undersampling the former to remove (or alleviate) the imbalance issue. We delineate the proposed algorithms and then test the proposed algorithms on some publicly available imbalanced datasets. We then compare the performance of our proposed algorithms to other popular undersampling algorithms. Finally, we conclude that our proposed algorithms outperform most of the existing undersampling approaches on most performance metrics.
引用
收藏
页码:1111 / 1120
页数:10
相关论文
共 50 条
  • [1] Neural network-based data mining techniques for steel making
    Sarma, RK
    Gupta, A
    Vadhavkar, S
    STATISTICAL DATA MINING AND KNOWLEDGE DISCOVERY, 2004, : 401 - 414
  • [2] Deep Neural Network-Based Filtering Techniques for Data Assimilation
    Hoang, Truong-Vinh
    Matthies, Hermann G.
    ERCIM NEWS, 2020, (122): : 23 - 23
  • [3] Shape extraction: A comparative study between neural network-based and conventional techniques
    Datta, A
    Parui, SK
    NEURAL COMPUTING & APPLICATIONS, 1998, 7 (04): : 343 - 355
  • [4] Use of Neural Network-Based Deep Learning Techniques for the Diagnostics of Skin Diseases
    D. A. Gavrilov
    A. V. Melerzanov
    N. N. Shchelkunov
    E. I. Zakirov
    Biomedical Engineering, 2019, 52 : 348 - 352
  • [5] A systematic review of convolutional neural network-based structural condition assessment techniques
    Sony, Sandeep
    Dunphy, Kyle
    Sadhu, Ayan
    Capretz, Miriam
    ENGINEERING STRUCTURES, 2021, 226
  • [6] Use of Neural Network-Based Deep Learning Techniques for the Diagnostics of Skin Diseases
    Gavrilov, D. A.
    Melerzanov, A., V
    Shchelkunov, N. N.
    Zakirov, E., I
    BIOMEDICAL ENGINEERING-MEDITSINSKAYA TEKNIKA, 2019, 52 (05): : 348 - 352
  • [7] Shape extraction: A comparative study between neural network-based and conventional techniques
    A. Datta
    S. K. Parui
    Neural Computing & Applications, 1998, 7 : 343 - 355
  • [8] Recent advances in neural network-based inverse modeling techniques for microwave applications
    Jin, Jing
    Feng, Feng
    Na, Weicong
    Yan, Shuxia
    Liu, Wenyuan
    Zhu, Lin
    Zhang, Qi-Jun
    INTERNATIONAL JOURNAL OF NUMERICAL MODELLING-ELECTRONIC NETWORKS DEVICES AND FIELDS, 2020, 33 (06)
  • [9] Neural network-based face detection
    Rowley, HA
    Baluja, S
    Kanade, T
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1998, 20 (01) : 23 - 38
  • [10] Neural network-based face detection
    Rowley, HA
    Baluja, S
    Kanade, T
    IMAGE UNDERSTANDING WORKSHOP, 1996 PROCEEDINGS, VOLS I AND II, 1996, : 725 - 735