Fuzzy Distance-based Undersampling Technique for Imbalanced Flood Data

被引:0
|
作者
Mahamud, Ku Ruhana Ku [1 ]
Zorkeflee, Maisarah [1 ]
Din, Aniza Mohamed [1 ]
机构
[1] Univ Utara Malaysia, Changlun, Malaysia
关键词
imbalanced flood data; resampling technique; fuzzy distance-based undersampling; fuzzy logic;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Performances of classifiers are affected by imbalanced data because instances in the minority class are often ignored. Imbalanced data often occur in many application domains including flood. If flood cases are misclassified, the impact of flood is higher than the misclassification of non-flood cases. Numerous resampling techniques such as undersampling and oversampling have been used to overcome the problem of misclassification of imbalanced data. However, the undersampling and oversampling techniques suffer from elimination of relevant data and overfitting, which may lead to poor classification results. This paper proposes a Fuzzy Distance-based Undersampling (FDUS) technique to increase classification accuracy. Entropy estimation is used to generate fuzzy thresholds which are used to categorise the instances in majority and minority classes into membership functions. The performance of FDUS was compared with three techniques based on Fmeasure and G-mean, experimented on flood data. From the results, FDUS achieved better F-measure and G-mean compared to the other techniques which showed that the FDUS was able to reduce the elimination of relevant data.
引用
收藏
页码:509 / 513
页数:5
相关论文
共 50 条
  • [1] Distance-based arranging oversampling technique for imbalanced data
    Qi Dai
    Jian-wei Liu
    Jia-Liang Zhao
    [J]. Neural Computing and Applications, 2023, 35 : 1323 - 1342
  • [2] Distance-based arranging oversampling technique for imbalanced data
    Dai, Qi
    Liu, Jian-wei
    Zhao, Jia-Liang
    [J]. NEURAL COMPUTING & APPLICATIONS, 2023, 35 (02): : 1323 - 1342
  • [3] An Earth mover's distance-based undersampling approach for handling class-imbalanced data
    Rekha, Gillala
    Krishna Reddy, V.
    Tyagi, Amit Kumar
    [J]. International Journal of Intelligent Information and Database Systems, 2020, 13 (2-4) : 376 - 392
  • [5] Classifying imbalanced data in distance-based feature space
    Shin Ando
    [J]. Knowledge and Information Systems, 2016, 46 : 707 - 730
  • [6] A Distance-Based Weighted Undersampling Scheme for Support Vector Machines and its Application to Imbalanced Classification
    Kang, Qi
    Shi, Lei
    Zhou, MengChu
    Wang, XueSong
    Wu, Qidi
    Wei, Zhi
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2018, 29 (09) : 4152 - 4165
  • [7] A fuzzy rough set-based undersampling approach for imbalanced data
    Zhang, Xiao
    He, Zhaoqian
    Yang, Yanyan
    [J]. INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2024, 15 (07) : 2799 - 2810
  • [8] MLP-Based Undersampling Technique for Imbalanced Learning
    Babar, Varsha
    Ade, Roshani
    [J]. 2016 INTERNATIONAL CONFERENCE ON AUTOMATIC CONTROL AND DYNAMIC OPTIMIZATION TECHNIQUES (ICACDOT), 2016, : 142 - 147
  • [9] A Wasserstein Distance-Based Cost-Sensitive Framework for Imbalanced Data Classification
    Feng, Rui
    Ji, Hongbing
    Zhu, Zhigang
    Wang, Lei
    [J]. RADIOENGINEERING, 2023, 32 (03) : 451 - 466
  • [10] A Kemeny Distance-Based Robust Fuzzy Clustering for Preference Data
    Pierpaolo D’Urso
    Vincenzina Vitale
    [J]. Journal of Classification, 2022, 39 : 600 - 647