BENN: Balanced Ensemble Neural Network for Handling Class Imbalance in Big Data

被引:0
|
作者
Ramesh, Sneha Halebeedu [1 ,2 ]
Basava, Annappa [1 ]
Perumal, Sankar Pariserum [1 ]
机构
[1] Natl Inst Technol Karnataka, Dept Comp Sci & Engn, Surathkal, India
[2] Nitte Meenakshi Inst Technol, Dept Informat Sci & Engn, Bengaluru, India
关键词
concept drift; decision tree regression; decision trees; machine learning; national health dataset; random forest;
D O I
10.1111/exsy.13754
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Class imbalance is a critical challenge in big data analytics, often leading to biased predictive models. This imbalance can lead to biased models that perform well on the majority class but poorly on the minority class. Many machine learning models tend to be biased towards the majority class because they aim to minimise overall error, often leading to poor performance on the minority class. This paper presents the balanced ensemble neural network, a novel solution to effectively address class imbalance in big data. Balanced ensemble neural network combines the robust capabilities of neural networks with the power of ensemble learning, incorporating class balancing strategies to ensure fair representation of minority classes. The methodology involves integrating multiple neural networks, each trained on balanced subsets of data using techniques like Synthetic Minority Over-sampling Technique and Random Undersampling. This integration aims to leverage the strengths of individual networks while reducing their inherent biases. Our extensive experiments across various datasets reveal that BENN achieves an AUC-ROC score of 0.94, surpassing other models such as random forest (0.88), support vector (0.84) and single neural net (0.80). It was also observed that BENN's performance is better compared to traditional neural network models and standard ensemble methods in key metrics like accuracy, precision, recall, F1-score and AUC-ROC. The results specifically highlight BENN's effectiveness in accurately classifying instances of minority classes, a notable challenge in many existing models. These findings underscore BENN's potential as a substantial advancement in handling class imbalance within big data environments, offering a promising direction for future research and application in machine learning.
引用
收藏
页数:13
相关论文
共 50 条
  • [1] Feedforward neural network models for handling class overlap and class imbalance
    Kretzschmar, R
    Karayiannis, NB
    Eggimann, F
    INTERNATIONAL JOURNAL OF NEURAL SYSTEMS, 2005, 15 (05) : 323 - 338
  • [2] DUEN: Dynamic ensemble handling class imbalance in network intrusion detection
    Ren, Huajuan
    Tang, Yonghe
    Dong, Weiyu
    Ren, Shuai
    Jiang, Liehui
    EXPERT SYSTEMS WITH APPLICATIONS, 2023, 229
  • [3] Hybrid Deep Neural Network for Handling Data Imbalance in Precursor MicroRNA
    Elakkiya, R.
    Jain, Deepak Kumar
    Kotecha, Ketan
    Pandya, Sharnil
    Reddy, Sai Siddhartha
    Rajalakshmi, E.
    Varadarajan, Vijayakumar
    Mahanti, Aniket
    Subramaniyaswamy, Subramaniyaswamy
    FRONTIERS IN PUBLIC HEALTH, 2021, 9
  • [4] A hybrid method based on ensemble WELM for handling multi class imbalance in cancer microarray data
    Liu, Zhen
    Tang, Deyu
    Cai, Yongming
    Wang, Ruoyu
    Chen, Fuhua
    NEUROCOMPUTING, 2017, 266 : 641 - 650
  • [5] Modeling of class imbalance handling with optimal deep learning enabled big data classification model
    Varshavardhini, S.
    Rajesh, A.
    INTELLIGENT DECISION TECHNOLOGIES-NETHERLANDS, 2023, 17 (04): : 1179 - 1197
  • [6] Experimental evaluation of ensemble classifiers for imbalance in Big Data
    Juez-Gil M.
    Arnaiz-González Á.
    Rodríguez J.J.
    García-Osorio C.
    Applied Soft Computing, 2021, 108
  • [7] A novel cost sensitive neural network ensemble for multiclass imbalance data learning
    Cao, Peng
    Li, Bo
    Zhao, Dazhe
    Zaiane, Osmar
    2013 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2013,
  • [8] Online sparse class imbalance learning on big data
    Maurya, Chandresh Kumar
    Toshniwal, Durga
    Venkoparao, Gopalan Vijendran
    NEUROCOMPUTING, 2016, 216 : 250 - 260
  • [9] A Novel Ensemble-Learning-Based Convolution Neural Network for Handling Imbalanced Data
    Wu, Xianbin
    Wen, Chuanbo
    Wang, Zidong
    Liu, Weibo
    Yang, Junjie
    COGNITIVE COMPUTATION, 2024, 16 (01) : 177 - 190
  • [10] A Novel Ensemble-Learning-Based Convolution Neural Network for Handling Imbalanced Data
    Xianbin Wu
    Chuanbo Wen
    Zidong Wang
    Weibo Liu
    Junjie Yang
    Cognitive Computation, 2024, 16 : 177 - 190