BENN: Balanced Ensemble Neural Network for Handling Class Imbalance in Big Data

被引：0

作者：

Ramesh, Sneha Halebeedu ^{[1
,2
]}

Basava, Annappa ^{[1
]}

Perumal, Sankar Pariserum ^{[1
]}

机构：

[1] Natl Inst Technol Karnataka, Dept Comp Sci & Engn, Surathkal, India

[2] Nitte Meenakshi Inst Technol, Dept Informat Sci & Engn, Bengaluru, India

来源：

EXPERT SYSTEMS | 2025年 / 42卷 / 02期

关键词：

concept drift; decision tree regression; decision trees; machine learning; national health dataset; random forest;

D O I：

10.1111/exsy.13754

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Class imbalance is a critical challenge in big data analytics, often leading to biased predictive models. This imbalance can lead to biased models that perform well on the majority class but poorly on the minority class. Many machine learning models tend to be biased towards the majority class because they aim to minimise overall error, often leading to poor performance on the minority class. This paper presents the balanced ensemble neural network, a novel solution to effectively address class imbalance in big data. Balanced ensemble neural network combines the robust capabilities of neural networks with the power of ensemble learning, incorporating class balancing strategies to ensure fair representation of minority classes. The methodology involves integrating multiple neural networks, each trained on balanced subsets of data using techniques like Synthetic Minority Over-sampling Technique and Random Undersampling. This integration aims to leverage the strengths of individual networks while reducing their inherent biases. Our extensive experiments across various datasets reveal that BENN achieves an AUC-ROC score of 0.94, surpassing other models such as random forest (0.88), support vector (0.84) and single neural net (0.80). It was also observed that BENN's performance is better compared to traditional neural network models and standard ensemble methods in key metrics like accuracy, precision, recall, F1-score and AUC-ROC. The results specifically highlight BENN's effectiveness in accurately classifying instances of minority classes, a notable challenge in many existing models. These findings underscore BENN's potential as a substantial advancement in handling class imbalance within big data environments, offering a promising direction for future research and application in machine learning.

引用

页数：13

共 50 条

[1] Feedforward neural network models for handling class overlap and class imbalance
Kretzschmar, R
Karayiannis, NB
Eggimann, F
INTERNATIONAL JOURNAL OF NEURAL SYSTEMS, 2005, 15 (05) : 323 - 338
[2] DUEN: Dynamic ensemble handling class imbalance in network intrusion detection
Ren, Huajuan
Tang, Yonghe
Dong, Weiyu
Ren, Shuai
Jiang, Liehui
EXPERT SYSTEMS WITH APPLICATIONS, 2023, 229
[3] Hybrid Deep Neural Network for Handling Data Imbalance in Precursor MicroRNA
Elakkiya, R.
Jain, Deepak Kumar
Kotecha, Ketan
Pandya, Sharnil
Reddy, Sai Siddhartha
Rajalakshmi, E.
Varadarajan, Vijayakumar
Mahanti, Aniket
Subramaniyaswamy, Subramaniyaswamy
FRONTIERS IN PUBLIC HEALTH, 2021, 9
[4] A hybrid method based on ensemble WELM for handling multi class imbalance in cancer microarray data
Liu, Zhen
Tang, Deyu
Cai, Yongming
Wang, Ruoyu
Chen, Fuhua
NEUROCOMPUTING, 2017, 266 : 641 - 650
[5] Modeling of class imbalance handling with optimal deep learning enabled big data classification model
Varshavardhini, S.
Rajesh, A.
INTELLIGENT DECISION TECHNOLOGIES-NETHERLANDS, 2023, 17 (04): : 1179 - 1197
[6] Experimental evaluation of ensemble classifiers for imbalance in Big Data
Juez-Gil M.
Arnaiz-González Á.
Rodríguez J.J.
García-Osorio C.
Applied Soft Computing, 2021, 108
[7] A novel cost sensitive neural network ensemble for multiclass imbalance data learning
Cao, Peng
Li, Bo
Zhao, Dazhe
Zaiane, Osmar
2013 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2013,
[8] Online sparse class imbalance learning on big data
Maurya, Chandresh Kumar
Toshniwal, Durga
Venkoparao, Gopalan Vijendran
NEUROCOMPUTING, 2016, 216 : 250 - 260
[9] A Novel Ensemble-Learning-Based Convolution Neural Network for Handling Imbalanced Data
Wu, Xianbin
Wen, Chuanbo
Wang, Zidong
Liu, Weibo
Yang, Junjie
COGNITIVE COMPUTATION, 2024, 16 (01) : 177 - 190
[10] A Novel Ensemble-Learning-Based Convolution Neural Network for Handling Imbalanced Data
Xianbin Wu
Chuanbo Wen
Zidong Wang
Weibo Liu
Junjie Yang
Cognitive Computation, 2024, 16 : 177 - 190

← 1 2 3 4 5 →