Hybrid Firefly Optimised Ensemble Classification for Drifting Data Streams with Imbalance

被引:2
|
作者
Pepsi, M. Blessa Binolin [1 ]
Kumar, N. Senthil [1 ]
机构
[1] Mepco Schlenk Engn Coll, Sivakasi, Tamil Nadu, India
关键词
Class Imbalance; Firefly Optimisation; Oversampling; Classification; Data Stream; Concept Drift; MINORITY CLASS; ALGORITHM; SMOTE;
D O I
10.1016/j.knosys.2024.111500
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Classification learning on non-stationary data may face dynamic changes from time to time. The major problem lies in addressing the imbalance among classes and the substantial cost associated with labeling instances, especially in the presence of drifts. Imbalance is due to a lower number of samples in the minority class than in the majority class. Imbalanced data results in the misclassification of data points. This paper proposes a technique for rebalancing data with an oversampling approach using imputation methods and Hybrid Firefly Optimisation algorithm as a novel classifier to perform classification. Imputation methods improve the number of minority samples on a data chunk. Firefly algorithm is optimised as a classification technique with tuned weights using boosting ensemble classifiers. The proposed system is tested on seven synthetic data and five data stream generators. The evaluation metrics like F-measure, AUC, and G-mean are analyzed to investigate the performance. For weather data with an imbalance ratio of 5%, the G-mean value increases by an average of 0.24% comparatively than existing methods. The statistical Friedman - Nemenyi test proves the stability of the proposed algorithm.
引用
收藏
页数:13
相关论文
共 50 条
  • [1] Droplet Ensemble Learning on Drifting Data Streams
    Loeffel, Pierre-Xavier
    Bifet, Albert
    Marsala, Christophe
    Detyniecki, Marcin
    ADVANCES IN INTELLIGENT DATA ANALYSIS XVI, IDA 2017, 2017, 10584 : 210 - 222
  • [2] Intensive Class Imbalance Learning in Drifting Data Streams
    Usman, Muhammad
    Chen, Huanhuan
    IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2024, 8 (05): : 3503 - 3517
  • [3] Classification of the drifting data streams using heterogeneous diversified dynamic class-weighted ensemble
    Sarnovsky M.
    Kolarik M.
    PeerJ Computer Science, 2021, 7 : 1 - 31
  • [4] EMRIL: Ensemble Method based on ReInforcement Learning for binary classification in imbalanced drifting data streams
    Usman, Muhammad
    Chen, Huanhuan
    NEUROCOMPUTING, 2024, 605
  • [5] Classification of the drifting data streams using heterogeneous diversified dynamic class-weighted ensemble
    Sarnovsky, Martin
    Kolarik, Michal
    PEERJ COMPUTER SCIENCE, 2021,
  • [6] An adaptive ensemble classifier for mining concept drifting data streams
    Farid, Dewan Md.
    Zhang, Li
    Hossain, Alamgir
    Rahman, Chowdhury Mofizur
    Strachan, Rebecca
    Sexton, Graham
    Dahal, Keshav
    EXPERT SYSTEMS WITH APPLICATIONS, 2013, 40 (15) : 5895 - 5906
  • [7] An Aggregate Ensemble for Mining Concept Drifting Data Streams with Noise
    Zhang, Peng
    Zhu, Xingquan
    Shi, Yong
    Wu, Xindong
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PROCEEDINGS, 2009, 5476 : 1021 - +
  • [8] GP Boosting Classification on Concept Drifting Data Streams
    Kumar, Dirisala J. Nagendra
    Murthy, J. V. R.
    Satapathy, Suresh Chandra
    Pullela, S. V. V. S. R. Kumar
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON INFORMATION SYSTEMS DESIGN AND INTELLIGENT APPLICATIONS 2012 (INDIA 2012), 2012, 132 : 265 - +
  • [9] Structural XML Classification in Concept Drifting Data Streams
    Dariusz Brzezinski
    Maciej Piernik
    New Generation Computing, 2015, 33 : 345 - 366
  • [10] Structural XML Classification in Concept Drifting Data Streams
    Brzezinski, Dariusz
    Piernik, Maciej
    NEW GENERATION COMPUTING, 2015, 33 (04) : 345 - 366