DynaQ: online learning from imbalanced multi-class streams through dynamic sampling

被引:3
|
作者
Sadeghi, Farnaz [1 ]
Viktor, Herna L. [1 ]
Vafaie, Parsa [1 ]
机构
[1] Univ Ottawa, Sch Elect Engn & Comp Sci, 800 King Edward Rd, Ottawa, ON K1N 6N5, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
Online learning; Multi-class imbalance; Data streams; Ensembles; Concept drift; CLASSIFICATION; ENSEMBLE; INFORMATION; CHALLENGES; SELECTION;
D O I
10.1007/s10489-023-04886-w
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Online supervised learning from fast-evolving data streams, particularly in domains such as health, the environment, and manufacturing, is a crucial research area. However, these domains often experience class imbalance, which can skew class distributions. It is essential for online learning algorithms to analyze large datasets in real-time while accurately modeling rare or infrequent classes that may appear in bursts. While methods have been proposed to handle binary class imbalance, there is a lack of attention to multi-class imbalanced settings with varying degrees of imbalance in evolving streams. In this paper, we present the Dynamic Queues (DynaQ) algorithm for online learning in multi-class imbalanced settings to fill this knowledge gap. Our approach utilizes a batch-based resampling method that creates an instance queue for each class to balance the number of instances. We maintain a queue threshold and remove older samples during training. Additionally, we dynamically oversample minority classes based on one of four rate parameters: recall, F1-score, ?m, and Euclidean distance. Our learning algorithm consists of an ensemble that uses sliding windows and a soft voting schema while incorporating a drift detection mechanism. Our experimental results demonstrate the superiority of the DynaQ approach over state-of-the-art methods.
引用
收藏
页码:24908 / 24930
页数:23
相关论文
共 50 条
  • [1] DynaQ: online learning from imbalanced multi-class streams through dynamic sampling
    Farnaz Sadeghi
    Herna L. Viktor
    Parsa Vafaie
    [J]. Applied Intelligence, 2023, 53 : 24908 - 24930
  • [2] Multi-class imbalanced semi-supervised learning from streams through online ensembles
    Vafaie, Parsa
    Viktor, Herna
    Michalowski, Wojtek
    [J]. 20TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS (ICDMW 2020), 2020, : 867 - 874
  • [3] Online-MC-Queue: Learning from Imbalanced Multi-Class Streams
    Sadeghi, Farnaz
    Viktor, Herna L.
    [J]. THIRD INTERNATIONAL WORKSHOP ON LEARNING WITH IMBALANCED DOMAINS: THEORY AND APPLICATIONS, VOL 154, 2021, 154 : 21 - 34
  • [4] A Dynamic Sampling Framework for Multi-Class Imbalanced Data
    Debowski, B.
    Areibi, S.
    Grewal, G.
    Tempelman, J.
    [J]. 2012 11TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA 2012), VOL 2, 2012, : 113 - 118
  • [5] Concept Drift Detection from Multi-Class Imbalanced Data Streams
    Korycki, Lukasz
    Krawczyk, Bartosz
    [J]. 2021 IEEE 37TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2021), 2021, : 1068 - 1079
  • [6] Online active learning method for multi-class imbalanced data stream
    Ang Li
    Meng Han
    Dongliang Mu
    Zhihui Gao
    Shujuan Liu
    [J]. Knowledge and Information Systems, 2024, 66 : 2355 - 2391
  • [7] Online active learning method for multi-class imbalanced data stream
    Li, Ang
    Han, Meng
    Mu, Dongliang
    Gao, Zhihui
    Liu, Shujuan
    [J]. KNOWLEDGE AND INFORMATION SYSTEMS, 2024, 66 (04) : 2355 - 2391
  • [8] Dynamic ensemble selection for multi-class imbalanced datasets
    Garcia, Salvador
    Zhang, Zhong-Liang
    Altalhi, Abdulrahman
    Alshomrani, Saleh
    Herrera, Francisco
    [J]. INFORMATION SCIENCES, 2018, 445 : 22 - 37
  • [9] OAHO: an effective algorithm for multi-class learning from imbalanced data
    Murphey, Yi L.
    Wang, Haoxing
    Ou, Guobin
    Feldkamp, Lee A.
    [J]. 2007 IEEE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-6, 2007, : 406 - +
  • [10] Learning from Combination of Data Chunks for Multi-class Imbalanced Data
    Liu, Xu-Ying
    Li, Qian-Qian
    [J]. PROCEEDINGS OF THE 2014 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2014, : 1680 - 1687