DynaQ: online learning from imbalanced multi-class streams through dynamic sampling

被引:3
|
作者
Sadeghi, Farnaz [1 ]
Viktor, Herna L. [1 ]
Vafaie, Parsa [1 ]
机构
[1] Univ Ottawa, Sch Elect Engn & Comp Sci, 800 King Edward Rd, Ottawa, ON K1N 6N5, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
Online learning; Multi-class imbalance; Data streams; Ensembles; Concept drift; CLASSIFICATION; ENSEMBLE; INFORMATION; CHALLENGES; SELECTION;
D O I
10.1007/s10489-023-04886-w
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Online supervised learning from fast-evolving data streams, particularly in domains such as health, the environment, and manufacturing, is a crucial research area. However, these domains often experience class imbalance, which can skew class distributions. It is essential for online learning algorithms to analyze large datasets in real-time while accurately modeling rare or infrequent classes that may appear in bursts. While methods have been proposed to handle binary class imbalance, there is a lack of attention to multi-class imbalanced settings with varying degrees of imbalance in evolving streams. In this paper, we present the Dynamic Queues (DynaQ) algorithm for online learning in multi-class imbalanced settings to fill this knowledge gap. Our approach utilizes a batch-based resampling method that creates an instance queue for each class to balance the number of instances. We maintain a queue threshold and remove older samples during training. Additionally, we dynamically oversample minority classes based on one of four rate parameters: recall, F1-score, ?m, and Euclidean distance. Our learning algorithm consists of an ensemble that uses sliding windows and a soft voting schema while incorporating a drift detection mechanism. Our experimental results demonstrate the superiority of the DynaQ approach over state-of-the-art methods.
引用
收藏
页码:24908 / 24930
页数:23
相关论文
共 50 条
  • [21] Multi-class Ensemble Learning of Imbalanced Bidding Fraud Data
    Anowar, Farzana
    Sadaoui, Samira
    [J]. ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, 11489 : 352 - 358
  • [22] Re-sampling of multi-class imbalanced data using belief function theory and ensemble learning
    Grina, Fares
    Elouedi, Zied
    Lefevre, Eric
    [J]. INTERNATIONAL JOURNAL OF APPROXIMATE REASONING, 2023, 156 : 1 - 15
  • [23] An active learning budget-based oversampling approach for partially labeled multi-class imbalanced data streams
    Aguiar, Gabriel J.
    Cano, Alberto
    [J]. 38TH ANNUAL ACM SYMPOSIUM ON APPLIED COMPUTING, SAC 2023, 2023, : 382 - 389
  • [24] Online Learning From Incomplete and Imbalanced Data Streams
    You, Dianlong
    Xiao, Jiawei
    Wang, Yang
    Yan, Huigui
    Wu, Di
    Chen, Zhen
    Shen, Limin
    Wu, Xindong
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2023, 35 (10) : 10650 - 10665
  • [25] Multi-class Boosting for Imbalanced Data
    Fernandez-Baldera, Antonio
    Buenaposada, Jose M.
    Baumela, Luis
    [J]. PATTERN RECOGNITION AND IMAGE ANALYSIS (IBPRIA 2015), 2015, 9117 : 57 - 64
  • [26] Evolutionary inversion of class distribution in overlapping areas for multi-class imbalanced learning
    Fernandes, Everlandio R. Q.
    de Carvalho, Andre C. P. L. F.
    [J]. INFORMATION SCIENCES, 2019, 494 : 141 - 154
  • [27] Fast Learning and Testing for Imbalanced Multi-Class Changes in Streaming Data by Dynamic Multi-Stratum Network
    Thakong, Mongkhon
    Phimoltares, Suphakant
    Jaiyen, Saichon
    Lursinsap, Chidchanok
    [J]. IEEE ACCESS, 2017, 5 : 10633 - 10648
  • [28] To Combat Multi-Class Imbalanced Problems by Means of Over-Sampling Techniques
    Abdi, Lida
    Hashemi, Sattar
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2016, 28 (01) : 238 - 251
  • [29] SCUT-DS: Learning from Multi-class Imbalanced Canadian Weather Data
    Olaitan, Olubukola M.
    Viktor, Herna L.
    [J]. FOUNDATIONS OF INTELLIGENT SYSTEMS (ISMIS 2018), 2018, 11177 : 291 - 301
  • [30] Online discriminative graph learning from multi-class smooth signals
    Saboksayr, Seyed Saman
    Mateos, Gonzalo
    Cetin, Mujdat
    [J]. SIGNAL PROCESSING, 2021, 186