Online Automated Machine Learning for Class Imbalanced Data Streams

被引:0
|
作者
Wang, Zhaoyang [1 ]
Wang, Shuo [1 ]
机构
[1] Univ Birmingham, Sch Comp Sci, Birmingham, England
关键词
Automated machine learning; class imbalance; data stream; evolutionary learning; ensemble learning;
D O I
10.1109/IJCNN54540.2023.10191926
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Automated machine learning (AutoML) has achieved great success in offline class imbalance learning where data are static. However, many real world applications data nowadays tend to evolve over time in the form of data streams and involve class imbalance distributions, e.g., intrusion detection, fault diagnosis systems, and fraud detection. These learning tasks require AutoML processing the instances instantly and adapting to the dynamic data changes. Nevertheless, existing AutoML research either only focuses on class imbalance in static data sets, or discusses data streams with concept drift. No existing work studied the joint learning challenges of class imbalance and online data stream learning in AutoML. To close the gap, this paper focuses on learning dynamic data streams with a skewed class distribution in AutoML. In this paper, we propose two new AutoML approaches, UEvoAutoML and OEvoAutoML, which integrate adaptive resampling techniques into an existing online AutoML framework. Their performance is investigated through a set of synthetic imbalanced data streams under various stationary and non-stationary scenarios and 5 real-world data streams. As the pioneering work of exploring how class imbalance techniques benefit online AutoML, this paper demonstrated that the effectiveness of adaptive resampling in AutoML frameworks.
引用
收藏
页数:8
相关论文
共 50 条
  • [1] Online Learning From Incomplete and Imbalanced Data Streams
    You, Dianlong
    Xiao, Jiawei
    Wang, Yang
    Yan, Huigui
    Wu, Di
    Chen, Zhen
    Shen, Limin
    Wu, Xindong
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2023, 35 (10) : 10650 - 10665
  • [2] Evolutionary Online Machine Learning from Imbalanced Data
    Stein, Anthony
    [J]. 2016 IEEE 1ST INTERNATIONAL WORKSHOPS ON FOUNDATIONS AND APPLICATIONS OF SELF* SYSTEMS (FAS*W), 2016, : 281 - 286
  • [3] Reinforcement Online Active Learning Ensemble for Drifting Imbalanced Data Streams
    Zhang, Hang
    Liu, Weike
    Liu, Qingbao
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2022, 34 (08) : 3971 - 3983
  • [4] Imbal-OL: Online Machine Learning from Imbalanced Data Streams in Real-world IoT
    Sudharsan, Bharath
    Breslin, John G.
    Ali, Muhammad Intizar
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2021, : 4974 - 4978
  • [5] Online Bagging and Boosting for Imbalanced Data Streams
    Wang, Boyu
    Pineau, Joelle
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2016, 28 (12) : 3353 - 3366
  • [6] Online-MC-Queue: Learning from Imbalanced Multi-Class Streams
    Sadeghi, Farnaz
    Viktor, Herna L.
    [J]. THIRD INTERNATIONAL WORKSHOP ON LEARNING WITH IMBALANCED DOMAINS: THEORY AND APPLICATIONS, VOL 154, 2021, 154 : 21 - 34
  • [7] Cost-sensitive sparse group online learning for imbalanced data streams
    Chen, Zhong
    Sheng, Victor
    Edwards, Andrea
    Zhang, Kun
    [J]. MACHINE LEARNING, 2024, 113 (07) : 4407 - 4444
  • [8] Online sequential class-specific extreme learning machine for binary imbalanced learning
    Shukla, Sanyam
    Raghuwanshi, Bhagat Singh
    [J]. NEURAL NETWORKS, 2019, 119 : 235 - 248
  • [9] DynaQ: online learning from imbalanced multi-class streams through dynamic sampling
    Sadeghi, Farnaz
    Viktor, Herna L.
    Vafaie, Parsa
    [J]. APPLIED INTELLIGENCE, 2023, 53 (21) : 24908 - 24930
  • [10] DynaQ: online learning from imbalanced multi-class streams through dynamic sampling
    Farnaz Sadeghi
    Herna L. Viktor
    Parsa Vafaie
    [J]. Applied Intelligence, 2023, 53 : 24908 - 24930