Multi-class imbalanced semi-supervised learning from streams through online ensembles

被引:11
|
作者
Vafaie, Parsa [1 ]
Viktor, Herna [1 ]
Michalowski, Wojtek [2 ]
机构
[1] Univ Ottawa, Sch Elect Engn & Comp Sci, Ottawa, ON, Canada
[2] Univ Ottawa, Telfer Sch Management, Ottawa, ON, Canada
关键词
D O I
10.1109/ICDMW51313.2020.00124
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multi-class imbalance, in which the rates of instances in the various classes differ substantially, poses a major challenge when learning from evolving streams. In this setting, minority class instances may arrive infrequently and in bursts, making accurate model construction problematic. Further, skewed streams are not only susceptible to concept drifts, but class labels may also be absent, expensive to obtain, or only arrive after some delay. The combined effects of multi-class skew, concept drift and semi-supervised learning have received limited attention in the online learning community. In this paper, we introduce a multi-class online ensemble algorithm that is suitable for learning in such settings. Specifically, our algorithm uses sampling with replacement while dynamically increasing the weights of underrepresented classes based on recall in order to produce models that benefit all classes. Our approach addresses the potential lack of labels by incorporating a self-training semi-supervised learning method for labeling instances. Our experimental results show that our online ensemble performs well against multi-class imbalanced data containing concept drifts. In addition, our algorithm produces accurate predictions, even in the presence of unlabeled data.
引用
收藏
页码:867 / 874
页数:8
相关论文
共 50 条
  • [1] DynaQ: online learning from imbalanced multi-class streams through dynamic sampling
    Farnaz Sadeghi
    Herna L. Viktor
    Parsa Vafaie
    [J]. Applied Intelligence, 2023, 53 : 24908 - 24930
  • [2] DynaQ: online learning from imbalanced multi-class streams through dynamic sampling
    Sadeghi, Farnaz
    Viktor, Herna L.
    Vafaie, Parsa
    [J]. APPLIED INTELLIGENCE, 2023, 53 (21) : 24908 - 24930
  • [3] SIMILARITY LEARNING FOR SEMI-SUPERVISED MULTI-CLASS BOOSTING
    Wang, Q. Y.
    Yuen, P. C.
    Feng, G. C.
    [J]. 2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 2164 - 2167
  • [4] Online-MC-Queue: Learning from Imbalanced Multi-Class Streams
    Sadeghi, Farnaz
    Viktor, Herna L.
    [J]. THIRD INTERNATIONAL WORKSHOP ON LEARNING WITH IMBALANCED DOMAINS: THEORY AND APPLICATIONS, VOL 154, 2021, 154 : 21 - 34
  • [5] Semi-supervised Learning Algorithm Based on Linear Lie Group for Imbalanced Multi-class Classification
    Xu, Chengjun
    Zhu, Guobin
    [J]. NEURAL PROCESSING LETTERS, 2020, 52 (01) : 869 - 889
  • [6] Semi-supervised Learning Algorithm Based on Linear Lie Group for Imbalanced Multi-class Classification
    Chengjun Xu
    Guobin Zhu
    [J]. Neural Processing Letters, 2020, 52 : 869 - 889
  • [7] Online semi-supervised active learning ensemble classification for evolving imbalanced data streams
    Guo, Yinan
    Pu, Jiayang
    Jiao, Botao
    Peng, Yanyan
    Wang, Dini
    Yang, Shengxiang
    [J]. APPLIED SOFT COMPUTING, 2024, 155
  • [8] Regularized Multi-Class Semi-Supervised Boosting
    Saffari, Amir
    Leistner, Christian
    Bischof, Horst
    [J]. CVPR: 2009 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOLS 1-4, 2009, : 967 - 974
  • [9] Semi-Supervised Boosting for Multi-Class Classification
    Valizadegan, Hamed
    Jin, Rong
    Jain, Anil K.
    [J]. MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, PART II, PROCEEDINGS, 2008, 5212 : 522 - 537
  • [10] Improving Multi-class Classification for Endomicroscopic Images by Semi-supervised Learning
    Wu, Hang
    Tong, Li
    Wang, May D.
    [J]. 2017 IEEE EMBS INTERNATIONAL CONFERENCE ON BIOMEDICAL & HEALTH INFORMATICS (BHI), 2017, : 5 - 8