Incremental Rebalancing Learning on Evolving Data Streams

被引:15
|
作者
Bernardo, Alessio [1 ]
Valle, Emanuele Della [1 ]
Bifet, Albert [2 ,3 ]
机构
[1] DEIB Politecn Milano, Milan, Italy
[2] Univ Waikato, Hamilton, New Zealand
[3] Telecom ParisTech, LTCI, Palaiseau, France
关键词
Evolving Data Stream; Streaming; Concept Drift; MOA; Balancing;
D O I
10.1109/ICDMW51313.2020.00121
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Nowadays, every device connected to the Internet generates an ever-growing (formally, unbounded) stream of data. Machine Learning on data streams is a grand challenge due to its resource constraints. Indeed, standard machine learning techniques are not able to deal with data whose statistics are subject to gradual or sudden changes (formally, concept drift) without any warning. Massive Online Analysis (MOA) is the collective name, as well as a software library, for new learners that can manage data streams. In this paper, we present a research study on streaming rebalancing. Indeed, data streams can be imbalanced as static data, but there is not a method to rebalance them incrementally. For this reason, we propose a new streaming approach able to rebalance data streams online. Our new methodology is evaluated against some synthetically generated datasets using prequential evaluation to demonstrate that it outperforms the existing approaches.
引用
收藏
页码:844 / 850
页数:7
相关论文
共 50 条
  • [1] Adaptive online incremental learning for evolving data streams
    Zhang, Si -si
    Liu, Jian-wei
    Zuo, Xin
    APPLIED SOFT COMPUTING, 2021, 105
  • [2] Efficient Class Incremental Learning for Multi-label Classification of Evolving Data Streams
    Shi, Zhongwei
    Wen, Yimin
    Xue, Yun
    Cai, Guoyong
    PROCEEDINGS OF THE 2014 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2014, : 2093 - 2099
  • [3] Incremental multi-label classification of evolving data streams
    Yin, Zhiwu
    Huang, Shangteng
    Journal of Computational Information Systems, 2007, 3 (06): : 2189 - 2193
  • [4] Incremental Learning Algorithm for Dynamic Data Streams
    Kuthadi, Venu Madhav
    Govardhan, A.
    Chand, P. Prem
    INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2008, 8 (09): : 338 - 345
  • [5] Adaptive Learning from Evolving Data Streams
    Bifet, Albert
    Gavalda, Ricard
    ADVANCES IN INTELLIGENT DATA ANALYSIS VIII, PROCEEDINGS, 2009, 5772 : 249 - 260
  • [6] Kalman Filtering for Learning with Evolving Data Streams
    Ziffer, Giacomo
    Bernardo, Alessio
    Della Valle, Emanuele
    Bifet, Albert
    2021 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2021, : 5337 - 5346
  • [7] SMOTE-OB: Combining SMOTE and Online Bagging for Continuous Rebalancing of Evolving Data Streams
    Bernardo, Alessio
    Della Valle, Emanuele
    2021 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2021, : 5033 - 5042
  • [8] Lambda Learner: Fast Incremental Learning on Data Streams
    Ramanath, Rohan
    Salomatin, Konstantin
    Gee, Jeffrey D.
    Talanine, Kirill
    Dalal, Onkar
    Polatkan, Gungor
    Smoot, Sara
    Kumar, Deepak
    KDD '21: PROCEEDINGS OF THE 27TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2021, : 3492 - 3502
  • [9] Learning model trees from evolving data streams
    Ikonomovska, Elena
    Gama, Joao
    Dzeroski, Saso
    DATA MINING AND KNOWLEDGE DISCOVERY, 2011, 23 (01) : 128 - 168
  • [10] Clustering Based Active Learning for Evolving Data Streams
    Ienco, Dino
    Bifet, Albert
    Zliobaite, Indre
    Pfahringer, Bernhard
    DISCOVERY SCIENCE, 2013, 8140 : 79 - 93