Incremental Rebalancing Learning on Evolving Data Streams

被引:15
|
作者
Bernardo, Alessio [1 ]
Valle, Emanuele Della [1 ]
Bifet, Albert [2 ,3 ]
机构
[1] DEIB Politecn Milano, Milan, Italy
[2] Univ Waikato, Hamilton, New Zealand
[3] Telecom ParisTech, LTCI, Palaiseau, France
关键词
Evolving Data Stream; Streaming; Concept Drift; MOA; Balancing;
D O I
10.1109/ICDMW51313.2020.00121
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Nowadays, every device connected to the Internet generates an ever-growing (formally, unbounded) stream of data. Machine Learning on data streams is a grand challenge due to its resource constraints. Indeed, standard machine learning techniques are not able to deal with data whose statistics are subject to gradual or sudden changes (formally, concept drift) without any warning. Massive Online Analysis (MOA) is the collective name, as well as a software library, for new learners that can manage data streams. In this paper, we present a research study on streaming rebalancing. Indeed, data streams can be imbalanced as static data, but there is not a method to rebalance them incrementally. For this reason, we propose a new streaming approach able to rebalance data streams online. Our new methodology is evaluated against some synthetically generated datasets using prequential evaluation to demonstrate that it outperforms the existing approaches.
引用
收藏
页码:844 / 850
页数:7
相关论文
共 50 条
  • [11] Learning model trees from evolving data streams
    Elena Ikonomovska
    João Gama
    Sašo Džeroski
    Data Mining and Knowledge Discovery, 2011, 23 : 128 - 168
  • [12] Learning Patterns from Imbalanced Evolving Data Streams
    Almuammar, Manal
    Fasli, Maria
    2018 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2018, : 2048 - 2057
  • [13] Incremental density-based ensemble clustering over evolving data streams
    Khan, Imran
    Huang, Joshua Z.
    Ivanov, Kamen
    NEUROCOMPUTING, 2016, 191 : 34 - 43
  • [14] Efficient Batch-Incremental Classification Using UMAP for Evolving Data Streams
    Bahri, Maroua
    Pfahringer, Bernhard
    Bifet, Albert
    Maniu, Silviu
    ADVANCES IN INTELLIGENT DATA ANALYSIS XVIII, IDA 2020, 2020, 12080 : 40 - 53
  • [15] Incremental learning and granular computing from evolving data streams: An application to speech-based bipolar disorder diagnosis
    Leite, Daniel
    Casalino, Gabriella
    Kaczmarek-Majer, Katarzyna
    Castellano, Giovanna
    FUZZY SETS AND SYSTEMS, 2025, 500
  • [16] Class Imbalance Robust Incremental LPSVM for Data Streams Learning
    Zhu, Lei
    Pang, Shaoning
    Chen, Gang
    Sarrafzadeh, Abdolhossein
    2012 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2012,
  • [17] Dynamic incremental SVM learning algorithm for mining, data streams
    Li, Zhong-Wei
    Yang, Jrng
    Zhang, Jian-Pei
    PROCEEDINGS OF THE FIRST INTERNATIONAL SYMPOSIUM ON DATA, PRIVACY, AND E-COMMERCE, 2007, : 35 - +
  • [18] Semi-supervised federated learning on evolving data streams
    Mawuli, Cobbinah B.
    Kumar, Jay
    Nanor, Ebenezer
    Fu, Shangxuan
    Pan, Liangxu
    Yang, Qinli
    Zhang, Wei
    Shao, Junming
    INFORMATION SCIENCES, 2023, 643
  • [19] Recurring concept meta-learning for evolving data streams
    Anderson, Robert
    Koh, Yun Sing
    Dobbie, Gillian
    Bifet, Albert
    EXPERT SYSTEMS WITH APPLICATIONS, 2019, 138
  • [20] Meta Expert Learning and Efficient Pruning for Evolving Data Streams
    Azarafrooz, Mahdi
    Daneshmand, Mahmoud
    IEEE INTERNET OF THINGS JOURNAL, 2015, 2 (04): : 268 - 273