Incremental Rebalancing Learning on Evolving Data Streams

被引:15
|
作者
Bernardo, Alessio [1 ]
Valle, Emanuele Della [1 ]
Bifet, Albert [2 ,3 ]
机构
[1] DEIB Politecn Milano, Milan, Italy
[2] Univ Waikato, Hamilton, New Zealand
[3] Telecom ParisTech, LTCI, Palaiseau, France
关键词
Evolving Data Stream; Streaming; Concept Drift; MOA; Balancing;
D O I
10.1109/ICDMW51313.2020.00121
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Nowadays, every device connected to the Internet generates an ever-growing (formally, unbounded) stream of data. Machine Learning on data streams is a grand challenge due to its resource constraints. Indeed, standard machine learning techniques are not able to deal with data whose statistics are subject to gradual or sudden changes (formally, concept drift) without any warning. Massive Online Analysis (MOA) is the collective name, as well as a software library, for new learners that can manage data streams. In this paper, we present a research study on streaming rebalancing. Indeed, data streams can be imbalanced as static data, but there is not a method to rebalance them incrementally. For this reason, we propose a new streaming approach able to rebalance data streams online. Our new methodology is evaluated against some synthetically generated datasets using prequential evaluation to demonstrate that it outperforms the existing approaches.
引用
收藏
页码:844 / 850
页数:7
相关论文
共 50 条
  • [31] StreamAR: Incremental and Active Learning with Evolving Sensory Data for Activity Recognition
    Abdallah, Zahraa Said
    Gaber, Mohamed Medhat
    Srinivasan, Bala
    Krishnaswamy, Shonali
    2012 IEEE 24TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2012), VOL 1, 2012, : 1163 - 1170
  • [32] Classifying Uncertain and Evolving Data Streams with Distributed Extreme Learning Machine
    Han, Dong-Hong
    Zhang, Xin
    Wang, Guo-Ren
    JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2015, 30 (04) : 874 - 887
  • [33] Learning from evolving data streams through ensembles of random patches
    Gomes, Heitor Murilo
    Read, Jesse
    Bifet, Albert
    Durrant, Robert J.
    KNOWLEDGE AND INFORMATION SYSTEMS, 2021, 63 (07) : 1597 - 1625
  • [34] Learning High-Dimensional Evolving Data Streams With Limited Labels
    Din, Salah Ud
    Kumar, Jay
    Shao, Junming
    Mawuli, Cobbinah Bernard
    Ndiaye, Waldiodio David
    IEEE TRANSACTIONS ON CYBERNETICS, 2022, 52 (11) : 11373 - 11384
  • [35] Learning from evolving data streams through ensembles of random patches
    Heitor Murilo Gomes
    Jesse Read
    Albert Bifet
    Robert J. Durrant
    Knowledge and Information Systems, 2021, 63 : 1597 - 1625
  • [36] Classifying Uncertain and Evolving Data Streams with Distributed Extreme Learning Machine
    Dong-Hong Han
    Xin Zhang
    Guo-Ren Wang
    Journal of Computer Science and Technology, 2015, 30 : 874 - 887
  • [37] Incremental modelling for compositional data streams
    Wei, Yuan
    Wang, Huiwen
    Wang, Shanshan
    Saporta, Gilbert
    COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2019, 48 (08) : 2229 - 2243
  • [38] An Incremental Classifier from Data Streams
    Pratama, Mahardhika
    Anavatti, Sreenatha G.
    Lughofer, Edwin
    ARTIFICIAL INTELLIGENCE: METHODS AND APPLICATIONS, 2014, 8445 : 15 - 28
  • [39] Dynamic Weighted Majority for Incremental Learning of Imbalanced Data Streams with Concept Drift
    Lu, Yang
    Cheung, Yiu-ming
    Tang, Yuan Yan
    PROCEEDINGS OF THE TWENTY-SIXTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 2393 - 2399
  • [40] Incremental rule learning and border examples selection from numerical data streams
    Ferrer-Troyano, FJ
    Aguilar-Ruiz, JS
    Riquelme, JC
    JOURNAL OF UNIVERSAL COMPUTER SCIENCE, 2005, 11 (08) : 1426 - 1439