Learning model trees from evolving data streams

Cited by: 193
Authors
Ikonomovska, Elena [1 ,4 ]
Gama, Joao [2 ,3 ]
Dzeroski, Saso [1 ]
Affiliations
[1] Jozef Stefan Inst, Ljubljana 1000, Slovenia
[2] Univ Porto, LIAAD INESC, P-4050190 Oporto, Portugal
[3] Univ Porto, Fac Econ, P-4200 Oporto, Portugal
[4] Ss Cyril & Methodius Univ, Fac Elect Engn & Informat Technol, Skopje 1000, Macedonia
Keywords
Non-stationary data streams; Stream data mining; Regression trees; Model trees; Incremental algorithms; On-line learning; Concept drift; On-line change detection; Drift
DOI
10.1007/s10618-010-0201-y
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
The problem of real-time extraction of meaningful patterns from time-changing data streams is of increasing importance for the machine learning and data mining communities. Regression in time-changing data streams is a relatively unexplored topic, despite the apparent applications. This paper proposes an efficient and incremental stream mining algorithm which is able to learn regression and model trees from possibly unbounded, high-speed and time-changing data streams. The algorithm is evaluated extensively in a variety of settings involving artificial and real data. To the best of our knowledge, there is no other general-purpose algorithm for incrementally learning regression and model trees that is able to perform explicit change detection and informed adaptation. The algorithm operates online and in real time, observes each example only once at the speed of arrival, and maintains a ready-to-use model tree at any time. The tree leaves contain linear models induced online from the examples assigned to them, a process with low complexity. The algorithm has mechanisms for drift detection and model adaptation, which enable it to maintain accurate and updated regression models at any time. The drift detection mechanism exploits the structure of the tree in the process of local change detection. As a response to local drift, the algorithm is able to update the tree structure only locally. This approach improves the any-time performance and greatly reduces the costs of adaptation.
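The two leaf-level ingredients named in the abstract, a linear model induced online and an explicit change detector, can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not say which update rule or which detector is used, so the delta-rule update, the Page-Hinkley test, and every name and parameter value below (`OnlineLinearLeaf`, `PageHinkley`, `run_stream`, `lr`, `delta`, `threshold`) are assumptions chosen for illustration. Tree growth and local restructuring are left out entirely.

```python
class PageHinkley:
    """Page-Hinkley test for an upward shift in the mean of a signal.
    Assumed detector for illustration; parameter values are arbitrary."""

    def __init__(self, delta=0.005, threshold=50.0):
        self.delta = delta          # magnitude of change that is tolerated
        self.threshold = threshold  # alarm threshold
        self.n = 0
        self.mean = 0.0             # running mean of the signal
        self.cum = 0.0              # cumulative deviation from the mean
        self.min_cum = 0.0          # smallest cumulative deviation seen

    def update(self, x):
        """Consume one value; return True if a change is signalled."""
        self.n += 1
        self.mean += (x - self.mean) / self.n
        self.cum += x - self.mean - self.delta
        self.min_cum = min(self.min_cum, self.cum)
        return self.cum - self.min_cum > self.threshold


class OnlineLinearLeaf:
    """Linear model updated one example at a time with the delta rule."""

    def __init__(self, n_features, lr=0.01):
        self.w = [0.0] * n_features
        self.b = 0.0
        self.lr = lr

    def predict(self, x):
        return self.b + sum(wi * xi for wi, xi in zip(self.w, x))

    def learn_one(self, x, y):
        """Update on (x, y); return the absolute error before the update."""
        err = y - self.predict(x)
        self.w = [wi + self.lr * err * xi for wi, xi in zip(self.w, x)]
        self.b += self.lr * err
        return abs(err)


def run_stream(stream, n_features):
    """Single pass over (x, y) pairs: each example is seen once, the leaf
    error feeds the detector, and an alarm resets the leaf (a crude
    stand-in for the local adaptation described in the abstract)."""
    leaf = OnlineLinearLeaf(n_features)
    detector = PageHinkley()
    alarms = []
    for t, (x, y) in enumerate(stream):
        loss = leaf.learn_one(x, y)
        if detector.update(loss):
            alarms.append(t)
            leaf = OnlineLinearLeaf(n_features)
            detector = PageHinkley()
    return alarms
```

In this sketch the detector watches the leaf's own error, so an alarm only affects that leaf. That is one simple way to picture "local" change detection, under the assumptions stated above.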
Pages: 128-168
Number of pages: 41