Meta Expert Learning and Efficient Pruning for Evolving Data Streams

被引:0
|
作者
Azarafrooz, Mahdi [1 ]
Daneshmand, Mahmoud [2 ]
机构
[1] Stevens Inst Technol, Howe Sch Technol Management, Business Intelligence & Analyt, Hoboken, NJ 07030 USA
[2] Stevens Inst Technol, Schaefer Sch Engn & Sci, Dept Comp Sci, Business Intelligence & Analyt,Howe Sch Technol M, Hoboken, NJ 07030 USA
来源
IEEE INTERNET OF THINGS JOURNAL | 2015年 / 2卷 / 04期
关键词
WEIGHTED-MAJORITY; TRACKING;
D O I
10.1109/JIOT.2015.2420689
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Researchers have proposed several ensemble methods for the data stream environments including online bagging and boosting. These studies show that bagging methods perform better than boosting methods although the opposite is known to be true in the batch setting environments. The reason behind the weaker performance of boosting methods in the streaming environments is not clear. We have taken advantage of the algorithmic procedure of meta expert learnings for the sake of our study. The meta expert learning differs from the classic expert learning methods in that each expert starts to predict from a different point in the history. Moreover, maintaining a collection of base learners follows an algorithmic procedure. The focus of this paper is on studying the pruning function for maintaining the appropriate set of experts rather than proposing a competitive algorithm for selecting the experts. It shows how a well-structured pruning method leads to a better prediction accuracy without necessary higher memory consumption. Next, it is shown how pruning the set of base learners in the meta expert learning (in order to avoid memory exhaustion) affects the prediction accuracy for different types of drifts. In the case of time-locality drifts, the prediction accuracy is highly tied to the mathematical structure of the pruning algorithms. This observation may explain the main reason behind the weak performance of previously studied boosting methods in the streaming environments. It shows that the boosting algorithms should be designed with respect to the suitable notion of the regret metrics.
引用
收藏
页码:268 / 273
页数:6
相关论文
共 50 条
  • [31] On change diagnosis in evolving data streams
    Aggarwal, CC
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2005, 17 (05) : 587 - 600
  • [32] Leveraging Bagging for Evolving Data Streams
    Bifet, Albert
    Holmes, Geoff
    Pfahringer, Bernhard
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, PT I: EUROPEAN CONFERENCE, ECML PKDD 2010, 2010, 6321 : 135 - 150
  • [33] Dynamically Evolving Clustering for Data Streams
    Baruah, Rashmi Dutta
    Angelov, Plamen
    Baruah, Diganta
    2014 IEEE CONFERENCE ON EVOLVING AND ADAPTIVE INTELLIGENT SYSTEMS (EAIS), 2014,
  • [34] Ensemble Diversity in Evolving Data Streams
    Brzezinski, Dariusz
    Stefanowski, Jerzy
    DISCOVERY SCIENCE, (DS 2016), 2016, 9956 : 229 - 244
  • [35] Adaptive XGBoost for Evolving Data Streams
    Montiel, Jacob
    Mitchell, Rory
    Frank, Eibe
    Pfahringer, Bernhard
    Abdessalem, Talel
    Bifet, Albert
    2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
  • [36] Online Learning and Prediction of Data Streams using Dynamically Evolving Fuzzy Approach
    Baruah, Rashmi Dutta
    Angelov, Plamen
    2013 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS (FUZZ - IEEE 2013), 2013,
  • [37] Evolving Spiking Neural Networks for online learning over drifting data streams
    Lobo, Jesus L.
    Lana, Ibai
    Del Ser, Javier
    Bilbao, Miren Nekane
    Kasabov, Nikola
    NEURAL NETWORKS, 2018, 108 : 1 - 19
  • [38] Stream mining with integrity constraint learning for event extraction in evolving data streams
    Calvo Martinez, John
    Wobcke, Wayne
    KNOWLEDGE AND INFORMATION SYSTEMS, 2025, 67 (03) : 2595 - 2618
  • [39] Active Learning over Evolving Data Streams using Paired Ensemble Framework
    Xu, Wenhua
    Zhao, Fengfei
    Lu, Zhengcai
    2016 EIGHTH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTATIONAL INTELLIGENCE (ICACI), 2016, : 180 - 185
  • [40] Efficient approximation and privacy preservation algorithms for real time online evolving data streams
    Rahul A. Patil
    Pramod D. Patil
    World Wide Web, 2024, 27