Online Learning from Capricious Data Streams: A Generative Approach

被引:0
|
作者
He, Yi [1 ]
Wu, Baijun [1 ]
Wu, Di [2 ]
Beyazit, Ege [1 ]
Chen, Sheng [1 ]
Wu, Xindong [1 ]
机构
[1] Univ Louisiana Lafayette, Sch Comp & Informat, Lafayette, LA 70504 USA
[2] Chinese Acad Sci, Chongqing Inst Green & Intelligent Technol, Beijing, Peoples R China
基金
美国国家科学基金会;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Learning with streaming data has received extensive attention during the past few years. Existing approaches assume the feature space is fixed or changes by following explicit regularities, limiting their applicability in dynamic environments where the data streams are described by an arbitrarily varying feature space. To handle such capricious data streams, we in this paper develop a novel algorithm, named OCDS (Online learning from Capricious Data Streams), which does not make any assumption on feature space dynamics. OCDS trains a learner on a universal feature space that establishes relationships between old and new features, so that the patterns learned in the old feature space can be used in the new feature space. Specifically, the universal feature space is constructed by leveraging the relatednesses among features. We propose a generative graphical model to model the construction process, and show that learning from the universal feature space can effectively improve the performance with theoretical analysis. The experimental results demonstrate that OCDS achieves conspicuous performance on both synthetic and real datasets.
引用
收藏
页码:2491 / 2497
页数:7
相关论文
共 50 条
  • [31] A Stable and Online Approach to Detect Concept Drift in Data Streams
    da Costa, Fausto Guzzo
    de Mello, Rodrigo Fernandes
    [J]. 2014 BRAZILIAN CONFERENCE ON INTELLIGENT SYSTEMS (BRACIS), 2014, : 330 - 335
  • [32] A Novel Online Ensemble Approach for Concept Drift in Data Streams
    Sidhu, Parneeta
    Bhatia, M. P. S.
    Bindal, Aditya
    [J]. 2013 IEEE SECOND INTERNATIONAL CONFERENCE ON IMAGE INFORMATION PROCESSING (ICIIP), 2013, : 550 - 555
  • [33] Online reliable semi-supervised learning on evolving data streams
    Din, Salah Ud
    Shao, Junming
    Kumar, Jay
    Ali, Waqar
    Liu, Jiaming
    Ye, Yu
    [J]. INFORMATION SCIENCES, 2020, 525 (525) : 153 - 171
  • [34] The online performance estimation framework: heterogeneous ensemble learning for data streams
    Jan N. van Rijn
    Geoffrey Holmes
    Bernhard Pfahringer
    Joaquin Vanschoren
    [J]. Machine Learning, 2018, 107 : 149 - 176
  • [35] Reinforcement Online Active Learning Ensemble for Drifting Imbalanced Data Streams
    Zhang, Hang
    Liu, Weike
    Liu, Qingbao
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2022, 34 (08) : 3971 - 3983
  • [36] Online ensemble learning with abstaining classifiers for drifting and noisy data streams
    Krawczyk, Bartosz
    Cano, Alberto
    [J]. APPLIED SOFT COMPUTING, 2018, 68 : 677 - 692
  • [37] The online performance estimation framework: heterogeneous ensemble learning for data streams
    van Rijn, Jan N.
    Holmes, Geoffrey
    Pfahringer, Bernhard
    Vanschoren, Joaquin
    [J]. MACHINE LEARNING, 2018, 107 (01) : 149 - 176
  • [38] Online real-time learning strategies for data streams for Neurocomputing
    Pratama, Mahardhika
    Lughofer, Edwin
    Wang, Dianhui
    [J]. NEUROCOMPUTING, 2017, 262 : 1 - 3
  • [39] Personalized online ensemble machine learning with applications for dynamic data streams
    Malenica, Ivana
    Phillips, Rachael V. V.
    Chambaz, Antoine
    Hubbard, Alan E. E.
    Pirracchio, Romain
    van der Laan, Mark J. J.
    [J]. STATISTICS IN MEDICINE, 2023, 42 (07) : 1013 - 1044
  • [40] Semi-supervised Learning Algorithm for Online Electricity Data Streams
    Patil, Pramod
    Fatangare, Yogita
    Kulkarni, Parag
    [J]. ARTIFICIAL INTELLIGENCE AND EVOLUTIONARY ALGORITHMS IN ENGINEERING SYSTEMS, VOL 1, 2015, 324 : 349 - 358