Uncertain Data Stream Classification with Concept Drift

被引:0
|
作者
Lv Yanxia [1 ]
Wang Cuirong [1 ]
Wang Cong [1 ]
Liu Bingyu [1 ]
机构
[1] Northeastern Univ, Coll Informat Sci & Engn, Shenyang, Peoples R China
来源
2016 FOURTH INTERNATIONAL CONFERENCE ON ADVANCED CLOUD AND BIG DATA (CBD 2016) | 2016年
关键词
big data; uncertain data stream; decision tree; classification; concept drift;
D O I
10.1109/CBD.2016.51
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In big data era, the data on the Internet is growing at an exponential rate. The uncertainty of data due to privacy protection, data loss, network errors and so on is very common. In data stream system, data arrive at continuously and can't be obtained all. In addition, the concept drift occurs often in the data stream. So we need construct an incremental classification model to deal with uncertain data stream classification with concept drift. This paper presented Weighted Bayes based Very Fast Decision Tree for Uncertain data stream with Concept drift-WBVFDTUC algorithm. The algorithm can analyze uncertain information quickly and effectively in both the learning stage and classification stage. In the learning stage, it uses Hoeffding bound theory quickly construct a decision tree model for uncertain data stream. In the classification stage, it uses the weighted Bayes classifier in the tree leaves to improve the performance of the classification. The use of sliding window and replacing tree ensure the algorithm can deal with concept drift phenomenon. Experimental results show that the proposed algorithm can very quickly learn uncertain data stream and improve the classification performance of the model.
引用
收藏
页码:265 / +
页数:7
相关论文
共 50 条
  • [41] Active Learning Method for Imbalanced Concept Drift Data Stream
    Li Y.-H.
    Wang T.-T.
    Wang S.-G.
    Li D.-Y.
    Zidonghua Xuebao/Acta Automatica Sinica, 2024, 50 (03): : 589 - 606
  • [42] Multidimensional surrogate stability to detect data stream concept drift
    da Costa, Fausto G.
    Duarte, Felipe S. L. G.
    Vallim, Rosane M. M.
    de Mello, Rodrigo F.
    EXPERT SYSTEMS WITH APPLICATIONS, 2017, 87 : 15 - 29
  • [43] Concept Drift Detection in Data Stream Mining : A literature review
    Agrahari, Supriya
    Singh, Anil Kumar
    JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2022, 34 (10) : 9523 - 9540
  • [44] Data stream mining: methods and challenges for handling concept drift
    Wares, Scott
    Isaacs, John
    Elyan, Eyad
    SN APPLIED SCIENCES, 2019, 1 (11):
  • [45] Novel Class Detection with Concept Drift in Data Stream - AhtNODE
    Gandhi, Jay
    Gandhi, Vaibhav
    INTERNATIONAL JOURNAL OF DISTRIBUTED SYSTEMS AND TECHNOLOGIES, 2020, 11 (01) : 15 - 26
  • [46] Overview of Wind and Photovoltaic Data Stream Classification and Data Drift Issues
    Zhu, Xinchun
    Wu, Yang
    Zhao, Xu
    Yang, Yunchen
    Liu, Shuangquan
    Shi, Luyi
    Wu, Yelong
    ENERGIES, 2024, 17 (17)
  • [47] Concept Drift Detection in Data Stream Clustering and its Application on Weather Data
    Namitha, K.
    Kumar, Santhosh G.
    INTERNATIONAL JOURNAL OF AGRICULTURAL AND ENVIRONMENTAL INFORMATION SYSTEMS, 2020, 11 (01) : 67 - 85
  • [48] Enhanced Intrusion Detection with Data Stream Classification and Concept Drift Guided by the Incremental Learning Genetic Programming Combiner
    Shyaa, Methaq A.
    Zainol, Zurinahni
    Abdullah, Rosni
    Anbar, Mohammed
    Alzubaidi, Laith
    Santamaria, Jose
    SENSORS, 2023, 23 (07)
  • [49] Concept Drift-Based Intrusion Detection For Evolving Data Stream Classification In IDS: Approaches And Comparative Study
    Seth, Sugandh
    Chahal, Kuljit Kaur
    Singh, Gurvinder
    COMPUTER JOURNAL, 2024, 67 (07): : 2529 - 2547
  • [50] Combining active learning with concept drift detection for data stream mining
    Krawczyk, Bartosz
    Pfahringer, Bernhard
    Wozniak, Michal
    2018 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2018, : 2239 - 2244