Uncertain Data Stream Classification with Concept Drift

Cited by: 0
Authors
Lv Yanxia [1]
Wang Cuirong [1]
Wang Cong [1]
Liu Bingyu [1]
Affiliations
[1] Northeastern University, College of Information Science and Engineering, Shenyang, People's Republic of China
Keywords
big data; uncertain data stream; decision tree; classification; concept drift
DOI
10.1109/CBD.2016.51
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
In the big data era, data on the Internet is growing at an exponential rate. Data uncertainty caused by privacy protection, data loss, network errors, and similar factors is very common. In a data stream system, data arrive continuously and cannot be obtained in full. In addition, concept drift often occurs in data streams. An incremental classification model is therefore needed to classify uncertain data streams with concept drift. This paper presents the Weighted Bayes based Very Fast Decision Tree for Uncertain data stream with Concept drift (WBVFDTUC) algorithm. The algorithm can analyze uncertain information quickly and effectively in both the learning stage and the classification stage. In the learning stage, it uses the Hoeffding bound to quickly construct a decision tree model for the uncertain data stream. In the classification stage, it uses weighted Bayes classifiers in the tree leaves to improve classification performance. A sliding window and tree replacement allow the algorithm to handle concept drift. Experimental results show that the proposed algorithm learns uncertain data streams very quickly and improves the classification performance of the model.
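The abstract names the Hoeffding bound as the basis for growing the tree. As a rough illustration only, the sketch below shows the standard split test from the original Very Fast Decision Tree (VFDT) method, not the WBVFDTUC variant itself: the paper's handling of uncertain attribute values, its weighted Bayes leaves, and its sliding window are not reproduced here. The function names `hoeffding_bound` and `should_split` and all parameter values are hypothetical, chosen for this example.

```python
import math


def hoeffding_bound(value_range: float, delta: float, n: int) -> float:
    """Return epsilon such that, with probability at least 1 - delta, the
    true mean of a variable with the given range lies within epsilon of
    the mean observed over n independent samples."""
    return math.sqrt(value_range ** 2 * math.log(1.0 / delta) / (2.0 * n))


def should_split(best_gain: float, second_gain: float,
                 n_classes: int, delta: float, n: int) -> bool:
    """Standard VFDT test (an assumption here, not the paper's exact rule):
    split a leaf once the gap between the two best attributes' information
    gains exceeds the Hoeffding bound."""
    # Information gain over n_classes classes ranges over [0, log2(n_classes)].
    value_range = math.log2(n_classes)
    return (best_gain - second_gain) > hoeffding_bound(value_range, delta, n)


if __name__ == "__main__":
    # Illustrative numbers only: two classes, 1000 examples seen at the leaf.
    print(hoeffding_bound(1.0, 1e-7, 1000))
    print(should_split(0.30, 0.15, n_classes=2, delta=1e-7, n=1000))
```

Because the bound shrinks as the number of observed examples grows, a leaf can commit to a split after seeing only a small part of the stream, which is what makes this family of trees suitable for incremental learning.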
Pages: 265 / +
Page count: 7