An incremental fuzzy decision tree classification method for mining data streams

Cited by: 0
Authors
Wang, Tao [1]
Li, Zhoujun [2]
Yan, Yuejin [1]
Chen, Huowang [1]
Affiliations
[1] Natl Univ Def Technol, Comp Sch, Changsha 410073, Peoples R China
[2] Beihang Univ, Sch Comp Sci & Engn, Beijing 100083, Peoples R China
Funding
National Science Foundation (USA);
Keywords
data streams; incremental; fuzzy; continuous attribute; threaded binary search tree;
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
One of the most important algorithms for mining data streams is VFDT. It uses the Hoeffding inequality to achieve a probabilistic bound on the accuracy of the constructed tree. Gama et al. extended VFDT in two directions: their system, VFDTc, can deal with continuous data and uses more powerful classification techniques at the tree leaves. In this paper, we revisit this problem and implement a system, fVFDT, on top of VFDT and VFDTc. We make the following four contributions: 1) We present a threaded binary search tree (TBST) approach for efficiently handling continuous attributes. It builds a threaded binary search tree whose processing time for inserting values is O(n log n), whereas VFDT's processing time is O(n^2). When a new example arrives, VFDTc needs to update O(log n) attribute tree nodes, but fVFDT needs to update only the one necessary node. 2) We improve the method of finding the best split-test point of a given continuous attribute. Compared with the method used in VFDTc, processing time improves from O(n log n) to O(n). 3) Compared with VFDTc, fVFDT's number of candidate split tests decreases from O(n) to O(log n). 4) We improve the soft discretization method for use in data stream mining; it overcomes the problem of noisy data and improves classification accuracy.
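The Hoeffding-based split decision that the abstract attributes to VFDT can be sketched as follows. This is a minimal illustration of the standard Hoeffding bound, not the authors' fVFDT code; the function names `hoeffding_bound` and `should_split` and all parameter names are assumptions made for this example.

```python
import math


def hoeffding_bound(value_range: float, delta: float, n: int) -> float:
    """Return epsilon such that, with probability 1 - delta, the true mean of a
    variable with the given range lies within epsilon of the mean of n samples."""
    return math.sqrt(value_range ** 2 * math.log(1.0 / delta) / (2.0 * n))


def should_split(best_gain: float, second_gain: float,
                 value_range: float, delta: float, n: int) -> bool:
    """Hypothetical helper: split a leaf once the observed gap between the two
    best split candidates exceeds the Hoeffding bound for n examples."""
    return (best_gain - second_gain) > hoeffding_bound(value_range, delta, n)


if __name__ == "__main__":
    # Illustrative values only: heuristic range 1.0, delta 1e-7, 1000 examples.
    eps = hoeffding_bound(1.0, 1e-7, 1000)
    print(f"epsilon = {eps:.4f}")
    print(should_split(0.30, 0.15, 1.0, 1e-7, 1000))
    print(should_split(0.30, 0.25, 1.0, 1e-7, 1000))
```

Because the bound shrinks as n grows, a leaf that cannot yet separate its two best candidates only needs to wait for more examples before the same test is repeated.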
Pages: 91 / +
Page count: 4
Related papers
50 in total
  • [1] A new fuzzy decision tree classification method for mining high-speed data streams based on binary search trees
    Li, Zhoujun
    Wang, Tao
    Wang, Ruoxue
    Yan, Yuejin
    Chen, Huowang
    [J]. FRONTIERS IN ALGORITHMICS, PROCEEDINGS, 2007, 4613 : 216 - +
  • [2] An Efficient Decision Tree Classification Method Based on Extended Hash Table for Data Streams Mining
    Ouyang, Zhenzheng
    Wu, Quanyuan
    Wang, Tao
    [J]. FIFTH INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY, VOL 5, PROCEEDINGS, 2008, : 313 - +
  • [3] AN INCREMENTAL DECISION TREE FOR MINING MULTILABEL DATA
    Li, Peipei
    Wu, Xindong
    Hu, Xuegang
    Wang, Hao
    [J]. APPLIED ARTIFICIAL INTELLIGENCE, 2015, 29 (10) : 992 - 1014
  • [4] The CART decision tree for mining data streams
    Rutkowski, Leszek
    Jaworski, Maciej
    Pietruczuk, Lena
    Duda, Piotr
    [J]. INFORMATION SCIENCES, 2014, 266 : 1 - 15
  • [5] Extremely Fast Decision Tree Mining for Evolving Data Streams
    Bifet, Albert
    Zhang, Jiajin
    Fan, Wei
    He, Cheng
    Zhang, Jianfeng
    Qian, Jianfeng
    Holmes, Geoff
    Pfahringer, Bernhard
    [J]. KDD'17: PROCEEDINGS OF THE 23RD ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2017, : 1733 - 1742
  • [6] A new decision tree classification method for mining high-speed data streams based on threaded binary search trees
    Wang, Tao
    Li, Zhoujun
    Hu, Xiaohua
    Yan, Yuejin
    Chen, Huowang
    [J]. EMERGING TECHNOLOGIES IN KNOWLEDGE DISCOVERY AND DATA MINING, 2007, 4819 : 256 - +
  • [7] Classification of Data Streams by Incremental Semi-supervised Fuzzy Clustering
    Castellano, G.
    Fanelli, A. M.
    [J]. FUZZY LOGIC AND SOFT COMPUTING APPLICATIONS, WILF 2016, 2017, 10147 : 185 - 194
  • [8] Elegant decision tree algorithm for classification in data mining
    Chandra, B
    Mazumdar, S
    Arena, V
    Parimi, N
    [J]. WISE 2002: PROCEEDINGS OF THE THIRD INTERNATIONAL CONFERENCE ON WEB INFORMATION SYSTEMS ENGINEERING (WORKSHOPS), 2002, : 160 - 169
  • [9] Incremental Learning of Fuzzy Decision Trees for Streaming Data Classification
    Pecori, Riccardo
    Ducange, Pietro
    Marcelloni, Francesco
    [J]. PROCEEDINGS OF THE 11TH CONFERENCE OF THE EUROPEAN SOCIETY FOR FUZZY LOGIC AND TECHNOLOGY (EUSFLAT 2019), 2019, 1 : 748 - 755
  • [10] Incremental Optimization Mechanism for Constructing a Decision Tree in Data Stream Mining
    Yang, Hang
    Fong, Simon
    [J]. MATHEMATICAL PROBLEMS IN ENGINEERING, 2013, 2013