An incremental fuzzy decision tree classification method for mining data streams

被引:0
|
作者
Wang, Tao [1 ]
Li, Zhoujun [2 ]
Yan, Yuejin [1 ]
Chen, Huowang [1 ]
机构
[1] Natl Univ Def Technol, Comp Sch, Changsha 410073, Peoples R China
[2] Beihang Univ, Sch Comp Sci & Engn, Beijing 100083, Peoples R China
基金
美国国家科学基金会;
关键词
data streams; incremental; fuzzy; continuous attribute; threaded binary search tree;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
One of most important algorithms for mining data streams is VFDT. It uses Hoeffding inequality to achieve a probabilistic bound on the accuracy of the tree constructed. Gama et al. have extended VFDT in two directions. Their system VFDTc can deal with continuous data and use more powerful classification techniques at tree leaves. In this paper, we revisit this problem and implemented a system fVFDT on top of VFDT and VFDTc. We make the following four contributions: 1) we present a threaded binary search trees (TBST) approach for efficiently handling continuous attributes. It builds a threaded binary search tree, and its processing time for values inserting is O(nlogn), while VFDT's processing time is O(n(2)). When a new example arrives, VFDTc need update O(logn) attribute tree nodes, but fVFDT just need update one necessary node.2) we improve the method of getting the best split-test point of a given continuous attribute. Comparing to the method used in VFDTc, it improves from O(nlogn) to O (n) in processing time. 3) Comparing to VFDTc, fVFDT's candidate split-test number decrease from O(n) to O(logn).4)lmprove the soft discretization method to be used in data streams mining, it overcomes the problem of noise data and improve the classification accuracy.
引用
收藏
页码:91 / +
页数:4
相关论文
共 50 条
  • [41] FUDT: A Fuzzy Uncertain Decision Tree Algorithm for Classification of Uncertain Data
    Meenakshi, S.
    Venkatachalam, V.
    ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 2015, 40 (11) : 3187 - 3196
  • [42] Incremental Fuzzy Association Rule Mining for Classification and Regression
    Wang, Ling
    Ma, Qian
    Meng, Jianyao
    IEEE ACCESS, 2019, 7 : 121095 - 121110
  • [43] Fuzzy Random Decision Tree (FRDT) Framework for Privacy Preserving Data Mining
    Sumalatha, L.
    Sankar, P. Uma
    PROCEEDINGS OF THE 2016 SAI COMPUTING CONFERENCE (SAI), 2016, : 195 - 202
  • [44] Data mining for multi-agent fuzzy decision tree structure and rules
    Smith, JF
    PROCEEDINGS OF THE FIFTH INTERNATIONAL CONFERENCE ON INFORMATION FUSION, VOL I, 2002, : 72 - 79
  • [45] Incremental Mining of Across-streams Sequential Patterns in Multiple Data Streams
    Yang, Shih-Yang
    Chao, Ching-Ming
    Chen, Po-Zung
    Sun, Chu-Hao
    JOURNAL OF COMPUTERS, 2011, 6 (03) : 449 - 457
  • [46] Decision Tree for Dynamic and Unceratin Data streams
    Liang, Chunquan
    Zhang, Yang
    Song, Qun
    PROCEEDINGS OF 2ND ASIAN CONFERENCE ON MACHINE LEARNING (ACML2010), 2010, 13 : 209 - 224
  • [47] Incremental linear discriminant analysis for classification of data streams
    Pang, S
    Ozawa, S
    Kasabov, N
    IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2005, 35 (05): : 905 - 914
  • [48] Privacy protection data mining algorithm in blockchain based on decision tree classification
    Cao, Yu
    Wei, Wei
    Zhou, Jin
    WEB INTELLIGENCE, 2022, 20 (02) : 103 - 112
  • [49] Online and offline hybrid teaching data mining based on decision tree classification
    Cao, Yu
    Chen, Shu-Wen
    Zhu, Hui-Sheng
    INTERNATIONAL JOURNAL OF CONTINUING ENGINEERING EDUCATION AND LIFE-LONG LEARNING, 2024, 34 (05) : 489 - 500
  • [50] Comparative study between decision tree and knn of data mining classification technique
    Mohanapriya, M.
    Lekha, J.
    SECOND NATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE (NCCI 2018), 2018, 1142