An Improved Algorithm of Decision Trees for Streaming Data Based on VFDT

Cited by: 3
Authors
Li, Feixiong [1]
Liu, Quan [1]
Affiliation
[1] Soochow Univ, Prov Key Lab Comp Informat Proc Technol, Suzhou 215006, Peoples R China
Keywords
Streaming Data Mining; Decision Trees; Unequal Interval Numerical Pruning (UINP); Naive Bayes Classifiers
DOI
10.1109/ISISE.2008.256
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Decision trees are an effective model for classification. Recently, there has been much interest in mining streaming data. Because streaming data is large and unbounded, it is impractical to pass over the entire data more than once, so a one-pass online algorithm is necessary. One of the most successful algorithms for mining data streams is VFDT (Very Fast Decision Tree). We extend the VFDT system to EVFDT (Efficient-VFDT) in two directions: (1) we present the Uneven Interval Numerical Pruning (UINP) approach for efficiently processing numerical attributes; (2) we use naive Bayes classifiers associated with the nodes to process the samples, in order to detect outlying samples and reduce the size of the trees. In the experimental comparison, the two techniques significantly improve the efficiency and accuracy of decision tree construction on streaming data.
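The abstract does not describe how VFDT decides when to split a node in a single pass. As background, the original VFDT algorithm relies on the Hoeffding bound for this decision; the sketch below illustrates that baseline test only, not the EVFDT extensions (UINP or the naive Bayes node classifiers). The function names, the default `delta`, and the use of information gain with range `log2(n_classes)` are assumptions made for illustration.

```python
import math


def hoeffding_bound(value_range: float, delta: float, n: int) -> float:
    """Hoeffding bound: epsilon = sqrt(R^2 * ln(1/delta) / (2n)).

    value_range (R) is the range of the split measure, delta is the
    allowed probability of error, and n is the number of samples seen.
    """
    return math.sqrt(value_range ** 2 * math.log(1.0 / delta) / (2.0 * n))


def should_split(best_gain: float, second_gain: float, n: int,
                 n_classes: int = 2, delta: float = 1e-7) -> bool:
    """Split when the best attribute beats the runner-up by more than epsilon.

    Assumes information gain as the split measure, whose range is
    log2(n_classes). Hypothetical helper, not taken from the paper.
    """
    value_range = math.log2(n_classes)
    epsilon = hoeffding_bound(value_range, delta, n)
    return (best_gain - second_gain) > epsilon


if __name__ == "__main__":
    # With few samples the bound is loose, so the node keeps waiting.
    print(should_split(0.30, 0.15, n=100))
    # With more samples the bound tightens and the same gap justifies a split.
    print(should_split(0.30, 0.15, n=1000))
```

Because the bound shrinks as more samples arrive at a leaf, the same gap between the two best attributes eventually becomes sufficient evidence to split without revisiting earlier data.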
Pages: 597-600
Page count: 4
Related papers
50 in total
  • [31] A Storm-Based Parallel Clustering Algorithm of Streaming Data
    Xu, Fang-Zhu
    Jiang, Zhi-Ying
    He, Yan-Lin
    Wang, Ya-Jie
    Zhu, Qun-Xiong
    NEURAL INFORMATION PROCESSING (ICONIP 2018), PT IV, 2018, 11304 : 134 - 144
  • [32] A streaming data Delaunay triangulation algorithm based on parallel computing
    Li, Jian
    Li, Deren
    Shao, Zhenfeng
    Wuhan Daxue Xuebao (Xinxi Kexue Ban)/Geomatics and Information Science of Wuhan University, 2013, 38 (07): 794 - 798
  • [33] Multidimensional data management decision of the ground LiDAR resources based on the improved differential evolution algorithm
    Li, Jing
    Chung, Soo-Jin
    Yang, MaoBao
    Xu, Jin
    Guo, HangYuan
    BASIC & CLINICAL PHARMACOLOGY & TOXICOLOGY, 2019, 125 : 40 - 40
  • [34] An Improved Decision Tree Algorithm Based on Mutual Information
    Fang, Lietao
    Jiang, Hong
    Cui, Shuqi
    2017 13TH INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION, FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY (ICNC-FSKD), 2017,
  • [35] Improved edge detection algorithm based on decision tree
    Cai Aiping
    MECHATRONICS AND INDUSTRIAL INFORMATICS, PTS 1-4, 2013, 321-324 : 1080 - 1084
  • [36] Decision Boundary Learning Based on an Improved PSO Algorithm
    Watarai, Kyohei
    Zhao, Qiangfu
    Kaneda, Yuya
    PROCEEDINGS 2012 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2012, : 2958 - 2962
  • [37] A packet classification algorithm based on improved decision tree
    Anyang Institute of Technology, Anyang, Henan, 455000, China
    1600, Academy Publisher (08):
  • [38] Decision Trees for Uncertain Data
    Tsang, Smith
    Kao, Ben
    Yip, Kevin Y.
    Ho, Wai-Shing
    Lee, Sau Dan
    ICDE: 2009 IEEE 25TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, VOLS 1-3, 2009, : 441 - +
  • [39] An improved algorithm for decision-tree-based SVM
    Wang, Xiaodan
    Shi, Zhaohui
    Wu, Chongming
    Wang, Wei
    WCICA 2006: SIXTH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION, VOLS 1-12, CONFERENCE PROCEEDINGS, 2006, : 4234 - +
  • [40] Decision Trees for Uncertain Data
    Tsang, Smith
    Kao, Ben
    Yip, Kevin Y.
    Ho, Wai-Shing
    Lee, Sau Dan
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2011, 23 (01) : 64 - 78