An incremental fuzzy decision tree classification method for mining data streams

Cited by: 0
Authors
Wang, Tao [1]
Li, Zhoujun [2]
Yan, Yuejin [1]
Chen, Huowang [1]
Affiliations
[1] Natl Univ Def Technol, Comp Sch, Changsha 410073, Peoples R China
[2] Beihang Univ, Sch Comp Sci & Engn, Beijing 100083, Peoples R China
Funding
US National Science Foundation;
Keywords
data streams; incremental; fuzzy; continuous attribute; threaded binary search tree;
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
One of the most important algorithms for mining data streams is VFDT. It uses the Hoeffding inequality to achieve a probabilistic bound on the accuracy of the constructed tree. Gama et al. have extended VFDT in two directions: their system, VFDTc, can deal with continuous data and uses more powerful classification techniques at the tree leaves. In this paper, we revisit this problem and implement a system, fVFDT, on top of VFDT and VFDTc. We make the following four contributions: 1) We present a threaded binary search tree (TBST) approach for efficiently handling continuous attributes. It builds a threaded binary search tree whose processing time for inserting values is O(n log n), while VFDT's processing time is O(n^2). When a new example arrives, VFDTc needs to update O(log n) attribute-tree nodes, whereas fVFDT needs to update only the one necessary node. 2) We improve the method for finding the best split-test point of a given continuous attribute. Compared with the method used in VFDTc, the processing time improves from O(n log n) to O(n). 3) Compared with VFDTc, the number of candidate split tests in fVFDT decreases from O(n) to O(log n). 4) We improve the soft discretization method for use in data stream mining; it overcomes the problem of noisy data and improves classification accuracy.
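The Hoeffding-bound test that VFDT-style learners rely on can be sketched as follows. This is a minimal illustration using the standard form of the bound, epsilon = sqrt(R^2 ln(1/delta) / (2n)), and is not the authors' fVFDT code. The function names `hoeffding_bound` and `can_split`, and the parameter values in the usage note, are assumptions made for this example.

```python
import math


def hoeffding_bound(value_range: float, delta: float, n: int) -> float:
    """Return epsilon such that, with probability at least 1 - delta, the
    true mean of a variable with range `value_range` is within epsilon of
    the mean observed over n independent samples."""
    return math.sqrt(value_range ** 2 * math.log(1.0 / delta) / (2.0 * n))


def can_split(best_gain: float, second_gain: float,
              value_range: float, delta: float, n: int) -> bool:
    """Hypothetical helper: a leaf may split once the observed gap between
    the two best attributes' heuristic scores exceeds the bound."""
    return (best_gain - second_gain) > hoeffding_bound(value_range, delta, n)
```

For instance, with `value_range=1.0`, `delta=1e-7`, and `n=1000` examples seen at a leaf, the bound is roughly 0.09, so a gain gap of 0.15 would permit a split while a gap of 0.05 would require more examples.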
Pages: 91 / +
Page count: 4
Related papers
50 in total
  • [21] Regularized and incremental decision trees for data streams
    Barddal, Jean Paul
    Enembreck, Fabricio
    ANNALS OF TELECOMMUNICATIONS, 2020, 75 (9-10) : 493 - 503
  • [22] Applying Fuzzy Decision Tree Method for Hypertension Classification in Adolescent
    Sofyan, Hizir
    Elfayani, Elfayani
    Rahmatika, Azalya
    Marzuki, Marzuki
    Irvanizam, Irvanizam
    INTELLIGENT AND FUZZY SYSTEMS: DIGITAL ACCELERATION AND THE NEW NORMAL, INFUS 2022, VOL 1, 2022, 504 : 360 - 368
  • [23] Class Specific Fuzzy Decision Trees for Mining High Speed Data Streams
    Hashemi, Sattar
    Kangavari, Mohammadreza
    Yang, Ying
    FUNDAMENTA INFORMATICAE, 2008, 88 (1-2) : 135 - 160
  • [24] Regularized and incremental decision trees for data streams
    Jean Paul Barddal
    Fabrício Enembreck
    Annals of Telecommunications, 2020, 75 : 493 - 503
  • [25] Decision trees for mining data streams
    Gama, Joao
    Fernandes, Ricardo
    Rocha, Ricardo
    INTELLIGENT DATA ANALYSIS, 2006, 10 (01) : 23 - 45
  • [26] Vlsi Implementation Of Flexible Architecture For Decision Tree Classification In Data Mining
    Sharma, K. Venkatesh
    Shewandagn, Behailu
    Bhukya, Shankar Nayak
    INTERNATIONAL CONFERENCE ON FUNCTIONAL MATERIALS, CHARACTERIZATION, SOLID STATE PHYSICS, POWER, THERMAL AND COMBUSTION ENERGY (FCSPTC-2017), 2017, 1859
  • [27] The research of decision tree learning algorithm in technology of data mining classification
    Department of Mechanical and Electrical Information, Lishui Vocational and Technical College, ZheJiang, China
    J. Convergence Inf. Technol., 2012, 10 (216-223):
  • [28] Data Clustering and Evolving Fuzzy Decision Tree for Data Base Classification Problems
    Chang, Pei-Chann
    Fan, Chin-Yuan
    Wang, Yen-Wen
    ADVANCED INTELLIGENT COMPUTING THEORIES AND APPLICATIONS, PROCEEDINGS: WITH ASPECTS OF CONTEMPORARY INTELLIGENT COMPUTING TECHNIQUES, 2008, 15 : 463 - +
  • [29] An efficient and sensitive decision tree approach to mining concept-drifting data streams
    Tsai, Cheng-Jurig
    Lee, Chien-I
    Yang, Wei-Pang
    INFORMATICA, 2008, 19 (01) : 135 - 156
  • [30] A hybrid decision tree/genetic algorithm method for data mining
    Carvalho, DR
    Freitas, AA
    INFORMATION SCIENCES, 2004, 163 (1-3) : 13 - 35