Novel Class Detection in Concept-Drifting Data Stream Mining Employing Decision Tree

Cited by: 0
Authors
Farid, Dewan Md [1 ]
Rahman, Chowdhury Mofizur [1 ]
Affiliations
[1] United Int Univ, Dept Comp Sci & Engn, Dhaka 1209, Bangladesh
Keywords
Concept drift; data stream mining; decision tree; incremental learning; novel class; EVOLVING DATA STREAMS; CLASSIFICATION
DOI
Not available
Chinese Library Classification number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
In this paper, we propose a new approach for detecting novel classes in data stream mining, using a decision tree classifier that can determine whether an unseen instance belongs to a novel class. Most existing data mining classifiers cannot detect and classify novel class instances in real-time data stream mining problems, such as weather conditions, economic changes, astronomy, and intrusion detection, until the classification models are trained with labeled instances of the novel class. In data stream mining, a novel class arrives under concept drift when new data introduce new concept classes or remove old ones. The proposed approach learns incrementally under concept drift, where the distribution of the streaming data changes over time. It builds a decision tree model from the training dataset and updates it continuously so that the tree represents the most recent concept in the data stream. Experiments on real benchmark data evaluate the efficiency of the proposed approach in both novel class detection and classification accuracy, in comparison with traditional data mining classifiers.
Pages: 4
Related papers
50 in total
  • [31] An adaptive distributed ensemble approach to mine concept-drifting data streams
    Folino, Gianluigi
    Pizzuti, Clara
    Spezzano, Giandomenico
    [J]. 19TH IEEE INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE, VOL II, PROCEEDINGS, 2007, : 183 - 187
  • [32] Selective prototype-based learning on concept-drifting data streams
    Chen, Dongzi
    Yang, Qinli
    Liu, Jiaming
    Zeng, Zhu
    [J]. INFORMATION SCIENCES, 2020, 516 : 20 - 32
  • [33] Granularity adaptive density estimation and on demand clustering of concept-drifting data streams
    Zhu, Weiheng
    Pei, Jian
    Yin, Jian
    Xie, Yihuang
    [J]. DATA WAREHOUSING AND KNOWLEDGE DISCOVERY, PROCEEDINGS, 2006, 4081 : 322 - 331
  • [34] A Statistical Decision Tree Algorithm for Medical Data Stream Mining
    Cazzolato, Mirela Teixeira
    Ribeiro, Marcela Xavier
    [J]. 2013 IEEE 26TH INTERNATIONAL SYMPOSIUM ON COMPUTER-BASED MEDICAL SYSTEMS (CBMS), 2013, : 389 - 392
  • [35] Enhancement of Very Fast Decision Tree for Data Stream Mining
    Lefa, Mai
    Abd-Elkader, Hatem
    Salem, Rashed
    [J]. STUDIES IN INFORMATICS AND CONTROL, 2022, 31 (02): : 49 - 60
  • [36] Ubiquitous Self-Organizing Map: Learning Concept-Drifting Data Streams
    Silva, Bruno
    Marques, Nuno
    [J]. NEW CONTRIBUTIONS IN INFORMATION SYSTEMS AND TECHNOLOGIES, VOL 1, PT 1, 2015, 353 : 713 - 722
  • [37] Decision tree-based Feature Ranking in Concept Drifting Data Streams
    Pereira Karax, Jean Antonio
    Malucelli, Andreia
    Barddal, Jean Paul
    [J]. SAC '19: PROCEEDINGS OF THE 34TH ACM/SIGAPP SYMPOSIUM ON APPLIED COMPUTING, 2019, : 590 - 592
  • [38] Online multi-dimensional regression analysis on concept-drifting data streams
    Nadungodage, Chandima Hewa
    Xia, Yuni
    Vaidya, Pranav S.
    Chen, Yu
    Lee, Jaehwan John
    [J]. INTERNATIONAL JOURNAL OF DATA MINING MODELLING AND MANAGEMENT, 2014, 6 (03) : 217 - 238
  • [39] Method of Concept-Drifting Feature Extracting in Data Streams based on Granular Computing
    Ju, Chunhua
    Shuai, Zhaoqian
    [J]. INTELLIGENT STRUCTURE AND VIBRATION CONTROL, PTS 1 AND 2, 2011, 50-51 : 934 - +
  • [40] An Ensemble of Classifiers Algorithm Based on GA for Handling Concept-Drifting Data Streams
    Guan, Jinghua
    Guo, Wu
    Chen, Heng
    Lou, Oujun
    [J]. 2014 SIXTH INTERNATIONAL SYMPOSIUM ON PARALLEL ARCHITECTURES, ALGORITHMS AND PROGRAMMING (PAAP), 2014, : 282 - 284