Novel Class Detection in Concept-Drifting Data Stream Mining Employing Decision Tree

被引:0
|
作者
Farid, Dewan Md [1 ]
Rahman, Chowdhury Mofizur [1 ]
机构
[1] United Int Univ, Dept Comp Sci & Engn, Dhaka 1209, Bangladesh
关键词
Conpect drift; data stream mining; decision tree; incremental learning; novel class; EVOLVING DATA STREAMS; CLASSIFICATION;
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we propose a new approach for detecting novel class in data stream mining using decision tree classifier that can determine whether an unseen or new instance belongs to a novel class. Most existing data mining classifiers can not detect and classify the novel class instances in real-time data stream mining problems like weather conditions, economical changes, astronomical, and intrusion detection etc, untill the classification models are trained with the labeled instances of the novel class. Arrival of a novel class in concept-drift occurs in data stream mining when new data introduce the new concept classes or remove the old ones. The proposed approach for incremental learning of concept drift considers mining, where the streaming data distributions change over time. It build a decision tree model from training dataset, which continuously updates so that the tree represents the most recent concept in data stream. The experiments on real benchmark data evaluate the efficiency of the proposed approach in both detecting the novel class and classification accuracy with comparisons of traditional data mining classifiers.
引用
收藏
页数:4
相关论文
共 50 条
  • [1] An efficient and sensitive decision tree approach to mining concept-drifting data streams
    Tsai, Cheng-Jurig
    Lee, Chien-I
    Yang, Wei-Pang
    [J]. INFORMATICA, 2008, 19 (01) : 135 - 156
  • [2] Pyramid Stack Data Stream Mining for Handling Concept-drifting
    Xu, Zhuoran
    Hou, Cuiqin
    Xia, Yingju
    Sun, Jun
    Inakoshi, Hiroya
    Yugami, Nobuhiro
    [J]. PROCEEDINGS OF THE 2016 IEEE REGION 10 CONFERENCE (TENCON), 2016, : 33 - 37
  • [3] Integrating Novel Class Detection with Classification for Concept-Drifting Data Streams
    Masud, Mohammad M.
    Gao, Jing
    Khan, Latifur
    Han, Jiawei
    Thuraisingham, Bhavani
    [J]. MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, PT II, 2009, 5782 : 79 - +
  • [4] Ambiguous decision trees for mining concept-drifting data streams
    Liu, Jing
    Li, Xue
    Zhong, Weicai
    [J]. PATTERN RECOGNITION LETTERS, 2009, 30 (15) : 1347 - 1355
  • [5] Novel Class Detection in Concept Drifting Data Streams Using Decision Tree Leaves
    Saha, Deepita
    Haque, Md Mozzammel
    Sarkar, Akash
    Alam, Famina
    Farid, Dewan Md
    Rahman, Chowdhury Mofizur
    Shatabda, Swakkhar
    [J]. 2018 4TH IEEE INTERNATIONAL WIE CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING (IEEE WIECON-ECE 2018), 2018, : 87 - 90
  • [6] Classification and Novel Class Detection in Concept-Drifting Data Streams under Time Constraints
    Masud, Mohammad M.
    Gao, Jing
    Khan, Latifur
    Han, Jiawei
    Thuraisingham, Bhavani
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2011, 23 (06) : 859 - 874
  • [7] Mining Concept-Drifting Data Streams with Multiple Semi-Random Decision Trees
    Li, Peipei
    Hu, Xuegang
    Wu, Xindong
    [J]. ADVANCED DATA MINING AND APPLICATIONS, PROCEEDINGS, 2008, 5139 : 733 - 740
  • [8] On reducing classifier granularity in mining concept-drifting data streams
    Wang, P
    Wang, HX
    Wu, XC
    Wang, W
    Shi, BL
    [J]. FIFTH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2005, : 474 - 481
  • [9] Mining Concept-Drifting Data Streams Containing Labeled and Unlabeled Instances
    Borchani, Hanen
    Larranaga, Pedro
    Bielza, Concha
    [J]. TRENDS IN APPLIED INTELLIGENT SYSTEMS, PT I, PROCEEDINGS, 2010, 6096 : 531 - 540
  • [10] Mining Concept-Drifting and Noisy Data Streams using Ensemble Classifiers
    Ouyang, Zhenzheng
    Zhou, Min
    Wang, Tao
    Wu, Quanyuan
    [J]. 2009 INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND COMPUTATIONAL INTELLIGENCE, VOL IV, PROCEEDINGS, 2009, : 360 - +