Mining frequent closed trees in evolving data streams

被引:9
|
作者
Bifet, Albert [1 ,2 ]
Gavalda, Ricard [2 ]
机构
[1] Univ Waikato, Dept Comp Sci, Barcelona, Spain
[2] Univ Politecn Cataluna, Dept LSI, LARCA Res Grp, Barcelona, Spain
关键词
ITEMSETS;
D O I
10.3233/IDA-2010-0454
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose new algorithms for adaptively mining closed rooted trees, both labeled and unlabeled, from data streams that change over time. Closed patterns are powerful representatives of frequent patterns, since they eliminate redundant information. Our approach is based on an advantageous representation of trees and a low-complexity notion of relaxed closed trees, as well as ideas from Galois Lattice Theory. More precisely, we present three closed tree mining algorithms in sequence: an incremental one, INCTREEMINER, a sliding-window based one, WINTREEMINER, and finally one that mines closed trees adaptively from data streams, ADATREEMINER. By adaptive we mean here that it presents at all times the closed trees that are frequent in the current state of the data stream. To the best of our knowledge this is the first work on mining closed frequent trees in streaming data varying with time. We give a first experimental evaluation of the proposed algorithms.
引用
收藏
页码:29 / 48
页数:20
相关论文
共 50 条
  • [41] Mining frequent closed itemsets for large data
    Fu, HG
    Nguifo, EM
    PROCEEDINGS OF THE 2004 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA'04), 2004, : 328 - 335
  • [42] Learning model trees from evolving data streams
    Ikonomovska, Elena
    Gama, Joao
    Dzeroski, Saso
    DATA MINING AND KNOWLEDGE DISCOVERY, 2011, 23 (01) : 128 - 168
  • [43] Learning model trees from evolving data streams
    Elena Ikonomovska
    João Gama
    Sašo Džeroski
    Data Mining and Knowledge Discovery, 2011, 23 : 128 - 168
  • [44] Efficient mining of frequent itemsets from data streams
    Leung, Carson Kai-Sang
    Brajczuk, Dale A.
    SHARING DATA, INFORMATION AND KNOWLEDGE, PROCEEDINGS, 2008, 5071 : 2 - 14
  • [45] An efficient algorithm for frequent itemset mining on data streams
    Xie Zhi-Jun
    Chen Hong
    Li, Cuiping
    ADVANCES IN DATA MINING: APPLICATIONS IN MEDICINE, WEB MINING, MARKETING, IMAGE AND SIGNAL MINING, 2006, 4065 : 474 - 491
  • [46] Anytime Frequent Itemset Mining of Transactional Data Streams
    Goyal, Poonam
    Challa, Jagat Sesh
    Shrivastava, Shivin
    Goyal, Navneet
    BIG DATA RESEARCH, 2020, 21
  • [47] Methods for mining frequent items in data streams: an overview
    Hongyan Liu
    Yuan Lin
    Jiawei Han
    Knowledge and Information Systems, 2011, 26 : 1 - 30
  • [48] Mining maximal frequent itemsets from data streams
    Mao, Guojun
    Wu, Xindong
    Zhu, Xingquan
    Chen, Gong
    Liu, Chunnian
    JOURNAL OF INFORMATION SCIENCE, 2007, 33 (03) : 251 - 262
  • [49] An efficient approach to mining frequent itemsets on data streams
    Ansari, Sara
    Sadreddini, Mohammad Hadi
    World Academy of Science, Engineering and Technology, 2009, 37 : 489 - 495
  • [50] Mining of Frequent Itemsets from Streams of Uncertain Data
    Leung, Carson Kai-Sang
    Hao, Boyu
    ICDE: 2009 IEEE 25TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, VOLS 1-3, 2009, : 1663 - 1670