Mining frequent closed trees in evolving data streams

被引:9
|
作者
Bifet, Albert [1 ,2 ]
Gavalda, Ricard [2 ]
机构
[1] Univ Waikato, Dept Comp Sci, Barcelona, Spain
[2] Univ Politecn Cataluna, Dept LSI, LARCA Res Grp, Barcelona, Spain
关键词
ITEMSETS;
D O I
10.3233/IDA-2010-0454
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose new algorithms for adaptively mining closed rooted trees, both labeled and unlabeled, from data streams that change over time. Closed patterns are powerful representatives of frequent patterns, since they eliminate redundant information. Our approach is based on an advantageous representation of trees and a low-complexity notion of relaxed closed trees, as well as ideas from Galois Lattice Theory. More precisely, we present three closed tree mining algorithms in sequence: an incremental one, INCTREEMINER, a sliding-window based one, WINTREEMINER, and finally one that mines closed trees adaptively from data streams, ADATREEMINER. By adaptive we mean here that it presents at all times the closed trees that are frequent in the current state of the data stream. To the best of our knowledge this is the first work on mining closed frequent trees in streaming data varying with time. We give a first experimental evaluation of the proposed algorithms.
引用
收藏
页码:29 / 48
页数:20
相关论文
共 50 条
  • [31] Mining Recent Frequent Itemsets in Data Streams
    Li, Kun
    Wang, Yong-yan
    Ellahi, Manzoor
    Wang, Hong-an
    FIFTH INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY, VOL 4, PROCEEDINGS, 2008, : 353 - 358
  • [32] Mining Robust Frequent Items in Data Streams
    Xia, Rui
    Dai, Haipeng
    Du, Zhanchao
    Li, Meng
    Liu, Alex X.
    Chen, Guihai
    2020 IEEE INTERNATIONAL CONFERENCE ON JOINT CLOUD COMPUTING (JCC 2020), 2020, : 110 - 117
  • [33] MFIS - Mining frequent itemsets on data streams
    Xie, Zhi-jun
    Chen, Hong
    Li, Cuiping
    ADVANCED DATA MINING AND APPLICATIONS, PROCEEDINGS, 2006, 4093 : 1085 - 1093
  • [34] Data Streams Fusion by Frequent Correlations Mining
    Ziembinski, Radoslaw Z.
    INTELLIGENT DATA ENGINEERING AND AUTOMATED LEARNING - IDEAL 2015, 2015, 9375 : 1 - 8
  • [35] RETRACTED: Mining Top-K frequent closed itemset in data streams (Retracted Article)
    Li, Jun
    Hou, Xiuhong
    Gong, Sen
    2011 INTERNATIONAL CONFERENCE ON ENERGY AND ENVIRONMENTAL SCIENCE-ICEES 2011, 2011, 11
  • [36] A New Sliding Window Based Algorithm for Frequent Closed Itemset Mining Over Data Streams
    Nori, Fatemeh
    Deypir, Mahmood
    Sadreddini, Mohamad Hadi
    Ziarati, Korosh
    2011 1ST INTERNATIONAL ECONFERENCE ON COMPUTER AND KNOWLEDGE ENGINEERING (ICCKE), 2011, : 249 - 253
  • [37] Mining Evolving Data Streams with Particle Filters
    Fok, Ricky
    An, Aijun
    Wang, Xiaogang
    COMPUTATIONAL INTELLIGENCE, 2017, 33 (02) : 147 - 180
  • [38] Random Forests of Very Fast Decision Trees on GPU for Mining Evolving Big Data Streams
    Marron, Diego
    Bifet, Albert
    Morales, Gianmarco De Francisci
    21ST EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE (ECAI 2014), 2014, 263 : 615 - +
  • [39] Mining frequent closed unordered trees through natural representations
    Balcazar, Jose L.
    Bifet, Albert
    Lozano, Antoni
    CONCEPTUAL STRUCTURES: KNOWLEDGE ARCHITECTURES FOR SMART APPLICATIONS, PROCEEDINGS, 2007, 4604 : 347 - +
  • [40] Mining frequent closed patterns in microarray data
    Cong, G
    Tan, KL
    Tung, AKH
    Pan, F
    FOURTH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2004, : 363 - 366