Mining frequent closed trees in evolving data streams

Cited by: 9
Authors
Bifet, Albert [1, 2]
Gavalda, Ricard [2]
Affiliations
[1] Univ Waikato, Dept Comp Sci, Hamilton, New Zealand
[2] Univ Politecn Cataluna, Dept LSI, LARCA Res Grp, Barcelona, Spain
Keywords
ITEMSETS
DOI
10.3233/IDA-2010-0454
Chinese Library Classification code
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
We propose new algorithms for adaptively mining closed rooted trees, both labeled and unlabeled, from data streams that change over time. Closed patterns are powerful representatives of frequent patterns, since they eliminate redundant information. Our approach is based on an advantageous representation of trees and a low-complexity notion of relaxed closed trees, as well as ideas from Galois Lattice Theory. More precisely, we present three closed tree mining algorithms in sequence: an incremental one, INCTREEMINER; a sliding-window-based one, WINTREEMINER; and finally one that mines closed trees adaptively from data streams, ADATREEMINER. By "adaptive" we mean that it presents at all times the closed trees that are frequent in the current state of the data stream. To the best of our knowledge, this is the first work on mining closed frequent trees in streaming data that varies over time. We give a first experimental evaluation of the proposed algorithms.
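The notion of a closed pattern mentioned in the abstract can be illustrated with a small sketch. This is not INCTREEMINER, WINTREEMINER, or ADATREEMINER: it uses itemsets instead of trees, a brute-force search, and a fixed-size window, and the function names and sample data are hypothetical. It only shows that a frequent pattern is closed when no proper superpattern has the same support, and that the closed set can change as the window slides.

```python
from itertools import combinations


def support(pattern, transactions):
    """Count the transactions that contain every item of `pattern`."""
    return sum(1 for t in transactions if pattern <= t)


def frequent_closed_itemsets(transactions, min_support):
    """Brute-force illustration (hypothetical helper, itemsets not trees).

    Returns {itemset: support} for every frequent itemset that has no
    proper superset with the same support.
    """
    items = sorted(set().union(*transactions)) if transactions else []
    frequent = {}
    for size in range(1, len(items) + 1):
        for combo in combinations(items, size):
            pattern = frozenset(combo)
            count = support(pattern, transactions)
            if count >= min_support:
                frequent[pattern] = count
    # A superset with equal support is itself frequent, so it is enough
    # to compare against the frequent itemsets only.
    return {
        p: s
        for p, s in frequent.items()
        if not any(p < q and s == frequent[q] for q in frequent)
    }


# Sample window of four transactions (made-up data).
window = [{"a", "b", "c"}, {"a", "b"}, {"a", "b", "d"}, {"c"}]
closed_before = frequent_closed_itemsets(window, min_support=2)

# Slide the window: drop the oldest transaction, append a new one.
window = window[1:] + [{"c", "d"}]
closed_after = frequent_closed_itemsets(window, min_support=2)
```

In the first window, {a} and {b} are frequent but not closed, because {a, b} has the same support; only {a, b} and {c} are reported. After the slide, {d} becomes frequent and closed as well.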
Pages: 29-48
Number of pages: 20