Approximate mining of maximal frequent itemsets in data streams with different window models

被引:10
|
作者
Li, Hua-Fu [1 ]
Lee, Suh-Yin [2 ]
机构
[1] Kainan Univ, Dept Comp Sci, Tao Yuan 338, Taiwan
[2] Natl Tsing Hua Univ, Dept Comp Sci, Hsinchu 300, Taiwan
关键词
data mining; data streams; maximal frequent itemsets; one-pass mining; approximate mining;
D O I
10.1016/j.eswa.2007.07.046
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A data stream is a massive, open-ended sequence of data elements continuously generated at a rapid rate. Mining data streams is more difficult than mining static databases because the huge, high-speed and continuous characteristics of streaming data. In this paper, we propose a new one-pass algorithm called DSM-MFI (stands for Data Stream Mining for Maximal Frequent Itemsets), which mines the set of all maximal frequent itemsets in landmark windows over data streams. A new summary data structure called summary frequent itemset forest (abbreviated as SFI-forest) is developed for incremental maintaining the essential information about maximal frequent itemsets embedded in the stream so far. Theoretical analysis and experimental studies show that the proposed algorithm is efficient and scalable for mining the set of all maximal frequent itemsets over the entire history of the data streams. (c) 2007 Elsevier Ltd. All rights reserved.
引用
收藏
页码:781 / 789
页数:9
相关论文
共 50 条
  • [21] Approximate Frequent Itemsets Mining on Data Streams Using Hashing and Lexicographic Order in Hardware
    Bustio-Martinez, Lazaro
    Cumplido, Rene
    Letras-Luna, Martin
    Feregrino Uribe, Claudia
    Hernandez-Leon, Raudel
    Bande-Serrano, Jose M.
    2017 IEEE 8TH LATIN AMERICAN SYMPOSIUM ON CIRCUITS & SYSTEMS (LASCAS), 2017,
  • [22] Mining frequent closed itemsets from a landmark window over online data streams
    Liu, Xuejun
    Guan, Jihong
    Hu, Ping
    COMPUTERS & MATHEMATICS WITH APPLICATIONS, 2009, 57 (06) : 927 - 936
  • [23] Variable slide window based frequent itemsets mining algorithm on large data streams
    Zhu, Xiao-Dong
    Huang, Zhi-Qiu
    Shen, Guo-Hua
    Yuan, Min
    Kongzhi yu Juece/Control and Decision, 2009, 24 (06): : 832 - 836
  • [24] A frequent itemsets mining algorithm based on matrix in sliding window over data streams
    Fan Guidan
    Yin Shaohong
    2013 THIRD INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEM DESIGN AND ENGINEERING APPLICATIONS (ISDEA), 2013, : 66 - 69
  • [25] Mining frequent itemsets over data streams using efficient window sliding techniques
    Li, Hua-Fu
    Lee, Suh-Yin
    EXPERT SYSTEMS WITH APPLICATIONS, 2009, 36 (02) : 1466 - 1477
  • [26] Efficient maintenance and mining of frequent itemsets over Online data streams with a sliding window
    Hua-Fu Li
    Chin-Chuan Ho
    Man-Kwan Shan
    Suh-Yin Lee
    2006 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS, VOLS 1-6, PROCEEDINGS, 2006, : 2672 - +
  • [27] An efficient approach to mining frequent itemsets on data streams
    Ansari, Sara
    Sadreddini, Mohammad Hadi
    World Academy of Science, Engineering and Technology, 2009, 37 : 489 - 495
  • [28] Fast Mining of Closed Frequent Itemsets in Data Streams
    Mao Yimin
    Chen Zhigang
    Liu Lixin
    INFORMATION TECHNOLOGY APPLICATIONS IN INDUSTRY, PTS 1-4, 2013, 263-266 : 231 - +
  • [29] Mining of Frequent Itemsets from Streams of Uncertain Data
    Leung, Carson Kai-Sang
    Hao, Boyu
    ICDE: 2009 IEEE 25TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, VOLS 1-3, 2009, : 1663 - 1670
  • [30] Efficient mining of frequent itemsets from data streams
    Leung, Carson Kai-Sang
    Brajczuk, Dale A.
    SHARING DATA, INFORMATION AND KNOWLEDGE, PROCEEDINGS, 2008, 5071 : 2 - 14