Mining Approximate Frequent Itemsets over Data Streams Using Window Sliding Techniques

被引:0
|
作者
Kim, Younghee [1 ]
Park, Eunkyoung [1 ]
Kim, Ungmo [1 ]
机构
[1] Sungkyunkwan Univ, Sch Informat & Commun Engn, Suwon 440746, Gyeonggi Do, South Korea
来源
关键词
Data Stream; Maximal approximate frequent itemsets; Potential frequent itemsets; Chernoff bound;
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Frequent itemset mining is a core data mining operation and has been extensively studied in a broad range of application. The frequent data stream itemset mining is to find an approximate set of frequent itemsets in transaction with respect to a given support threshold. In this paper, we consider the problem of approximate that frequency counts for space efficient computation over data stream sliding windows. Approximate frequent itemsets mining algorithms use a user-specified error parameter, E, to obtain an extra set of itemsets that are potential to become frequent later. Hence, we developed an algorithm based on the Chernoff bound for finding frequent itemsets over data stream sliding window. We present an improved algorithm MAFIM (a maximal approximate frequent itemsets mining) for frequent itemsets mining based on approximate counting using previous saved maximal frequent itemsets. The proposed algorithm gave a guarantee of the output quality and also a bound on the memory usage.
引用
收藏
页码:49 / 56
页数:8
相关论文
共 50 条
  • [1] Mining frequent itemsets over data streams using efficient window sliding techniques
    Li, Hua-Fu
    Lee, Suh-Yin
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2009, 36 (02) : 1466 - 1477
  • [2] Mining weighted frequent itemsets using window sliding over data streams
    Kim, Younghee
    Kim, Wonyoung
    Ryu, Joonsuk
    Kim, Ungmo
    [J]. ICCIT: 2009 FOURTH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCES AND CONVERGENCE INFORMATION TECHNOLOGY, VOLS 1 AND 2, 2009, : 708 - 713
  • [3] Mining maximal frequent itemsets in a sliding window over data streams
    Mao, Yimin
    Li, Hong
    Yang, Luming
    Liu, Lixin
    [J]. Gaojishu Tongxin/Chinese High Technology Letters, 2010, 20 (11): : 1142 - 1148
  • [4] Mining Recent Maximal Frequent Itemsets Over Data Streams with Sliding Window
    Cai, Saihua
    Hao, Shangbo
    Sun, Ruizhi
    Wu, Gang
    [J]. INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY, 2019, 16 (06) : 961 - 969
  • [5] Mining frequent itemsets in data streams using the weighted sliding window model
    Tsai, Pauray S. M.
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2009, 36 (09) : 11617 - 11625
  • [6] A frequent itemsets mining algorithm based on matrix in sliding window over data streams
    Fan Guidan
    Yin Shaohong
    [J]. 2013 THIRD INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEM DESIGN AND ENGINEERING APPLICATIONS (ISDEA), 2013, : 66 - 69
  • [7] Efficient maintenance and mining of frequent itemsets over Online data streams with a sliding window
    Hua-Fu Li
    Chin-Chuan Ho
    Man-Kwan Shan
    Suh-Yin Lee
    [J]. 2006 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS, VOLS 1-6, PROCEEDINGS, 2006, : 2672 - +
  • [8] Mining Recent Frequent Itemsets over Data Streams with a Time-Sensitive Sliding Window
    Jin, Long
    Chai, Duck Jin
    Lee, Jun Wook
    Ryu, Keun Ho
    [J]. ADVANCES IN WEB AND NETWORK TECHNOLOGIES, AND INFORMATION MANAGEMENT, PROCEEDINGS, 2007, 4537 : 62 - +
  • [9] Mining Discriminative Itemsets Over Data Streams Using Efficient Sliding Window
    Seyfi M.
    Nayak R.
    Xu Y.
    [J]. SN Computer Science, 4 (5)
  • [10] Sliding Window-based Frequent Itemsets Mining over Data Streams using Tail Pointer Table
    Le Wang
    Lin Feng
    Bo Jin
    [J]. International Journal of Computational Intelligence Systems, 2014, 7 : 25 - 36