Mining Recent Frequent Itemsets over Data Streams with a Time-Sensitive Sliding Window

被引:0
|
作者
Jin, Long [2 ]
Chai, Duck Jin [2 ]
Lee, Jun Wook [1 ]
Ryu, Keun Ho [2 ]
机构
[1] ETRI, Telemat Res Div, Taejon, South Korea
[2] Chungbuk Natl Univ, Database Bioinformat Lab, Cheongju, South Korea
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
A data stream is a massive unbounded sequence of data elements continuously generated at a rapid rate. Consequently, the knowledge embedded in a data stream is more likely to be changed as time goes by. Mining frequent patterns is the one of them and has been widely studied over the last decade. There are several models and approaches, but there is only one study on the time-sensitive sliding window model. This study spends much memory and has a low accuracy. In this paper, we propose an efficient discounting method and a Sketch data structure for solving these problems. This approach has several advantages. (i) The accuracy is increased compared with that of previous techniques. The efficient discounting method not only loses the information about accumulated Count but also decrease many missing true answers. (ii) The memory is saved. The Sketch data structure saves much space. (iii) It is not necessary to have the discount table and reduce significantly the computing time of discounting table. Experiment results show that Our proposed method exactly increases the accuracy and saves the memory and the computing time.
引用
收藏
页码:62 / +
页数:2
相关论文
共 50 条
  • [31] Mining the frequent patterns in an arbitrary sliding window over online data streams
    Li, Guo-Hui
    Chen, Hui
    Ruan Jian Xue Bao/Journal of Software, 2008, 19 (10): : 2585 - 2596
  • [32] Online mining closed frequent itemsets over a stream sliding window
    Ao, Fu-Jiang
    Du, Jing
    Yan, Yue-Jin
    Huang, Ke-Di
    Xi Tong Gong Cheng Yu Dian Zi Ji Shu/Systems Engineering and Electronics, 2009, 31 (05): : 1235 - 1240
  • [33] A survey on algorithms for mining frequent itemsets over data streams
    Cheng, James
    Ke, Yiping
    Ng, Wilfred
    KNOWLEDGE AND INFORMATION SYSTEMS, 2008, 16 (01) : 1 - 27
  • [34] Mining of Probabilistic Frequent Itemsets over Uncertain Data Streams
    Liu Lixin
    Zhang Xiaolin
    Zhang Huanxiang
    2014 11TH WEB INFORMATION SYSTEM AND APPLICATION CONFERENCE (WISA), 2014, : 231 - 237
  • [35] A survey on algorithms for mining frequent itemsets over data streams
    James Cheng
    Yiping Ke
    Wilfred Ng
    Knowledge and Information Systems, 2008, 16 : 1 - 27
  • [36] Utility-based Frequent Itemsets in Data Streams using Sliding Window
    Amballoor, Renji George
    Naik, Shankar B.
    2021 IEEE INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATION, AND INTELLIGENT SYSTEMS (ICCCIS), 2021, : 108 - 112
  • [37] An adaptive approximation method to discover frequent itemsets over sliding-window-based data streams
    Li, Chao-Wei
    Jea, Kuen-Fang
    EXPERT SYSTEMS WITH APPLICATIONS, 2011, 38 (10) : 13386 - 13404
  • [38] Mining frequent itemsets in data streams within a time horizon
    Troiano, Luigi
    Scibelli, Giacomo
    DATA & KNOWLEDGE ENGINEERING, 2014, 89 : 21 - 37
  • [39] Mining recent frequent itemsets in data streams by radioactively attenuating strategy
    Jia, LF
    Wang, Z
    Zhou, CG
    Xu, XJ
    ADVANCED DATA MINING AND APPLICATIONS, PROCEEDINGS, 2005, 3584 : 804 - 811
  • [40] A Fast Algorithm for Mining Frequent Closed Itemsets over Stream Sliding Window
    Yen, Show-Jane
    Wu, Cheng-Wei
    Lee, Yue-Shi
    Tseng, Vincent S.
    Hsieh, Chaur-Heh
    IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS (FUZZ 2011), 2011, : 996 - 1002