Maintaining frequent closed itemsets over a sliding window

被引:0
|
作者
James Cheng
Yiping Ke
Wilfred Ng
机构
[1] The Hong Kong University of Science and Technology,Department of Computer Science and Engineering
关键词
Frequent Closed Itemset; Data stream mining; Sliding window;
D O I
暂无
中图分类号
学科分类号
摘要
In this paper, we study the incremental update of Frequent Closed Itemsets (FCIs) over a sliding window in a high-speed data stream. We propose the notion of semi-FCIs, which is to progressively increase the minimum support threshold for an itemset as it is retained longer in the window, thereby drastically reducing the number of itemsets that need to be maintained and processed. We explore the properties of semi-FCIs and observe that a majority of the subsets of a semi-FCI are not semi-FCIs and need not be updated. This finding allows us to devise an efficient algorithm, IncMine, that incrementally updates the set of semi-FCIs over a sliding window. We also develop an inverted index to facilitate the update process. Our empirical results show that IncMine achieves significantly higher throughput and consumes less memory than the state-of-the-art streaming algorithms for mining FCIs and FIs. IncMine also attains high accuracy of 100% precision and over 93% recall.
引用
收藏
页码:191 / 215
页数:24
相关论文
共 50 条
  • [1] Maintaining frequent closed itemsets over a sliding window
    Cheng, James
    Ke, Yiping
    Ng, Wilfred
    [J]. JOURNAL OF INTELLIGENT INFORMATION SYSTEMS, 2008, 31 (03) : 191 - 215
  • [2] Moment: Maintaining closed frequent itemsets over a stream sliding window
    Chi, Y
    Wang, HX
    Yu, PS
    Muntz, RR
    [J]. FOURTH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2004, : 59 - 66
  • [3] Catch the moment: maintaining closed frequent itemsets over a data stream sliding window
    Chi, Yun
    Wang, Haixun
    Yu, Philip S.
    Muntz, Richard R.
    [J]. KNOWLEDGE AND INFORMATION SYSTEMS, 2006, 10 (03) : 265 - 294
  • [4] Catch the moment: maintaining closed frequent itemsets over a data stream sliding window
    Yun Chi
    Haixun Wang
    Philip S. Yu
    Richard R. Muntz
    [J]. Knowledge and Information Systems, 2006, 10 : 265 - 294
  • [5] Mining Closed Frequent Itemsets in the Sliding Window over Data Stream
    Mao Yinmin
    Yang Lumin
    Li Hong
    Chen Zhigang
    Liu Lixin
    [J]. 2009 IEEE YOUTH CONFERENCE ON INFORMATION, COMPUTING AND TELECOMMUNICATION, PROCEEDINGS, 2009, : 146 - 149
  • [6] Online mining closed frequent itemsets over a stream sliding window
    Ao, Fu-Jiang
    Du, Jing
    Yan, Yue-Jin
    Huang, Ke-Di
    [J]. Xi Tong Gong Cheng Yu Dian Zi Ji Shu/Systems Engineering and Electronics, 2009, 31 (05): : 1235 - 1240
  • [7] A Fast Algorithm for Mining Frequent Closed Itemsets over Stream Sliding Window
    Yen, Show-Jane
    Wu, Cheng-Wei
    Lee, Yue-Shi
    Tseng, Vincent S.
    Hsieh, Chaur-Heh
    [J]. IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS (FUZZ 2011), 2011, : 996 - 1002
  • [8] Finding, Frequent Closed Itemsets in Sliding Window in Linear Time
    Chen, Junbo
    Zhou, Bo
    Chen, Lu
    Wang, Xinyu
    Ding, Yiqun
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2008, E91D (10): : 2406 - 2418
  • [9] An Efficient Algorithm for Maintaining Frequent Closed Itemsets over Data Stream
    Yen, Show-Jane
    Lee, Yue-Shi
    Wu, Cheng-Wei
    Lin, Chin-Lin
    [J]. NEXT-GENERATION APPLIED INTELLIGENCE, PROCEEDINGS, 2009, 5579 : 767 - 776
  • [10] A Novel Technique For Mining Closed Frequent Itemsets Using Variable Sliding Window
    Kumar, Vikas
    Satapathy, Sangita Rani
    [J]. SOUVENIR OF THE 2014 IEEE INTERNATIONAL ADVANCE COMPUTING CONFERENCE (IACC), 2014, : 504 - 510