Sliding window based weighted erasable stream pattern mining for stream data applications

Cited by: 38
Authors
Yun, Unil [1]
Lee, Gangin [1]
Affiliations
[1] Sejong Univ, Dept Comp Engn, Seoul, South Korea
Funding
National Research Foundation of Singapore;
Keywords
Weighted erasable pattern mining; Sliding window; Weight condition; Data stream; Data mining; K FREQUENT PATTERNS; ITEMSETS; ALGORITHM;
DOI
10.1016/j.future.2015.12.012
Chinese Library Classification number
TP301 [Theory, Methods];
Discipline classification code
081202;
Abstract
As one of the variations of frequent pattern mining, erasable pattern mining discovers patterns with benefits lower than or equal to a user-specified threshold from a product database. Although traditional erasable pattern mining algorithms can perform their mining operations in static environments, they are not suitable for dynamic data stream environments. In such environments, algorithms have to process the data immediately with only one database scan. However, previous tree-based erasable pattern mining methods have difficulty processing dynamic data streams because they need two or more database scans to construct their tree structures. In addition, they do not consider item-specific information within a product database, although mining operations need to take such information into account in order to find more useful erasable patterns. For this reason, in this paper, we propose a weighted erasable pattern mining algorithm suitable for sliding window-based data stream environments. The algorithm employs tree and list data structures for more efficient mining and solves the problems of previous erasable pattern mining approaches by using a sliding window-based stream processing technique and an item weight-based pattern pruning method. We compare the performance of the proposed algorithm with state-of-the-art tree-based approaches on various real and synthetic datasets. Experimental results show that our method is more efficient and scalable than the competitors in terms of runtime, memory, and pattern generation. (C) 2015 Elsevier B.V. All rights reserved.
Pages: 1-20
Number of pages: 20
Related papers
50 in total
  • [1] A sliding window algorithm for mining frequent itemsets on data stream
    Liu, Junqiang
    Li, Xiurong
    [J]. DCABES 2006 PROCEEDINGS, VOLS 1 AND 2, 2006, : 637 - 639
  • [2] Fuzzy Frequent Pattern Mining Algorithm Based on Weighted Sliding Window and Type-2 Fuzzy Sets over Medical Data Stream
    Chen, Jing
    Li, Peng
    Fang, Weiqing
    Zhou, Ning
    Yin, Yue
    Zheng, Hui
    Xu, He
    Wang, Ruchuan
    [J]. WIRELESS COMMUNICATIONS & MOBILE COMPUTING, 2021, 2021
  • [3] Finding heavy hitters over the sliding window of a weighted data stream
    Hung, Regant Y. S.
    Ting, H. F.
    [J]. LATIN 2008: THEORETICAL INFORMATICS, 2008, 4957 : 699 - 710
  • [4] Damped sliding based utility oriented pattern mining over stream data
    Kim, Heonho
    Yun, Unil
    Baek, Yoonji
    Kim, Hyunsoo
    Nam, Hyoju
    Lin, Jerry Chun-Wei
    Fournier-Viger, Philippe
    [J]. Knowledge-Based Systems, 2021, 213
  • [6] Online data stream mining of recent frequent itemsets based on sliding window model
    Ren, Jia-Dong
    Li, Ke
    [J]. PROCEEDINGS OF 2008 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2008, : 293 - 298
  • [7] A Mining Frequent Itemsets Algorithm in Stream Data Based on Sliding Time Decay Window
    Lu, Xin
    Jin, Shaonan
    Wang, Xun
    Yuan, Jiao
    Fu, Kun
    Yang, Ke
    [J]. AIPR 2020: 2020 3RD INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND PATTERN RECOGNITION, 2020, : 18 - 24
  • [8] A Variable Size Sliding Window Based Frequent Itemsets Mining Algorithm in Data Stream
    Li, Haiqing
    Wang, Lang
    [J]. MATERIALS SCIENCE, ENERGY TECHNOLOGY, AND POWER ENGINEERING I, 2017, 1839
  • [9] Research on Data stream Mining Algorithm for Frequent Itemsets Based on Sliding Window Model
    Wang, Hongmei
    Li, Fentian
    Tang, Dongkai
    Wang, Zeru
    [J]. 2017 IEEE 2ND INTERNATIONAL CONFERENCE ON BIG DATA ANALYSIS (ICBDA), 2017, : 264 - 268
  • [10] Data Stream Frequent Closed Item Sets Mining Based on Fast Sliding Window
    Chen Zhihua
    Luo Jun
    [J]. MECHANICAL AND ELECTRONICS ENGINEERING III, PTS 1-5, 2012, 130-134 : 3702 - 3707