Erasable pattern mining based on tree structures with damped window over data streams

Cited by: 19
Authors
Baek, Yoonji [1]
Yun, Unil [1]
Kim, Heonho [1]
Nam, Hyoju [1]
Lee, Gangin [1]
Yoon, Eunchul [2]
Vo, Bay [3]
Lin, Jerry Chun-Wei [4]
Affiliations
[1] Sejong Univ, Dept Comp Engn, Seoul, South Korea
[2] Konkuk Univ, Dept Elect Engn, Seoul, South Korea
[3] Ho Chi Minh City Univ Technol HUTECH, Fac Informat Technol, Ho Chi Minh City, Vietnam
[4] Western Norway Univ Appl Sci, Dept Comp Sci Elect Engn & Math Sci, Bergen, Norway
Funding
National Research Foundation of Singapore;
Keywords
Erasable pattern mining; Damped window; Tree structures; Pruning technique; Stream mining; EFFICIENT ALGORITHM; FREQUENT PATTERNS; ITEMSETS; TIME; NETWORKS;
DOI
10.1016/j.engappai.2020.103735
Chinese Library Classification number
TP [Automation Technology, Computer Technology];
Discipline classification code
0812;
Abstract
Several pattern mining methods have been proposed to process dynamic data streams, because data generated in industrial fields accumulates continually. Discovering erasable patterns in such streams requires mining techniques designed for dynamic data. Previous erasable pattern mining approaches for dynamic data streams treat all data as equally important regardless of timestamp. However, in dynamic data streams, new data is relatively more significant than old data, and an erasable pattern mining approach should take this characteristic into account. For this reason, we propose an erasable pattern mining algorithm over dynamic data streams based on the damped window model. Because the proposed technique treats new data as more important than earlier data, it can find more useful erasable patterns. In addition, erasable pattern mining based on the damped window model is conducted efficiently by employing tree and table structures. In performance tests, we show that our pruning techniques efficiently remove unnecessary operations related to invalid erasable patterns from damped-window-based data streams. Performance evaluation results on real and synthetic datasets show that the proposed approach performs well in terms of execution time, pattern generation, and scalability compared with state-of-the-art algorithms.
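The damped window idea in the abstract can be illustrated with a small sketch. This is not the authors' tree- and table-based algorithm: it is a brute-force illustration under the usual erasable-pattern definition (a pattern is erasable when the total profit of the products containing any of its items stays at or below a threshold share of the overall profit). The function name `erasable_patterns`, the batch layout, and the parameters `decay`, `threshold`, and `max_len` are all assumptions made for this example.

```python
from itertools import combinations


def erasable_patterns(batches, decay, threshold, max_len=2):
    """Brute-force erasable patterns under a damped window (illustrative only).

    batches   : list of batches, oldest first; each batch is a list of
                (items, profit) pairs describing one product (assumed layout).
    decay     : factor in (0, 1]; a product's profit is multiplied by
                decay once for every batch that arrived after it.
    threshold : maximum share of the total damped profit a pattern may carry.
    """
    n = len(batches)
    weighted = []
    for idx, batch in enumerate(batches):
        age = n - 1 - idx  # number of batches that arrived after this one
        for items, profit in batch:
            weighted.append((frozenset(items), profit * decay ** age))

    total = sum(w for _, w in weighted)
    all_items = sorted(set().union(*(items for items, _ in weighted)))

    result = {}
    for k in range(1, max_len + 1):
        for pattern in combinations(all_items, k):
            # Gain: damped profit of every product that uses any item of the pattern.
            gain = sum(w for items, w in weighted if items & set(pattern))
            if gain <= threshold * total:
                result[pattern] = gain
    return result


# Hypothetical data: one old batch and one new batch.
batches = [
    [({"a", "b"}, 100)],
    [({"b", "c"}, 50), ({"c"}, 50)],
]
with_decay = erasable_patterns(batches, decay=0.5, threshold=0.5)
no_decay = erasable_patterns(batches, decay=1.0, threshold=0.5)
```

With `decay=1.0` every product keeps its full profit, which corresponds to treating all data as equally important; lowering `decay` shrinks the contribution of older batches, so the set of erasable patterns can change as the stream grows.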
Pages: 20
Related papers
50 in total
  • [31] Mining Recent Maximal Frequent Itemsets Over Data Streams with Sliding Window
    Cai, Saihua
    Hao, Shangbo
    Sun, Ruizhi
    Wu, Gang
    [J]. INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY, 2019, 16 (06) : 961 - 969
  • [32] Mining weighted frequent itemsets using window sliding over data streams
    Kim, Younghee
    Kim, Wonyoung
    Ryu, Joonsuk
    Kim, Ungmo
    [J]. ICCIT: 2009 FOURTH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCES AND CONVERGENCE INFORMATION TECHNOLOGY, VOLS 1 AND 2, 2009, : 708 - 713
  • [33] Efficient algorithm for frequent pattern mining over uncertain data streams
    Du, Congqiang
    Shao, Zengzhen
    [J]. Journal of Computational Information Systems, 2015, 11 (21): : 7799 - 7808
  • [34] A New Adaptive Algorithm for Frequent Pattern Mining over Data Streams
    Deypir, Mahmood
    Sadreddini, Mohammad Hadi
    [J]. 2011 1ST INTERNATIONAL ECONFERENCE ON COMPUTER AND KNOWLEDGE ENGINEERING (ICCKE), 2011, : 230 - 235
  • [35] Window specification over data streams
    Patroumpas, Kostas
    Sellis, Timos
    [J]. CURRENT TRENDS IN DATABASE TECHNOLOGY - EDBT 2006, 2006, 4254 : 445 - 464
  • [36] Granular Sketch Based Uncertain Data Streams Pattern Mining
    Chen, Jingyu
    Chen, Ping
    Sheng, Xian'gang
    [J]. INFORMATION COMPUTING AND APPLICATIONS, ICICA 2013, PT I, 2013, 391 : 488 - +
  • [37] MINING FREQUENT PATTERN BASED ON FADING FACTOR IN DATA STREAMS
    Ren, Jia-Dong
    He, Hui-Ling
    Hu, Chang-Zhen
    Xu, Li-Na
    Wang, Li-Bo
    [J]. PROCEEDINGS OF 2009 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-6, 2009, : 2250 - +
  • [38] Mining Weighted Rare Association Rules Using Sliding Window over Data Streams
    Ouyang, Weimin
    [J]. PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND ELECTRONIC TECHNOLOGY, 2016, 48 : 116 - 119
  • [39] Mining top-k frequent patterns over data streams sliding window
    Chen, Hui
    [J]. JOURNAL OF INTELLIGENT INFORMATION SYSTEMS, 2014, 42 (01) : 111 - 131
  • [40] Mining Approximate Frequent Itemsets over Data Streams Using Window Sliding Techniques
    Kim, Younghee
    Park, Eunkyoung
    Kim, Ungmo
    [J]. DATABASE THEORY AND APPLICATION, 2009, 64 : 49 - 56