Erasable pattern mining based on tree structures with damped window over data streams

被引:19
|
作者
Baek, Yoonji [1 ]
Yun, Unil [1 ]
Kim, Heonho [1 ]
Nam, Hyoju [1 ]
Lee, Gangin [1 ]
Yoon, Eunchul [2 ]
Vo, Bay [3 ]
Lin, Jerry Chun-Wei [4 ]
机构
[1] Sejong Univ, Dept Comp Engn, Seoul, South Korea
[2] Konkuk Univ, Dept Elect Engn, Seoul, South Korea
[3] Ho Chi Minh City Univ Technol HUTECH, Fac Informat Technol, Ho Chi Minh City, Vietnam
[4] Western Norway Univ Appl Sci, Dept Comp Sci Elect Engn & Math Sci, Bergen, Norway
基金
新加坡国家研究基金会;
关键词
Erasable pattern mining; Damped window; Tree structures; Pruning technique; Stream mining; EFFICIENT ALGORITHM; FREQUENT PATTERNS; ITEMSETS; TIME; NETWORKS;
D O I
10.1016/j.engappai.2020.103735
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Several pattern mining methods have been proposed to process dynamic data streams because the data generated in industrial fields is continually accumulated. Erasable pattern mining techniques for processing dynamic data streams are needed to discover erasable patterns from dynamic data streams. In previous erasable pattern mining approaches suggested for dynamic data streams, all data are considered to have the same importance regardless of its timestamp. However, dynamic data streams have the characteristic that the new data is relatively more significant than the old data. In erasable pattern mining, one of the desired techniques is an approach in consideration of such characteristic of data streams. For this reason, we propose an erasable pattern mining algorithm over dynamic data streams based on the damped window model. Since the suggested technique considers the new data more important than the previous data, it can find more useful erasable patterns. In addition, erasable pattern mining based on the damped window model is conducted efficiently by employing the tree and table structures. In performance test, we present that our pruning techniques remove unnecessary operations related to invalid erasable patterns efficiently from damped-window-based data streams. Performance evaluation results using real datasets and synthetic datasets show that the proposed approach has good performance with regard to as execution time, pattern generation, and scalability by comparing between the suggested technique and the state of the art algorithms.
引用
收藏
页数:20
相关论文
共 50 条
  • [1] Damped window based high average utility pattern mining over data streams
    Yun, Unil
    Kim, Donggyu
    Yoon, Eunchul
    Fujita, Hamido
    [J]. KNOWLEDGE-BASED SYSTEMS, 2018, 144 : 188 - 205
  • [2] Sliding window-based frequent pattern mining over data streams
    Tanbeer, Syed Khairuzzaman
    Ahmed, Chowdhury Farhan
    Jeong, Byeong-Soo
    Lee, Young-Koo
    [J]. INFORMATION SCIENCES, 2009, 179 (22) : 3843 - 3865
  • [3] Sliding window based weighted maximal frequent pattern mining over data streams
    Lee, Gangin
    Yun, Unil
    Ryu, Keun Ho
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2014, 41 (02) : 694 - 708
  • [4] Sliding window based weighted erasable stream pattern mining for stream data applications
    Yun, Unil
    Lee, Gangin
    [J]. FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2016, 59 : 1 - 20
  • [5] SPPC: a new tree structure for mining erasable patterns in data streams
    Le, Tuong
    Vo, Bay
    Fournier-Viger, Philippe
    Lee, Mi Young
    Baik, Sung Wook
    [J]. APPLIED INTELLIGENCE, 2019, 49 (02) : 478 - 495
  • [6] SPPC: a new tree structure for mining erasable patterns in data streams
    Tuong Le
    Bay Vo
    Philippe Fournier-Viger
    Mi Young Lee
    Sung Wook Baik
    [J]. Applied Intelligence, 2019, 49 : 478 - 495
  • [7] High utility pattern mining over data streams with sliding window technique
    Ryang, Heungmo
    Yun, Unil
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2016, 57 : 214 - 231
  • [8] A Variable Sliding Window Algorithm Based on Concept Drift for Frequent Pattern Mining Over Data Streams*
    Yin, Yue
    Li, Peng
    Chen, Jing
    [J]. 2022 IEEE 28TH INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED SYSTEMS, ICPADS, 2022, : 818 - 825
  • [9] An Efficient Algorithm for Sliding Window-Based Weighted Frequent Pattern Mining over Data Streams
    Ahmed, Chowdhury Farhan
    Tanbeer, Syed Khairuzzaman
    Jeong, Byeong-Soo
    Lee, Young-Koo
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2009, E92D (07): : 1369 - 1381
  • [10] Frequent pattern mining algorithm for uncertain data streams based on sliding window
    Yang, Junrui
    Yang, Cai
    Wei, Yanjun
    [J]. 2016 8TH INTERNATIONAL CONFERENCE ON INTELLIGENT HUMAN-MACHINE SYSTEMS AND CYBERNETICS (IHMSC), VOL. 2, 2016, : 265 - 268