An efficient approach to mine flexible periodic patterns in time series databases

Cited by: 22
Authors
Chanda, Ashis Kumar [1]
Saha, Swapnil [1]
Nishi, Manziba Akanda [2]
Samiullah, Md. [1]
Ahmed, Chowdhury Farhan [3]
Affiliations
[1] Univ Dhaka, Dept Comp Sci & Engn, Dhaka, Bangladesh
[2] Bangladesh Open Univ, Dept Comp Sci & Engn, Dhaka, Bangladesh
[3] Univ Strasbourg, ICube Lab, Strasbourg, France
Keywords
Data mining; Time series databases; Periodic pattern; Suffix tree; Flexible patterns; Knowledge discovery; ONLINE;
DOI
10.1016/j.engappai.2015.04.014
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
Periodic pattern mining in time series databases is one of the most interesting data mining problems and arises frequently in many real-life applications. Some existing approaches find fixed-length periodic patterns using a suffix tree structure and are therefore unable to mine flexible patterns. One existing approach generates periodic patterns by skipping intermediate events, i.e., flexible patterns, using an Apriori-based sequential pattern mining approach. Since Apriori-based approaches suffer from huge candidate generation and a large percentage of false-pattern pruning, we propose an efficient algorithm, FPPM (Flexible Periodic Pattern Mining), that uses a suffix trie data structure. The proposed algorithm can capture more effective variable-length flexible periodic patterns by neglecting unimportant or undesired events and considering only the important events in an efficient way. To the best of our knowledge, ours is the first approach that simultaneously handles various starting positions throughout the sequences, flexibility among events in the mined patterns, and interactive tuning of period values on the go. A complexity analysis of the proposed approach, a comparison with existing approaches, and an analytical comparison on various issues have been performed. In addition, extensive experimental analyses are conducted to evaluate the performance of the proposed FPPM algorithm using real-life datasets. The proposed approach outperforms existing algorithms in terms of processing time, scalability, and quality of mined patterns. (C) 2015 Elsevier Ltd. All rights reserved.
Pages: 46-63
Number of pages: 18
Related papers
50 items in total
  • [31] Towards Efficient Discovery of Stable Periodic Patterns in Big Columnar Temporal Databases
    Dao, Hong N.
    Ravikumar, Penugonda
    Likitha, P.
    Raj, Bathala Venus Vikranth
    Kiran, R. Uday
    Watanobe, Yutaka
    Paik, Incheon
    [J]. ADVANCES AND TRENDS IN ARTIFICIAL INTELLIGENCE: THEORY AND PRACTICES IN ARTIFICIAL INTELLIGENCE, 2022, 13343 : 831 - 843
  • [32] An Efficient Approach for Mining Weighted Sequential Patterns in Dynamic Databases
    Ishita, Sabrina Zaman
    Noor, Faria
    Ahmed, Chowdhury Farhan
    [J]. ADVANCES IN DATA MINING: APPLICATIONS AND THEORETICAL ASPECTS (ICDM 2018), 2018, 10933 : 215 - 229
  • [33] An efficient approach to extracting approximate repeating patterns in music databases
    Liu, NH
    Wu, YH
    Chen, ALP
    [J]. DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, PROCEEDINGS, 2005, 3453 : 240 - 252
  • [34] Efficient subsequence matching in time series databases under time and amplitude transformations
    Argyros, T
    Ermopoulos, C
    [J]. THIRD IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2003, : 481 - 484
  • [35] Efficient discovery of unusual patterns in time series
    Lonardi S.
    Lin J.
    Keogh E.
    Chiu B.
    [J]. New Generation Computing, 2006, 25 (1) : 61 - 93
  • [36] Efficient discovery of unusual patterns in time series
    Lonardi, Stefano
    Lin, Jessica
    Keogh, Eamonn
    Chiu, Bill 'Yuan-chi'
    [J]. NEW GENERATION COMPUTING, 2007, 25 (01) : 61 - 93
  • [37] An efficient approach for mining periodic sequential access patterns
    Zhou, BY
    Hui, SC
    Fong, ACM
    [J]. PRICAI 2004: TRENDS IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2004, 3157 : 485 - 494
  • [38] Efficiently finding arbitrarily scaled patterns in massive time series Databases
    Keogh, E
    [J]. KNOWLEDGE DISCOVERY IN DATABASES: PKDD 2003, PROCEEDINGS, 2003, 2838 : 253 - 265
  • [39] An Efficient Interval-Based Approach to Mining Frequent Patterns in a Time Series Database
    Phan Thi Bao Tran
    Vo Thi Ngoc Chau
    Duong Tuan Anh
    [J]. MULTI-DISCIPLINARY TRENDS IN ARTIFICIAL INTELLIGENCE, 2013, 8271 : 211 - 222
  • [40] An application to mine time-series data from remote clinical trial databases
    Dong, FG
    Eslava, S
    Leon, M
    [J]. MEDINFO 2001: PROCEEDINGS OF THE 10TH WORLD CONGRESS ON MEDICAL INFORMATICS, PTS 1 AND 2, 2001, 84 : 573 - 573