A new framework for mining weighted periodic patterns in time series databases

Cited by: 52
Authors
Chanda, Ashis Kumar [1]
Ahmed, Chowdhury Farhan [1, 2]
Samiullah, Md [1 ]
Leung, Carson K. [3 ]
Affiliations
[1] Univ Dhaka, Dept Comp Sci & Engn, Dhaka, Bangladesh
[2] Univ Strasbourg, ICube Lab, Strasbourg, France
[3] Univ Manitoba, Dept Comp Sci, Winnipeg, MB, Canada
Keywords
Data mining; Time series databases; Periodic pattern; Weighted pattern; Suffix tree; Flexible pattern; Discovery
DOI
10.1016/j.eswa.2017.02.028
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Mining periodic patterns in time series databases is a challenging research task that plays a significant role in decision making in real-life applications. Many algorithms exist for mining periodic patterns in time series, but they treat all patterns as equally important. However, in real-life applications such as market basket analysis, gene analysis, and network fault experiments, different types of items carry different levels of importance. Moreover, existing algorithms generate a huge number of periodic patterns in dense databases or at low minimum support thresholds, and most of these patterns are not important enough to inform decision making. Hence, a pruning mechanism is essential to eliminate these unimportant patterns. To mine only important patterns in minimal time, we propose a weight-based framework that assigns different weights to different items. We also develop a novel algorithm for time series databases, WPPM (Weighted Periodic Pattern Mining Algorithm), built on a suffix trie structure. To the best of our knowledge, ours is the first proposal that can mine three types of weighted periodic patterns (single, partial, and full) in a single run. A pruning method that follows the downward closure property with respect to the maximum weight in a given database is introduced to discard unimportant patterns. The proposed algorithm gives users flexibility by allowing intermediate unimportant patterns to be skipped and different starting positions to be set in the time series sequence. The performance of the proposed algorithm is evaluated on real-life datasets by varying different parameters. In addition, a comparison with an existing algorithm shows that the proposed approach outperforms it in terms of running time and pattern generation. (C) 2017 Elsevier Ltd. All rights reserved.
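The weighting and pruning idea in the abstract can be illustrated with a small sketch. This is not the WPPM algorithm: it uses no suffix trie, handles only single-symbol patterns, and the function names, the confidence measure (fraction of period-aligned positions that hold the symbol), and the threshold `min_wsup` are assumptions made for illustration. It shows only how a maximum-weight upper bound can discard a candidate before its own weight is looked up.

```python
def periodic_confidence(series, symbol, period, start):
    """Fraction of positions start, start+period, ... that hold `symbol`."""
    positions = range(start, len(series), period)
    if len(positions) == 0:
        return 0.0
    hits = sum(1 for i in positions if series[i] == symbol)
    return hits / len(positions)


def weighted_single_patterns(series, weights, period, min_wsup):
    """Return {(symbol, start): weighted support} for single-symbol patterns.

    Hypothetical definition: weighted support = confidence * item weight.
    """
    max_w = max(weights.values())  # largest weight in the database
    results = {}
    for start in range(period):
        for symbol in sorted(set(series[start::period])):
            conf = periodic_confidence(series, symbol, period, start)
            # Upper-bound pruning: if even the maximum weight cannot lift
            # this candidate to the threshold, skip it.
            if conf * max_w < min_wsup:
                continue
            wsup = conf * weights[symbol]
            if wsup >= min_wsup:
                results[(symbol, start)] = wsup
    return results


# Illustrative data only: a toy sequence and made-up item weights.
series = "abcabcabdabc"
weights = {"a": 0.2, "b": 0.9, "c": 0.8, "d": 1.0}
patterns = weighted_single_patterns(series, weights, period=3, min_wsup=0.5)
```

In this toy run, `a` is perfectly periodic but is dropped because its weight is low, while `d` is discarded by the maximum-weight bound despite having the highest weight, because it occurs too rarely at its position.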
Pages: 207-224
Number of pages: 18