Mining sequential patterns across multiple sequence databases

被引:19
|
作者
Peng, Wen-Chih [1 ]
Liao, Zhung-Xun [1 ]
机构
[1] Natl Chiao Tung Univ, Dept Comp Sci, Hsinchu, Taiwan
关键词
Data mining; Sequential pattern mining; Multi-domain sequential patterns; EFFICIENT ALGORITHM; PREFIXSPAN;
D O I
10.1016/j.datak.2009.04.009
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, given a set of sequence databases across multiple domains, we aim at mining multi-domain sequential patterns, where a multi-domain sequential pattern is a sequence of events whose occurrence time is within a pre-defined time window. We first: propose algorithm Naive in which multiple sequence databases are joined as one sequence database for utilizing traditional sequential pattern mining algorithms (e.g., PrefixSpan). Due to the nature of join operations, algorithm Naive is costly and is developed for comparison purposes. Thus, we propose two algorithms without any join operations for mining multidomain sequential patterns. Explicitly, algorithm IndividualMine derives sequential patterns in each domain and then iteratively combines sequential patterns among sequence databases of multiple domains to derive candidate multi-domain sequential patterns. However, not all sequential patterns mined in the sequence database of each domain are able to form multi-domain sequential patterns. To avoid the mining cost incurred in algorithm IndividualMine, algorithm PropagatedMine is developed. Algorithm PropagatedMine first performs one sequential pattern mining from one sequence database. In light of sequential patterns mined, algorithm PropagatedMine propagates sequential patterns mined to other sequence databases. Furthermore, sequential patterns mined are represented as a lattice structure for further reducing the number of sequential patterns to be propagated. In addition. we develop some mechanisms to allow some empty sets in multi-domain sequential patterns. Performance of the proposed algorithms is comparatively analyzed and sensitivity analysis is conducted. Experimental results show that by exploring propagation and lattice structures, algorithm PropagatedMine outperforms algorithm IndividualMine in terms of efficiency (i.e., the execution time). (C) 2009 Elsevier B.V. All rights reserved.
引用
收藏
页码:1014 / 1033
页数:20
相关论文
共 50 条
  • [1] Mining Integrated Sequential Patterns From Multiple Databases
    Ezeife, Christie, I
    Aravindan, Vignesh
    Chaturvedi, Ritu
    [J]. INTERNATIONAL JOURNAL OF DATA WAREHOUSING AND MINING, 2020, 16 (01) : 1 - 21
  • [2] Mining integrated sequential patterns from multiple databases
    Ezeife, Christie I.
    Aravindan, Vignesh
    Chaturvedi, Ritu
    [J]. International Journal of Data Warehousing and Mining, 2020, 16 (01): : 1 - 21
  • [3] A Novel Approach for Mining High-Utility Sequential Patterns in Sequence Databases
    Ahmed, Chowdhury Farhan
    Tanbeer, Syed Khairuzzaman
    Jeong, Byeong-Soo
    [J]. ETRI JOURNAL, 2010, 32 (05) : 676 - 686
  • [4] Mining Closed Sequential Patterns in Progressive Databases
    Subramanyam, R. B. V.
    Rao, A. Suresh
    Karnati, Ramesh
    Suvvari, Somaraju
    Somayajulu, D. V. L. N.
    [J]. JOURNAL OF INFORMATION & KNOWLEDGE MANAGEMENT, 2013, 12 (03)
  • [5] Mining sequential patterns from probabilistic databases
    Muzammal, Muhammad
    Raman, Rajeev
    [J]. KNOWLEDGE AND INFORMATION SYSTEMS, 2015, 44 (02) : 325 - 358
  • [6] Mining Sequential Patterns from Probabilistic Databases
    Muzammal, Muhammad
    Raman, Rajeev
    [J]. ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PT II: 15TH PACIFIC-ASIA CONFERENCE, PAKDD 2011, 2011, 6635 : 210 - 221
  • [7] Mining sequential patterns from probabilistic databases
    Muhammad Muzammal
    Rajeev Raman
    [J]. Knowledge and Information Systems, 2015, 44 : 325 - 358
  • [8] Incremental mining of sequential patterns in large databases
    Masseglia, F
    Poncelet, P
    Teisseire, M
    [J]. DATA & KNOWLEDGE ENGINEERING, 2003, 46 (01) : 97 - 121
  • [9] Mining negative sequential patterns in transaction databases
    Ouyang, Wei-Min
    Huang, Qin-Hua
    [J]. PROCEEDINGS OF 2007 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2007, : 830 - +
  • [10] Mining Fuzzy Sequential Patterns with Fuzzy Time-Intervals in Quantitative Sequence Databases
    Truong Duc Phuong
    Do Van Thanh
    Nguyen Duc Dung
    [J]. CYBERNETICS AND INFORMATION TECHNOLOGIES, 2018, 18 (02) : 3 - 19