Mining Sequential Patterns from Probabilistic Databases

Authors
Muzammal, Muhammad [1]
Raman, Rajeev [1]
Affiliations
[1] Univ Leicester, Dept Comp Sci, Leicester LE1 7RH, Leics, England
Keywords
Mining Uncertain Data; Mining complex sequential data; Probabilistic Databases; Novel models and algorithms; FREQUENT ITEMSETS;
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104; 0812; 0835; 1405;
Abstract
We consider sequential pattern mining in situations where there is uncertainty about which source an event is associated with. We model this in the probabilistic database framework and consider the problem of enumerating all sequences whose expected support is sufficiently large. Unlike frequent itemset mining in probabilistic databases [C. Aggarwal et al., KDD'09; Chui et al., PAKDD'07; Chui and Kao, PAKDD'08], we use dynamic programming (DP) to compute the probability that a source supports a sequence, and show that this suffices to compute the expected support of a sequential pattern. Next, we embed this DP algorithm into candidate generate-and-test approaches, and explore the pattern lattice in both a breadth-first (similar to GSP) and a depth-first (similar to SPAM) manner. We propose optimizations for efficiently computing the frequent 1-sequences, for re-using previously computed results through incremental support computation, and for eliminating candidate sequences without computing their support via probabilistic pruning. Preliminary experiments show that our optimizations are effective in reducing the CPU cost.
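As a rough illustration of the DP idea in the abstract, the sketch below computes the probability that one source supports a pattern, then sums these probabilities over all sources to get the expected support (by linearity of expectation). It is a minimal sketch and not the authors' implementation. The data model is an assumption not stated in the abstract: each source is taken to have an ordered list of events, each event an itemset paired with the probability that it belongs to that source, and these memberships are treated as independent within a source. The names `support_probability` and `expected_support` are hypothetical.

```python
from typing import FrozenSet, List, Sequence, Tuple

# ASSUMED model: an event is (itemset, probability it belongs to this source).
Event = Tuple[FrozenSet[str], float]


def support_probability(pattern: Sequence[FrozenSet[str]],
                        p_sequence: Sequence[Event]) -> float:
    """Probability that `pattern` is a subsequence of the events of one source.

    prob[k] holds Pr[first k pattern elements are supported by the events
    processed so far]. For each event with membership probability c that
    contains pattern element k:
        prob[k] = (1 - c) * prob[k] + c * prob[k - 1]
    """
    q = len(pattern)
    prob: List[float] = [1.0] + [0.0] * q  # the empty pattern is always supported
    for items, c in p_sequence:
        # Iterate k downward so prob[k - 1] still refers to the previous event prefix.
        for k in range(q, 0, -1):
            if pattern[k - 1] <= items:  # pattern element is a subset of the event
                prob[k] = (1.0 - c) * prob[k] + c * prob[k - 1]
    return prob[q]


def expected_support(pattern: Sequence[FrozenSet[str]],
                     database: Sequence[Sequence[Event]]) -> float:
    """Sum of per-source support probabilities (linearity of expectation)."""
    return sum(support_probability(pattern, seq) for seq in database)


if __name__ == "__main__":
    a, b = frozenset({"a"}), frozenset({"b"})
    source_1 = [(a, 0.5), (b, 0.5)]
    source_2 = [(a, 0.5), (a, 0.5)]
    print(support_probability([a, b], source_1))
    print(expected_support([a], [source_1, source_2]))
```

Under these assumptions the table for one source costs time proportional to the pattern length times the number of events, which is what makes it practical to call inside a candidate generate-and-test loop.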
Pages: 210-221
Number of pages: 12