Mining sequential patterns from probabilistic databases

被引:0
|
作者
Muhammad Muzammal
Rajeev Raman
机构
[1] Bahria University,Department of Computer Science
[2] University of Leicester,Department of Computer Science
来源
关键词
Mining uncertain data; Sequential pattern mining; Probabilistic databases;
D O I
暂无
中图分类号
学科分类号
摘要
This paper considers the problem of sequential pattern mining (SPM) in probabilistic databases. Specifically, we consider SPM in situations where there is uncertainty in associating an event with a source, model this kind of uncertainty in the probabilistic database framework and consider the problem of enumerating all sequences whose expected support is sufficiently large. We give an algorithm based on dynamic programming to compute the expected support of a sequential pattern. Next, we propose three algorithms for mining sequential patterns from probabilistic databases. The first two algorithms are based on the candidate generation framework—one each based on a breadth-first (similar to GSP) and a depth-first (similar to SPAM) exploration of the search space. The third one is based on the pattern-growth framework (similar to PrefixSpan). We propose optimizations that mitigate the effects of the expensive dynamic programming computation step. We give an empirical evaluation of the probabilistic SPM algorithms and the optimizations and demonstrate the scalability of the algorithms in terms of CPU time and the memory usage. We also demonstrate the effectiveness of the probabilistic SPM framework in extracting meaningful sequences in the presence of noise.
引用
收藏
页码:325 / 358
页数:33
相关论文
共 50 条
  • [21] BigSAM: Mining Interesting Patterns from Probabilistic Databases of Uncertain Big Data
    Jiang, Fan
    Leung, Carson Kai-Sang
    MacKinnon, Richard Kyle
    [J]. TRENDS AND APPLICATIONS IN KNOWLEDGE DISCOVERY AND DATA MINING, 2014, 8643 : 780 - 792
  • [22] Mining of high utility-probability sequential patterns from uncertain databases
    Zhang, Binbin
    Lin, Jerry Chun-Wei
    Fournier-Viger, Philippe
    Lie, Ting
    [J]. PLOS ONE, 2017, 12 (07):
  • [23] Incremental Mining of High Utility Sequential Patterns in Incremental Databases
    Wang, Jun-Zhe
    Huang, Jiun-Long
    [J]. CIKM'16: PROCEEDINGS OF THE 2016 ACM CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, 2016, : 2341 - 2346
  • [24] Mining High-Utility Sequential Patterns in Uncertain Databases
    Lin, Jerry Chun-Wei
    Srivastava, Gautam
    Li, Yuanfa
    Hong, Tzung-Pei
    Wang, Shyue-Liang
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2020, : 5373 - 5380
  • [25] An Efficient Approach for Mining Weighted Sequential Patterns in Dynamic Databases
    Ishita, Sabrina Zaman
    Noor, Faria
    Ahmed, Chowdhury Farhan
    [J]. ADVANCES IN DATA MINING: APPLICATIONS AND THEORETICAL ASPECTS (ICDM 2018), 2018, 10933 : 215 - 229
  • [26] Mining Probabilistically Frequent Sequential Patterns in Large Uncertain Databases
    Zhao, Zhou
    Yan, Da
    Ng, Wilfred
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2014, 26 (05) : 1171 - 1184
  • [27] Mining Time-Interval Sequential Patterns with High Utility from Transaction Databases
    Wang, Wen-Yen
    Huang, Anna Y. -Q.
    [J]. JOURNAL OF ADVANCED COMPUTATIONAL INTELLIGENCE AND INTELLIGENT INFORMATICS, 2016, 20 (06) : 1018 - 1026
  • [28] Efficient Mining of High Average-Utility Sequential Patterns from Uncertain Databases
    Lin, Jerry Chun-Wei
    Wu, Jimmy Ming-Tai
    Fournier-Viger, Philippe
    Hong, Tzung-Pei
    Li, Ting
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC), 2019, : 1989 - 1994
  • [29] Mining typical patterns from databases
    Hu, Hui-Ling
    Chen, Yen-Liang
    [J]. INFORMATION SCIENCES, 2008, 178 (19) : 3683 - 3696
  • [30] Mining direct and indirect fuzzy sequential patterns in large transaction databases
    Ouyang, Weimin
    Huang, Qinhua
    [J]. ADVANCED INTELLIGENT COMPUTING THEORIES AND APPLICATIONS: WITH ASPECTS OF CONTEMPORARY INTELLIGENT COMPUTING TECHNIQUES, 2007, 2 : 180 - +