Mining sequential patterns from probabilistic databases

被引:0
|
作者
Muhammad Muzammal
Rajeev Raman
机构
[1] Bahria University,Department of Computer Science
[2] University of Leicester,Department of Computer Science
来源
关键词
Mining uncertain data; Sequential pattern mining; Probabilistic databases;
D O I
暂无
中图分类号
学科分类号
摘要
This paper considers the problem of sequential pattern mining (SPM) in probabilistic databases. Specifically, we consider SPM in situations where there is uncertainty in associating an event with a source, model this kind of uncertainty in the probabilistic database framework and consider the problem of enumerating all sequences whose expected support is sufficiently large. We give an algorithm based on dynamic programming to compute the expected support of a sequential pattern. Next, we propose three algorithms for mining sequential patterns from probabilistic databases. The first two algorithms are based on the candidate generation framework—one each based on a breadth-first (similar to GSP) and a depth-first (similar to SPAM) exploration of the search space. The third one is based on the pattern-growth framework (similar to PrefixSpan). We propose optimizations that mitigate the effects of the expensive dynamic programming computation step. We give an empirical evaluation of the probabilistic SPM algorithms and the optimizations and demonstrate the scalability of the algorithms in terms of CPU time and the memory usage. We also demonstrate the effectiveness of the probabilistic SPM framework in extracting meaningful sequences in the presence of noise.
引用
收藏
页码:325 / 358
页数:33
相关论文
共 50 条
  • [11] A fast algorithm for mining sequential patterns from large databases
    Ning Chen
    An Chen
    Longxiang Zhou
    Lu Liu
    Journal of Computer Science and Technology, 2001, 16 : 359 - 370
  • [12] Mining Closed Sequential Patterns in Progressive Databases
    Subramanyam, R. B. V.
    Rao, A. Suresh
    Karnati, Ramesh
    Suvvari, Somaraju
    Somayajulu, D. V. L. N.
    JOURNAL OF INFORMATION & KNOWLEDGE MANAGEMENT, 2013, 12 (03)
  • [13] Mining negative sequential patterns in transaction databases
    Ouyang, Wei-Min
    Huang, Qin-Hua
    PROCEEDINGS OF 2007 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2007, : 830 - +
  • [14] Incremental mining of sequential patterns in large databases
    Masseglia, F
    Poncelet, P
    Teisseire, M
    DATA & KNOWLEDGE ENGINEERING, 2003, 46 (01) : 97 - 121
  • [15] Interactive Mining of Probabilistic Frequent Patterns in Uncertain Databases
    Lin, Ming-Yen
    Fu, Cheng-Tai
    Hsueh, Sue-Chen
    INTERNATIONAL JOURNAL OF UNCERTAINTY FUZZINESS AND KNOWLEDGE-BASED SYSTEMS, 2022, 30 (02) : 263 - 283
  • [16] The MineSP operator for mining sequential patterns in inductive databases
    Benitez-Guerrero, Edgard
    Hernandez-Lopez, Alma-Rosa
    MICAI 2006: ADVANCES IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2006, 4293 : 684 - +
  • [17] Mining sequential patterns across multiple sequence databases
    Peng, Wen-Chih
    Liao, Zhung-Xun
    DATA & KNOWLEDGE ENGINEERING, 2009, 68 (10) : 1014 - 1033
  • [18] Mining Rare Sequential Patterns in Large Transaction Databases
    Ouyang, Weimin
    PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND ELECTRONIC TECHNOLOGY, 2016, 48 : 159 - 162
  • [19] Mining Weighted a Closed Sequential Patterns in Large Databases
    Ren, Jia-Dong
    Yang, Jing
    Li, Yan
    FIFTH INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY, VOL 5, PROCEEDINGS, 2008, : 640 - 644
  • [20] Mining weighted sequential patterns in incremental uncertain databases
    Roy, Kashob Kumar
    Moon, Md Hasibul Haque
    Rahman, Md Mahmudur
    Ahmed, Chowdhury Farhan
    Leung, Carson Kai-Sang
    INFORMATION SCIENCES, 2022, 582 : 865 - 896