Summarizing Probabilistic Frequent Patterns: A Fast Approach

被引:0
|
作者
Liu, Chunyang [1 ]
Chen, Ling [1 ]
Zhang, Chengqi [1 ]
机构
[1] Univ Technol Sydney, QCIS, Sydney, NSW, Australia
基金
澳大利亚研究理事会;
关键词
Pattern Summarization; Uncertain Data;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Mining probabilistic frequent patterns from uncertain data has received a great deal of attention in recent years due to the wide applications. However, probabilistic frequent pattern mining suffers from the problem that an exponential number of result patterns are generated, which seriously hinders further evaluation and analysis. In this paper, we focus on the problem of mining probabilistic representative frequent patterns (P-RFP), which is the minimal set of patterns with adequately high probability to represent all frequent patterns. Observing the bottleneck in checking whether a pattern can probabilistically represent another, which involves the computation of a joint probability of the supports of two patterns, we introduce a novel approximation of the joint probability with both theoretical and empirical proofs. Based on the approximation, we propose an Approximate P-RFP Mining (APM) algorithm, which effectively and efficiently compresses the set of probabilistic frequent patterns. To our knowledge, this is the first attempt to analyze the relationship between two probabilistic frequent patterns through an approximate approach. Our experiments on both synthetic and real-world datasets demonstrate that the APM algorithm accelerates P-RFP mining dramatically, orders of magnitudes faster than an exact solution. Moreover, the error rate of APM is guaranteed to be very small when the database contains hundreds transactions, which further affirms APM is a practical solution for summarizing probabilistic frequent patterns.
引用
收藏
页码:527 / 535
页数:9
相关论文
共 50 条
  • [31] A Fast Approach for Up-Scaling Frequent Itemsets
    Chen, Runzi
    Zhao, Shuliang
    Liu, Mengmeng
    IEEE ACCESS, 2020, 8 : 97141 - 97151
  • [32] Effective algorithms for vertical mining probabilistic frequent patterns in uncertain mobile environments
    Yu, Xiaomei
    Wang, Hong
    Zheng, Xiangwei
    Wang, Yilei
    INTERNATIONAL JOURNAL OF AD HOC AND UBIQUITOUS COMPUTING, 2016, 23 (3-4) : 137 - 151
  • [33] Syllabification with Frequent Sequence Patterns A Language Independent Approach
    Bona, Adrian
    Lemnaru, Camelia
    Potolea, Rodica
    KDIR: PROCEEDINGS OF THE 8TH INTERNATIONAL JOINT CONFERENCE ON KNOWLEDGE DISCOVERY, KNOWLEDGE ENGINEERING AND KNOWLEDGE MANAGEMENT - VOL. 1, 2016, : 352 - 359
  • [34] An Approach to Mine Significant Frequent Patterns by Quantity Attribute
    Rathod, Arti
    Dhabariya, Ajaysingh
    Thacker, Chintan
    2014 FOURTH INTERNATIONAL CONFERENCE ON COMMUNICATION SYSTEMS AND NETWORK TECHNOLOGIES (CSNT), 2014, : 414 - 418
  • [35] An Efficient Approach for Updating the Structure for Mining Frequent Patterns
    Yen, Show-Jane
    Lee, Yue-Shi
    Gu, Jia-Yuan
    2012 IEEE INTERNATIONAL CONFERENCE ON INDUSTRIAL ENGINEERING AND ENGINEERING MANAGEMENT (IEEM), 2012, : 879 - 883
  • [36] Mining Frequent Patterns without Candidate Generation: A Frequent-Pattern Tree Approach
    Jiawei Han
    Jian Pei
    Yiwen Yin
    Runying Mao
    Data Mining and Knowledge Discovery, 2004, 8 : 53 - 87
  • [37] Mining frequent patterns without candidate generation: A frequent-pattern tree approach
    Han, JW
    Pei, J
    Yin, YW
    Mao, RY
    DATA MINING AND KNOWLEDGE DISCOVERY, 2004, 8 (01) : 53 - 87
  • [38] Balancing Tree Size and Accuracy in Fast Mining of Uncertain Frequent Patterns
    Leung, Carson Kai-Sang
    MacKinnon, Richard Kyle
    BIG DATA ANALYTICS AND KNOWLEDGE DISCOVERY, 2015, 9263 : 57 - 69
  • [39] A Distributed Method for Fast Mining Frequent Patterns From Big Data
    Huang, Peng-Yu
    Cheng, Wan-Shu
    Chen, Ju-Chin
    Chung, Wen-Yu
    Chen, Young-Lin
    Lin, Kawuu W.
    IEEE ACCESS, 2021, 9 : 135144 - 135159
  • [40] Fast algorithms for mining generalized frequent patterns of generalized association rules
    Sriphaew, K
    Theeramunkong, T
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2004, E87D (03): : 761 - 770