A sampling-based method for mining frequent patterns from databases

被引:0
|
作者
Chen, YL [1 ]
Ho, CY [1 ]
机构
[1] Natl Cent Univ, Dept Informat Management, Chungli 320, Taiwan
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Mining frequent item sets (frequent patterns) in transaction databases is a well known problem in data mining research. This work proposes a sampling-based method to find frequent patterns. The proposed method contains three phases. In the first phase, we draw a small sample of data to estimate the set of frequent patterns, denoted as F-S. The second phase computes the actual supports of the patterns in F-S as well as identifies a subset of patterns in F-S that need to be further examined in the next phase. Finally, the third phase explores this set and finds all missing frequent patterns. The empirical results show that our algorithm is efficient, about two or three times faster than the well-known FP-growth algorithm.
引用
收藏
页码:536 / 545
页数:10
相关论文
共 50 条
  • [21] An improved sampling-based DBSCAN for large spatial databases
    Borah, B
    Bhattacharyya, DK
    PROCEEDINGS OF INTERNATIONAL CONFERENCE ON INTELLIGENT SENSING AND INFORMATION PROCESSING, 2004, : 92 - 96
  • [22] Mining frequent trajectory patterns in spatial-temporal databases
    Lee, Anthony J. T.
    Chen, Yi-An
    Ip, Weng-Chong
    INFORMATION SCIENCES, 2009, 179 (13) : 2218 - 2231
  • [23] Mining Probabilistically Frequent Sequential Patterns in Large Uncertain Databases
    Zhao, Zhou
    Yan, Da
    Ng, Wilfred
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2014, 26 (05) : 1171 - 1184
  • [24] AN EFFICIENT ITEMSET REPRESENTATION FOR MINING FREQUENT PATTERNS IN TRANSACTIONAL DATABASES
    Tomovic, Savo
    Stanisic, Predrag
    COMPUTING AND INFORMATICS, 2018, 37 (04) : 894 - 914
  • [25] TidFP: Mining Frequent Patterns in Different Databases with Transaction ID
    Ezeife, C. I.
    Zhang, Dan
    DATA WAREHOUSING AND KNOWLEDGE DISCOVERY, PROCEEDINGS, 2009, 5691 : 125 - 137
  • [26] Probabilistic Frequent Itemset Mining Algorithm over Uncertain Databases with Sampling
    Li, Hai-Feng
    Zhang, Ning
    Zhang, Yue-Jin
    Wang, Yue
    FUZZY SYSTEMS AND DATA MINING II, 2016, 293 : 159 - 166
  • [27] Mining Frequent Itemsets from Multidimensional Databases
    Bay Vo
    Bac Le
    Nguyen, Thang N.
    INTELLIGENT INFORMATION AND DATABASE SYSTEMS, ACIIDS 2011, PT I, 2011, 6591 : 177 - 186
  • [28] Mining frequent weighted utility patterns with dynamic weighted items from quantitative databases
    Nguyen, Ham
    Le, Nguyen
    Bui, Huong
    Le, Tuong
    APPLIED INTELLIGENCE, 2023, 53 (16) : 19629 - 19646
  • [29] Mining frequent weighted utility patterns with dynamic weighted items from quantitative databases
    Ham Nguyen
    Nguyen Le
    Huong Bui
    Tuong Le
    Applied Intelligence, 2023, 53 : 19629 - 19646
  • [30] A formal treatment of the sampling-based approach to managing image databases
    Vu, K
    Hua, KA
    Lang, SD
    PROCEEDINGS OF THE 17TH INTERNATIONAL SYMPOSIUM ON COMPUTER AND INFORMATION SCIENCES, 2003, : 64 - 68