A sampling-based method for mining frequent patterns from databases

Cited by: 0
Authors
Chen, YL [1]
Ho, CY [1]
Affiliations
[1] Natl Cent Univ, Dept Informat Management, Chungli 320, Taiwan
Keywords
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Mining frequent item sets (frequent patterns) in transaction databases is a well-known problem in data mining research. This work proposes a sampling-based method for finding frequent patterns. The method has three phases. In the first phase, we draw a small sample of the data to estimate the set of frequent patterns, denoted F-S. The second phase computes the actual supports of the patterns in F-S and identifies a subset of those patterns that needs further examination in the next phase. Finally, the third phase explores this subset and finds all missing frequent patterns. The empirical results show that our algorithm is efficient, about two to three times faster than the well-known FP-growth algorithm.
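The three-phase idea in the abstract can be illustrated with a minimal Python sketch. The abstract does not say how phase two selects the patterns to re-examine or how phase three explores them, so this sketch substitutes a plain Apriori-style level-wise search that reuses the supports already counted in phase two; it is not the authors' algorithm. All names (`count_supports`, `levelwise`, `sample_then_verify`) and the `slack` parameter, which lowers the threshold on the sample, are hypothetical.

```python
import random
from itertools import combinations


def count_supports(transactions, candidates):
    """Count how many transactions contain each candidate itemset."""
    counts = {c: 0 for c in candidates}
    for t in transactions:
        for c in counts:
            if c <= t:
                counts[c] += 1
    return counts


def levelwise(transactions, min_count, known=None):
    """Apriori-style search; `known` holds supports counted earlier."""
    known = known or {}
    items = sorted({i for t in transactions for i in t})
    level = [frozenset([i]) for i in items]
    frequent = {}
    while level:
        # Only scan the data for candidates whose support is not yet known.
        unknown = [c for c in level if c not in known]
        counts = count_supports(transactions, unknown)
        counts.update({c: known[c] for c in level if c in known})
        current = {c: n for c, n in counts.items() if n >= min_count}
        frequent.update(current)
        # Join frequent k-itemsets into (k+1)-candidates, prune by subsets.
        keys = list(current)
        size = len(keys[0]) + 1 if keys else 0
        joined = {a | b for a, b in combinations(keys, 2) if len(a | b) == size}
        level = [c for c in joined
                 if all(frozenset(s) in current
                        for s in combinations(c, size - 1))]
    return frequent


def sample_then_verify(transactions, min_support, sample_size,
                       slack=0.8, seed=0):
    """Sketch of sample, verify, then complete (stand-in for phase 3)."""
    transactions = [frozenset(t) for t in transactions]
    rng = random.Random(seed)
    sample = rng.sample(transactions, min(sample_size, len(transactions)))
    # Phase 1: estimate F-S on the sample with a lowered threshold (assumption).
    f_s = levelwise(sample, slack * min_support * len(sample))
    # Phase 2: compute the actual supports of F-S on the full database.
    exact = count_supports(transactions, f_s)
    # Phase 3 (stand-in): level-wise search that reuses the phase-2 counts,
    # so only patterns missing from F-S need a new scan.
    return levelwise(transactions, min_support * len(transactions), known=exact)
```

Because the stand-in third phase is a complete level-wise search, the result does not depend on which transactions were sampled; a better sample only reduces how many patterns still need counting.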
Pages: 536-545
Page count: 10