A sampling-based method for mining frequent patterns from databases

被引:0
|
作者
Chen, YL [1 ]
Ho, CY [1 ]
机构
[1] Natl Cent Univ, Dept Informat Management, Chungli 320, Taiwan
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Mining frequent item sets (frequent patterns) in transaction databases is a well known problem in data mining research. This work proposes a sampling-based method to find frequent patterns. The proposed method contains three phases. In the first phase, we draw a small sample of data to estimate the set of frequent patterns, denoted as F-S. The second phase computes the actual supports of the patterns in F-S as well as identifies a subset of patterns in F-S that need to be further examined in the next phase. Finally, the third phase explores this set and finds all missing frequent patterns. The empirical results show that our algorithm is efficient, about two or three times faster than the well-known FP-growth algorithm.
引用
收藏
页码:536 / 545
页数:10
相关论文
共 50 条
  • [31] Sampling-based stream mining for network risk management
    Yoshida, Kenichi
    NEW FRONTIERS IN ARTIFICIAL INTELLIGENCE, 2007, 4384 : 374 - 386
  • [32] Frequent Subgraph Mining in Graph Databases Based on MapReduce
    Wang, Kai
    Xie, Xia
    Jin, Hai
    Yuan, Pingpeng
    Lu, Feng
    Ke, Xijiang
    ADVANCES IN SERVICES COMPUTING, 2016, 10065 : 464 - 476
  • [33] Towards Efficient Mining of Periodic-Frequent Patterns in Transactional Databases
    Kiran, R. Uday
    Reddy, P. Krishna
    DATABASE AND EXPERT SYSTEMS APPLICATIONS, PT 2, 2010, 6262 : 194 - 208
  • [34] Memory Efficient Mining of Periodic-Frequent Patterns in Transactional Databases
    Anirudh, A.
    Kiran, R. Uday
    Reddy, P. Krishna
    Kitsuregawa, Masaru
    PROCEEDINGS OF 2016 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (SSCI), 2016,
  • [35] An efficient mining algorithm for maximal weighted frequent patterns in transactional databases
    Yun, Unil
    Shin, Hyeonil
    Ryu, Keun Ho
    Yoon, EunChul
    KNOWLEDGE-BASED SYSTEMS, 2012, 33 : 53 - 64
  • [36] Towards Efficiently Mining Frequent Interval-Based Sequential Patterns in Time Series Databases
    Phan Thi Bao Tran
    Vo Thi Ngoc Chau
    Duong Tuan Anh
    MULTI-DISCIPLINARY TRENDS IN ARTIFICIAL INTELLIGENCE, MIWAI 2015, 2015, 9426 : 125 - 136
  • [37] An Efficient Algorithm for Mining Maximal Frequent Sequential Patterns in Large Databases
    Su, Qiu-bin
    Lu, Lu
    Cheng, Bin
    2018 INTERNATIONAL CONFERENCE ON COMMUNICATION, NETWORK AND ARTIFICIAL INTELLIGENCE (CNAI 2018), 2018, : 404 - 410
  • [38] MANIACS: Approximate Mining of Frequent Subgraph Patterns through Sampling
    Preti, Giulia
    Morales, Gianmarco De Francisci
    Riondato, Matteo
    ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2023, 14 (03)
  • [39] From path tree to frequent patterns: A framework for mining frequent patterns
    Xu, YB
    Yu, JX
    Liu, GM
    Lu, HJ
    2002 IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2002, : 514 - 521
  • [40] Parallel mining of maximal frequent itemsets from databases
    Chung, SM
    Luo, C
    15TH IEEE INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2003, : 134 - 139