ODscan: On-demand database scan approach to mining large itemsets

被引:0
|
作者
Alsabbagh, JR [1 ]
机构
[1] Grand Valley State Univ, Dept Comp Sci & Informat Syst, Allendale, MI 49401 USA
关键词
data mining; frequent patterns; large itemsets; association rules; market basket analysis; Apriori;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The discovery of frequently occurring patterns in large databases is commonly referred to as the market basket analysis problem Solving the problem requires high PO overhead (to perform several scans of the database), large memory resources (to tentatively store candidate patterns until they are actually counted through a database scan), and intensive computatons (to perform subset testing among patterns). Most commercial implementations and published algorithms for solving the problem are based in one way or another on the Apriori principle. We propose an algorithm, ODscan, which is also Apriori-based but is unique in that it attempts to balance the cost of I/O and the memory requirements during the derivation process. In principle, ODscan requires only two scans of the database. Additional scans may be needed only when user-controlled memory requirements are exceeded In that case, a scan results in freeing some of the memory before resuming the process of derivation.
引用
收藏
页码:154 / 159
页数:6
相关论文
共 50 条
  • [21] SEQUENTIAL APPROACH FOR MINING OF TEMPORAL ITEMSETS
    Radhakrishna, Vangipuram
    Aljawarneh, Shadi
    Cheruvu, Aravind
    ICEMIS'18: PROCEEDINGS OF THE FOURTH INTERNATIONAL CONFERENCE ON ENGINEERING AND MIS, 2018,
  • [22] Effective on-Demand Mining of Structural Databases
    Pravda, Lukas
    Sehnal, David
    Varekova, Radka Svobodova
    Koca, Jaroslav
    BIOPHYSICAL JOURNAL, 2017, 112 (03) : 348A - 348A
  • [23] Mining frequent closed itemsets for large data
    Fu, HG
    Nguifo, EM
    PROCEEDINGS OF THE 2004 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA'04), 2004, : 328 - 335
  • [24] An Approach for Mining Similar Temporal Association Patterns in Single Database Scan
    Radhakrishna, Vangipuram
    Kumar, P. V.
    Janaki, V.
    PROCEEDINGS OF FIRST INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGY FOR INTELLIGENT SYSTEMS: VOL 2, 2016, 51 : 607 - 617
  • [25] Reducing Database Scan in Maintaining Erasable Itemsets from Product Deletion
    Hong, Tzung-Pei
    Li, Chia-Che
    Wang, Shyue-Liang
    Lin, Jerry Chun-Wei
    2018 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2018, : 2627 - 2632
  • [26] Mining high utility itemsets for transaction deletion in a dynamic database
    Lin, Chun-Wei
    Lan, Guo-Cheng
    Hong, Tzung-Pei
    INTELLIGENT DATA ANALYSIS, 2015, 19 (01) : 43 - 55
  • [27] Research on frequent itemsets mining algorithm based on relational database
    Wang, Jingyang
    Wang, Huiyong
    Zhang, Dongwen
    Zhou, Wanzhen
    Zhang, Pengpeng
    Journal of Software, 2013, 8 (08) : 1843 - 1850
  • [28] Fast mining of maximum frequent itemsets in distributed multimedia database
    He, Bo
    Tu, Peng
    SECOND WORKSHOP ON DIGITAL MEDIA AND ITS APPLICATION IN MUSEUM & HERITAGE, PROCEEDINGS, 2007, : 359 - +
  • [29] Mining Rare Itemsets on Multi-Level Hierarchies Database
    Ouypornkochagorn, Taweechai
    INTERNATIONAL MULTICONFERENCE OF ENGINEERS AND COMPUTER SCIENTISTS, IMECS 2012, VOL I, 2012, : 487 - 491
  • [30] Mining itemsets - an approach to longitudinal and incremental association rule mining
    Mooney, C
    Roddick, JF
    DATA MINING III, 2002, 6 : 93 - 102