ODscan: On-demand database scan approach to mining large itemsets

被引:0
|
作者
Alsabbagh, JR [1 ]
机构
[1] Grand Valley State Univ, Dept Comp Sci & Informat Syst, Allendale, MI 49401 USA
关键词
data mining; frequent patterns; large itemsets; association rules; market basket analysis; Apriori;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The discovery of frequently occurring patterns in large databases is commonly referred to as the market basket analysis problem Solving the problem requires high PO overhead (to perform several scans of the database), large memory resources (to tentatively store candidate patterns until they are actually counted through a database scan), and intensive computatons (to perform subset testing among patterns). Most commercial implementations and published algorithms for solving the problem are based in one way or another on the Apriori principle. We propose an algorithm, ODscan, which is also Apriori-based but is unique in that it attempts to balance the cost of I/O and the memory requirements during the derivation process. In principle, ODscan requires only two scans of the database. Additional scans may be needed only when user-controlled memory requirements are exceeded In that case, a scan results in freeing some of the memory before resuming the process of derivation.
引用
收藏
页码:154 / 159
页数:6
相关论文
共 50 条
  • [31] Privacy Preserving Frequent Itemsets Mining Based on Database Reconstruction
    Li, Shaoxin
    Mu, Nankun
    Liao, Xiaofeng
    2018 8TH INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE AND TECHNOLOGY (ICIST 2018), 2018, : 388 - 394
  • [32] Mining Emerging High Utility Itemsets over Streaming Database
    Hackman, Acquah
    Huang, Yu
    Yu, Philip S.
    Tseng, Vincent S.
    ADVANCED DATA MINING AND APPLICATIONS, ADMA 2019, 2019, 11888 : 3 - 16
  • [33] An Approach for Mining Concurrently Closed Itemsets and Generators
    Anh Tran
    Tin Truong
    Bac Le
    ADVANCED COMPUTATIONAL METHODS FOR KNOWLEDGE ENGINEERING, 2013, 479 : 355 - 366
  • [34] An adaptive approach to mining frequent itemsets efficiently
    Tseng, Fan-Chen
    EXPERT SYSTEMS WITH APPLICATIONS, 2012, 39 (18) : 13166 - 13172
  • [35] A Hybrid Solution of Mining Frequent Itemsets from Uncertain Database
    Yu, Xiaomei
    Wang, Hong
    Zheng, Xiangwei
    INTELLIGENT COMPUTING METHODOLOGIES, 2014, 8589 : 581 - 590
  • [36] A novel Approach for mining frequent itemsets: AprioriMin
    Essalmi, Houda
    El Far, Mohamed
    El Mohajir, Mohammed
    Chahhou, Mohamed
    2016 4TH IEEE INTERNATIONAL COLLOQUIUM ON INFORMATION SCIENCE AND TECHNOLOGY (CIST), 2016, : 286 - 289
  • [37] RxW: A scheduling approach for large-scale on-demand data broadcast
    Aksoy, D
    Franklin, M
    IEEE-ACM TRANSACTIONS ON NETWORKING, 1999, 7 (06) : 846 - 860
  • [38] An efficient approach for interactive mining of frequent itemsets
    Deng, ZH
    Li, X
    Tang, SW
    ADVANCES IN WEB-AGE INFORMATION MANAGEMENT, PROCEEDINGS, 2005, 3739 : 138 - 149
  • [39] An algorithm for discovering the frequent closed itemsets in a large database
    Singh, Ningthoujam Gourakishwar
    Singh, Sanasam Ranbir
    Mahanta, Anjana K.
    Prasad, Bhanu
    JOURNAL OF EXPERIMENTAL & THEORETICAL ARTIFICIAL INTELLIGENCE, 2006, 18 (04) : 481 - 499
  • [40] Lenses: An On-Demand Approach to ETL
    Yang, Ying
    Meneghetti, Niccolo
    Fehling, Ronny
    Liu, Zhen Hua
    Kennedy, Oliver
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2015, 8 (12): : 1578 - 1589