ODscan: On-demand database scan approach to mining large itemsets

被引:0
|
作者
Alsabbagh, JR [1 ]
机构
[1] Grand Valley State Univ, Dept Comp Sci & Informat Syst, Allendale, MI 49401 USA
关键词
data mining; frequent patterns; large itemsets; association rules; market basket analysis; Apriori;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The discovery of frequently occurring patterns in large databases is commonly referred to as the market basket analysis problem Solving the problem requires high PO overhead (to perform several scans of the database), large memory resources (to tentatively store candidate patterns until they are actually counted through a database scan), and intensive computatons (to perform subset testing among patterns). Most commercial implementations and published algorithms for solving the problem are based in one way or another on the Apriori principle. We propose an algorithm, ODscan, which is also Apriori-based but is unique in that it attempts to balance the cost of I/O and the memory requirements during the derivation process. In principle, ODscan requires only two scans of the database. Additional scans may be needed only when user-controlled memory requirements are exceeded In that case, a scan results in freeing some of the memory before resuming the process of derivation.
引用
收藏
页码:154 / 159
页数:6
相关论文
共 50 条
  • [41] Incremental mining large itemsets with constraints in dynamic databases
    Li, Naiqian
    Shen, Junyi
    Hsi-An Chiao Tung Ta Hsueh/Journal of Xi'an Jiaotong University, 2003, 37 (04): : 359 - 363
  • [42] Mining Frequent Gradual Itemsets from Large Databases
    Di-Jorio, Lisa
    Laurent, Anne
    Teisseire, Maguelonne
    ADVANCES IN INTELLIGENT DATA ANALYSIS VIII, PROCEEDINGS, 2009, 5772 : 297 - +
  • [43] An efficient and flexible algorithm for online mining of large itemsets
    Jea, KF
    Chang, MY
    Lin, KC
    INFORMATION PROCESSING LETTERS, 2004, 92 (06) : 311 - 316
  • [44] Mining frequent itemsets in large data warehouses: A novel approach proposed for sparse data sets
    Fakhrahmad, S. M.
    Jahromi, M. Zolghadri
    Sadreddini, M. H.
    INTELLIGENT DATA ENGINEERING AND AUTOMATED LEARNING - IDEAL 2007, 2007, 4881 : 517 - +
  • [45] Enabling On-Demand Database Computing with MIT SuperCloud Database Management System
    Prout, Andrew
    Kepner, Jeremy
    Michaleas, Peter
    Arcand, William
    Bestor, David
    Bergeron, Bill
    Byun, Chansup
    Edwards, Lauren
    Gadepally, Vijay
    Hubbell, Matthew
    Mullen, Julie
    Rosa, Antonio
    Yee, Charles
    Reuther, Albert
    2015 IEEE HIGH PERFORMANCE EXTREME COMPUTING CONFERENCE (HPEC), 2015,
  • [46] Association Rule Mining: A Graph Based Approach for Mining Frequent Itemsets
    Tiwari, Vivek
    Tiwari, Vipin
    Gupta, Shailendra
    Tiwari, Renu
    2010 INTERNATIONAL CONFERENCE ON NETWORKING AND INFORMATION TECHNOLOGY (ICNIT 2010), 2010, : 309 - 313
  • [47] Fast algorithm for mining global frequent itemsets based on distributed database
    He, Bo
    Wang, Yue
    Yang, Wu
    Chen, Yuan
    ROUGH SETS AND KNOWLEDGE TECHNOLOGY, PROCEEDINGS, 2006, 4062 : 415 - 420
  • [48] Mining weighted-frequent-regular itemsets from transactional database
    Klangwisan, Kittipa
    Amphawan, Komate
    2017 9TH INTERNATIONAL CONFERENCE ON KNOWLEDGE AND SMART TECHNOLOGY (KST), 2017, : 66 - 71
  • [49] Analysis on High Utility Infrequent ItemSets Mining Over Transactional Database
    Shrivastava, Sunidhi
    Johari, Punit Kumar
    2016 IEEE INTERNATIONAL CONFERENCE ON RECENT TRENDS IN ELECTRONICS, INFORMATION & COMMUNICATION TECHNOLOGY (RTEICT), 2016, : 897 - 902
  • [50] SS-FIM: Single Scan for Frequent Itemsets Mining in Transactional Databases
    Djenouri, Youcef
    Comuzzi, Marco
    Djenouri, Djamel
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2017, PT II, 2017, 10235 : 644 - 654