ODscan: On-demand database scan approach to mining large itemsets

被引:0
|
作者
Alsabbagh, JR [1 ]
机构
[1] Grand Valley State Univ, Dept Comp Sci & Informat Syst, Allendale, MI 49401 USA
关键词
data mining; frequent patterns; large itemsets; association rules; market basket analysis; Apriori;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The discovery of frequently occurring patterns in large databases is commonly referred to as the market basket analysis problem Solving the problem requires high PO overhead (to perform several scans of the database), large memory resources (to tentatively store candidate patterns until they are actually counted through a database scan), and intensive computatons (to perform subset testing among patterns). Most commercial implementations and published algorithms for solving the problem are based in one way or another on the Apriori principle. We propose an algorithm, ODscan, which is also Apriori-based but is unique in that it attempts to balance the cost of I/O and the memory requirements during the derivation process. In principle, ODscan requires only two scans of the database. Additional scans may be needed only when user-controlled memory requirements are exceeded In that case, a scan results in freeing some of the memory before resuming the process of derivation.
引用
收藏
页码:154 / 159
页数:6
相关论文
共 50 条
  • [1] An Efficient Three-Scan Approach for Mining High Utility Itemsets
    Lan, Guo-Cheng
    Hong, Tzung-Pei
    Tseng, Vincent S.
    PROCEEDINGS OF THE SEVENTEENTH INTERNATIONAL SYMPOSIUM ON ARTIFICIAL LIFE AND ROBOTICS (AROB 17TH '12), 2012, : 414 - 417
  • [2] Mining Frequent Itemsets in Evidential Database
    Samet, Ahmed
    Lefevre, Eric
    Ben Yahia, Sadok
    KNOWLEDGE AND SYSTEMS ENGINEERING (KSE 2013), VOL 2, 2014, 245 : 377 - 388
  • [3] A Novel Dynamic Data Mining Approach Based on Large Itemsets
    Chen, Shiqing
    Tang, Zhihang
    SEVENTH WUHAN INTERNATIONAL CONFERENCE ON E-BUSINESS, VOLS I-III, 2008, : 81 - 86
  • [4] Mining frequent itemsets in large databases: The hierarchical partitioning approach
    Tseng, Fan-Chen
    EXPERT SYSTEMS WITH APPLICATIONS, 2013, 40 (05) : 1654 - 1661
  • [5] Mining Noise-Tolerant Frequent Closed Itemsets in Very Large Database
    Chen, Junbo
    Zhou, Bo
    Wang, Xinyu
    Ding, Yiqun
    Chen, Lu
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2009, E92D (08): : 1523 - 1533
  • [6] HybridMiner: Mining maximal frequent itemsets using hybrid database representation approach
    Bashir, Shariq
    Baig, A. Rauf
    PROCEEDINGS OF THE INMIC 2005: 9TH INTERNATIONAL MULTITOPIC CONFERENCE - PROCEEDINGS, 2005, : 315 - 321
  • [7] Selective Database Projections Based Approach for Mining High-Utility Itemsets
    Bai, Anita
    Deshpande, Parag S.
    Dhabu, Meera
    IEEE ACCESS, 2018, 6 : 14389 - 14409
  • [8] Integrating frequent itemsets mining with relational database
    Qiu Yong
    ICEMI 2007: PROCEEDINGS OF 2007 8TH INTERNATIONAL CONFERENCE ON ELECTRONIC MEASUREMENT & INSTRUMENTS, VOL II, 2007, : 543 - 546
  • [9] MINING INTERESTING ITEMSETS FROM TRANSACTIONAL DATABASE
    Sumanga, K.
    Aishwarya, R.
    Hemavathi, E.
    Niraimathi, A.
    2014 IEEE INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND COMPUTING RESEARCH (IEEE ICCIC), 2014, : 554 - 557
  • [10] Parallel mining of top-k frequent itemsets in very large text database
    Wang, YH
    Jia, Y
    Yang, SQ
    ADVANCES IN WEB-AGE INFORMATION MANAGEMENT, PROCEEDINGS, 2005, 3739 : 706 - 712