ODscan: On-demand database scan approach to mining large itemsets

被引：0

作者：

Alsabbagh, JR ^{[1
]}

机构：

[1] Grand Valley State Univ, Dept Comp Sci & Informat Syst, Allendale, MI 49401 USA

来源：

IKE'03: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE ENGINEERING, VOLS 1 AND 2 | 2003年

关键词：

data mining; frequent patterns; large itemsets; association rules; market basket analysis; Apriori;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

The discovery of frequently occurring patterns in large databases is commonly referred to as the market basket analysis problem Solving the problem requires high PO overhead (to perform several scans of the database), large memory resources (to tentatively store candidate patterns until they are actually counted through a database scan), and intensive computatons (to perform subset testing among patterns). Most commercial implementations and published algorithms for solving the problem are based in one way or another on the Apriori principle. We propose an algorithm, ODscan, which is also Apriori-based but is unique in that it attempts to balance the cost of I/O and the memory requirements during the derivation process. In principle, ODscan requires only two scans of the database. Additional scans may be needed only when user-controlled memory requirements are exceeded In that case, a scan results in freeing some of the memory before resuming the process of derivation.

引用

页码：154 / 159

页数：6

共 50 条

[41] Incremental mining large itemsets with constraints in dynamic databases
Li, Naiqian
Shen, Junyi
Hsi-An Chiao Tung Ta Hsueh/Journal of Xi'an Jiaotong University, 2003, 37 (04): : 359 - 363
[42] Mining Frequent Gradual Itemsets from Large Databases
Di-Jorio, Lisa
Laurent, Anne
Teisseire, Maguelonne
ADVANCES IN INTELLIGENT DATA ANALYSIS VIII, PROCEEDINGS, 2009, 5772 : 297 - +
[43] An efficient and flexible algorithm for online mining of large itemsets
Jea, KF
Chang, MY
Lin, KC
INFORMATION PROCESSING LETTERS, 2004, 92 (06) : 311 - 316
[44] Mining frequent itemsets in large data warehouses: A novel approach proposed for sparse data sets
Fakhrahmad, S. M.
Jahromi, M. Zolghadri
Sadreddini, M. H.
INTELLIGENT DATA ENGINEERING AND AUTOMATED LEARNING - IDEAL 2007, 2007, 4881 : 517 - +
[45] Enabling On-Demand Database Computing with MIT SuperCloud Database Management System
Prout, Andrew
Kepner, Jeremy
Michaleas, Peter
Arcand, William
Bestor, David
Bergeron, Bill
Byun, Chansup
Edwards, Lauren
Gadepally, Vijay
Hubbell, Matthew
Mullen, Julie
Rosa, Antonio
Yee, Charles
Reuther, Albert
2015 IEEE HIGH PERFORMANCE EXTREME COMPUTING CONFERENCE (HPEC), 2015,
[46] Association Rule Mining: A Graph Based Approach for Mining Frequent Itemsets
Tiwari, Vivek
Tiwari, Vipin
Gupta, Shailendra
Tiwari, Renu
2010 INTERNATIONAL CONFERENCE ON NETWORKING AND INFORMATION TECHNOLOGY (ICNIT 2010), 2010, : 309 - 313
[47] Fast algorithm for mining global frequent itemsets based on distributed database
He, Bo
Wang, Yue
Yang, Wu
Chen, Yuan
ROUGH SETS AND KNOWLEDGE TECHNOLOGY, PROCEEDINGS, 2006, 4062 : 415 - 420
[48] Mining weighted-frequent-regular itemsets from transactional database
Klangwisan, Kittipa
Amphawan, Komate
2017 9TH INTERNATIONAL CONFERENCE ON KNOWLEDGE AND SMART TECHNOLOGY (KST), 2017, : 66 - 71
[49] Analysis on High Utility Infrequent ItemSets Mining Over Transactional Database
Shrivastava, Sunidhi
Johari, Punit Kumar
2016 IEEE INTERNATIONAL CONFERENCE ON RECENT TRENDS IN ELECTRONICS, INFORMATION & COMMUNICATION TECHNOLOGY (RTEICT), 2016, : 897 - 902
[50] SS-FIM: Single Scan for Frequent Itemsets Mining in Transactional Databases
Djenouri, Youcef
Comuzzi, Marco
Djenouri, Djamel
ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2017, PT II, 2017, 10235 : 644 - 654

← 1 2 3 4 5 →