IOMRA - A High Efficiency Frequent Itemset Mining Algorithm Based on the MapReduce Computation Model

被引：0

作者：

Liu, Sheng-Hui ^{[1
]}

Liu, Shi-Jia ^{[1
]}

Chen, Shi-Xuan ^{[2
]}

Yu, Kun-Ming ^{[2
]}

机构：

[1] Harbin Univ Sci & Technol, Sch Software, Harbin, Heilongjiang, Peoples R China

[2] Chung Hua Univ, Dept Comp Sci & Informat Engn, Hsinchu, Taiwan

来源：

2014 IEEE 17TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND ENGINEERING (CSE) | 2014年

关键词：

Frequent Itemset Mining; Aprior; MapReduce; Hadoop;

D O I：

10.1109/CSE.2014.247

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

The goal of Frequent Itemset Mining (FIM) is to find the biggest number of frequently used subsets from a big transaction database. In previous studies, using the advantage of multicore computing, the execution time of an Apriori algorithm was sharply decreased: when the size of a data set was more than TBs and a single host had been unable to afford a large number of operations by using a number of computers connected into a super computer to speed up execution as being the obvious solution. Some parallel Apriori algorithms, based on the MapReduce framework, have been proposed. However, with these algorithms, memory would be quickly exhausted and communication cost would rise sharply. This would greatly reduce execution efficiency. In this paper, we present an improved reformative Apriori algorithm that uses the length of each transaction to determine the size of the maximum merge candidate itemsets. By reducing the production of low frequency itemsets in Map function, memory exhaustion is ameliorated, greatly improving execution efficiency.

引用

页码：1290 / 1295

页数：6

共 50 条

[41] An Incremental Algorithm for Frequent Itemset Mining on Spark
Yu, Min
Zuo, Chuang
Yuan, Yunpeng
Yang, Yulu
[J]. 2017 IEEE 2ND INTERNATIONAL CONFERENCE ON BIG DATA ANALYSIS (ICBDA), 2017, : 281 - 285
[42] A Parallel MapReduce Algorithm to Efficiently Support Itemset Mining on High Dimensional Data
Apiletti, Daniele
Baralis, Elena
Cerquitelli, Tania
Garza, Paolo
Pulvirenti, Fabio
Michiardi, Pietro
[J]. BIG DATA RESEARCH, 2017, 10 : 53 - 69
[43] Human resource recommendation algorithm based on improved frequent itemset mining
Zhaoshan, Liu
Yiming, Ma
Huihua, Zheng
Dege, Liu
Jing, Liu
[J]. FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2022, 126 : 284 - 288
[44] Frequent Itemset Mining Algorithm Based on Differential Privacy in Vertical Structure
Long, Shigong
Lu, Hongqin
Chen, Tingting
Zhou, Nannan
Liu, Hai
[J]. International Journal of Network Security, 2022, 24 (01) : 75 - 82
[45] Bit Splicing Frequent Itemset Mining Algorithm Based on Dynamic Grouping
Xu, Wenhe
Lu, Jun
[J]. WEB AND BIG DATA, PT III, APWEB-WAIM 2023, 2024, 14333 : 417 - 432
[46] Evaluation modeling in English teaching based on frequent itemset mining algorithm
Cui, Yuexia
[J]. ENGINEERING TECHNOLOGY AND APPLICATIONS, 2014, : 199 - 205
[47] Pruning Optimization in Frequent Itemset Mining Algorithm Based on Bit Combination
Lu, Jun
Zhou, Kailong
Guo, Zhicong
[J]. BASIC & CLINICAL PHARMACOLOGY & TOXICOLOGY, 2020, 126 : 115 - 116
[48] An efficient algorithm for frequent itemset mining on data streams
Xie Zhi-Jun
Chen Hong
Li, Cuiping
[J]. ADVANCES IN DATA MINING: APPLICATIONS IN MEDICINE, WEB MINING, MARKETING, IMAGE AND SIGNAL MINING, 2006, 4065 : 474 - 491
[49] New approach in Big Data Mining for frequent itemset using mapreduce in HDFS
Nikam, Pallavi V.
Deshpande, Deepa S.
[J]. 2018 3RD INTERNATIONAL CONFERENCE FOR CONVERGENCE IN TECHNOLOGY (I2CT), 2018,
[50] Parallel Analytical Model for Frequent Itemset Mining
Poorva, K.
Anushree, H. K.
Mahesha, K., V
Pavithra, T. R.
Vinutha, D. C.
Chandini, S. B.
[J]. 2017 INTERNATIONAL CONFERENCE ON CURRENT TRENDS IN COMPUTER, ELECTRICAL, ELECTRONICS AND COMMUNICATION (CTCEEC), 2017, : 517 - 519

← 1 2 3 4 5 →