IOMRA - A High Efficiency Frequent Itemset Mining Algorithm Based on the MapReduce Computation Model

被引:0
|
作者
Liu, Sheng-Hui [1 ]
Liu, Shi-Jia [1 ]
Chen, Shi-Xuan [2 ]
Yu, Kun-Ming [2 ]
机构
[1] Harbin Univ Sci & Technol, Sch Software, Harbin, Heilongjiang, Peoples R China
[2] Chung Hua Univ, Dept Comp Sci & Informat Engn, Hsinchu, Taiwan
关键词
Frequent Itemset Mining; Aprior; MapReduce; Hadoop;
D O I
10.1109/CSE.2014.247
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The goal of Frequent Itemset Mining (FIM) is to find the biggest number of frequently used subsets from a big transaction database. In previous studies, using the advantage of multicore computing, the execution time of an Apriori algorithm was sharply decreased: when the size of a data set was more than TBs and a single host had been unable to afford a large number of operations by using a number of computers connected into a super computer to speed up execution as being the obvious solution. Some parallel Apriori algorithms, based on the MapReduce framework, have been proposed. However, with these algorithms, memory would be quickly exhausted and communication cost would rise sharply. This would greatly reduce execution efficiency. In this paper, we present an improved reformative Apriori algorithm that uses the length of each transaction to determine the size of the maximum merge candidate itemsets. By reducing the production of low frequency itemsets in Map function, memory exhaustion is ameliorated, greatly improving execution efficiency.
引用
收藏
页码:1290 / 1295
页数:6
相关论文
共 50 条
  • [41] An Incremental Algorithm for Frequent Itemset Mining on Spark
    Yu, Min
    Zuo, Chuang
    Yuan, Yunpeng
    Yang, Yulu
    [J]. 2017 IEEE 2ND INTERNATIONAL CONFERENCE ON BIG DATA ANALYSIS (ICBDA), 2017, : 281 - 285
  • [42] A Parallel MapReduce Algorithm to Efficiently Support Itemset Mining on High Dimensional Data
    Apiletti, Daniele
    Baralis, Elena
    Cerquitelli, Tania
    Garza, Paolo
    Pulvirenti, Fabio
    Michiardi, Pietro
    [J]. BIG DATA RESEARCH, 2017, 10 : 53 - 69
  • [43] Human resource recommendation algorithm based on improved frequent itemset mining
    Zhaoshan, Liu
    Yiming, Ma
    Huihua, Zheng
    Dege, Liu
    Jing, Liu
    [J]. FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2022, 126 : 284 - 288
  • [44] Frequent Itemset Mining Algorithm Based on Differential Privacy in Vertical Structure
    Long, Shigong
    Lu, Hongqin
    Chen, Tingting
    Zhou, Nannan
    Liu, Hai
    [J]. International Journal of Network Security, 2022, 24 (01) : 75 - 82
  • [45] Bit Splicing Frequent Itemset Mining Algorithm Based on Dynamic Grouping
    Xu, Wenhe
    Lu, Jun
    [J]. WEB AND BIG DATA, PT III, APWEB-WAIM 2023, 2024, 14333 : 417 - 432
  • [46] Evaluation modeling in English teaching based on frequent itemset mining algorithm
    Cui, Yuexia
    [J]. ENGINEERING TECHNOLOGY AND APPLICATIONS, 2014, : 199 - 205
  • [47] Pruning Optimization in Frequent Itemset Mining Algorithm Based on Bit Combination
    Lu, Jun
    Zhou, Kailong
    Guo, Zhicong
    [J]. BASIC & CLINICAL PHARMACOLOGY & TOXICOLOGY, 2020, 126 : 115 - 116
  • [48] An efficient algorithm for frequent itemset mining on data streams
    Xie Zhi-Jun
    Chen Hong
    Li, Cuiping
    [J]. ADVANCES IN DATA MINING: APPLICATIONS IN MEDICINE, WEB MINING, MARKETING, IMAGE AND SIGNAL MINING, 2006, 4065 : 474 - 491
  • [49] New approach in Big Data Mining for frequent itemset using mapreduce in HDFS
    Nikam, Pallavi V.
    Deshpande, Deepa S.
    [J]. 2018 3RD INTERNATIONAL CONFERENCE FOR CONVERGENCE IN TECHNOLOGY (I2CT), 2018,
  • [50] Parallel Analytical Model for Frequent Itemset Mining
    Poorva, K.
    Anushree, H. K.
    Mahesha, K., V
    Pavithra, T. R.
    Vinutha, D. C.
    Chandini, S. B.
    [J]. 2017 INTERNATIONAL CONFERENCE ON CURRENT TRENDS IN COMPUTER, ELECTRICAL, ELECTRONICS AND COMMUNICATION (CTCEEC), 2017, : 517 - 519