ScadiBino: An effective MapReduce-based association rule mining method

被引:4
|
作者
Barkhordari, Mohammadhossein [1 ]
Niamanesh, Mahdi [1 ]
机构
[1] Informat & Commun Technol Res Ctr, Coll Intersect, 5 Saeedi Alley,Enghelab St, Tehran, Iran
关键词
Big data; Data mining; MapReduce; Association rules; PARALLEL;
D O I
10.1145/2617848.2617853
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Current data mining algorithms are impractical for huge amounts of data because they are time consuming and therefore inefficient. Association rule mining is one of the most famous data mining algorithms. Many parallel and distributed methods have been proposed for association rule mining. However, these methods are not suited to big data for a number of reasons, such as improper data location, data skewness, lack of load balancing, lack of support for generalized association rule mining, and lack of an obvious method for rule extraction. The MapReduce-based architecture is a parallel and distributable solution for association rule mining. To improve the performance of MapReduce, proposed methods for association rules need to be customized. The performance of iterative algorithms in MapReduce architectures may not be optimum. Two main issues affect the performance of MapReduce architectures: data placement and network traffic. In this paper, a scalable and distributable binominal association rule mining method (ScaDiBino ARM) is proposed. This method converts input data items to binominal format to take advantage of scalable and distributable attributes of MapReduce structures. The proposed method was evaluated by applying it to real traffic data of a mobile operator to enable it to recommend values added services (VAS) to its customers. The results show that the rule extraction time improved significantly after applying the proposed rule mining method.
引用
收藏
页码:1 / 8
页数:8
相关论文
共 50 条
  • [1] An efficient MapReduce-based rule matching method for production system
    Li, Ying
    Liu, Weiwei
    Cao, Bin
    Yin, Jianwei
    Yao, Min
    [J]. FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2016, 54 : 478 - 489
  • [2] PFIMD: a parallel MapReduce-based algorithm for frequent itemset mining
    Mao Yimin
    Geng Junhao
    Deborah Simon Mwakapesa
    Yaser Ahangari Nanehkaran
    Zhang Chi
    Deng Xiaoheng
    Chen Zhigang
    [J]. Multimedia Systems, 2021, 27 : 709 - 722
  • [3] Fuzzy improved firefly-based MapReduce for association rule mining
    Driff L.N.
    Drias H.
    [J]. International Journal of Innovative Computing and Applications, 2023, 14 (1-2) : 104 - 123
  • [4] A MapReduce-based approach to social network big data mining
    Qi, Fuli
    [J]. JOURNAL OF COMPUTATIONAL METHODS IN SCIENCES AND ENGINEERING, 2023, 23 (05) : 2535 - 2547
  • [5] MapReduce-based Frequent Itemset Mining for Analysis of Electronic Evidence
    Jiang, Xueqing
    Sun, Guozi
    [J]. 2013 EIGHTH INTERNATIONAL WORKSHOP ON SYSTEMATIC APPROACHES TO DIGITAL FORENSIC ENGINEERING (SADFE), 2013,
  • [6] PFIMD: a parallel MapReduce-based algorithm for frequent itemset mining
    Mao, Yimin
    Geng, Junhao
    Mwakapesa, Deborah Simon
    Nanehkaran, Yaser Ahangari
    Chi, Zhang
    Deng, Xiaoheng
    Chen, Zhigang
    [J]. MULTIMEDIA SYSTEMS, 2021, 27 (04) : 709 - 722
  • [7] A MapReduce-Based User Identification Algorithm in Web Usage Mining
    Srivastava, Mitali
    Garg, Rakhi
    Mishra, P. K.
    [J]. INTERNATIONAL JOURNAL OF INFORMATION TECHNOLOGY AND WEB ENGINEERING, 2018, 13 (02) : 11 - 23
  • [8] An effective method for privacy preserving association rule mining
    Zhang, Peng
    Tong, Yun-Hai
    Tang, Shi-Wei
    Yang, Dong-Qing
    Ma, Xiu-Li
    [J]. Ruan Jian Xue Bao/Journal of Software, 2006, 17 (08): : 1764 - 1774
  • [9] A mapreduce-based adjoint method for preventing brain disease
    Zettam M.
    Laassiri J.
    Enneya N.
    [J]. Zettam, Manal (zettammanal@gmail.com), 2018, SpringerOpen (05)
  • [10] ScaDiGraph: A MapReduce-Based Method for Solving Graph Problems
    Barkhordari, Mohammadhossein
    Niamanesh, Mardi
    [J]. JOURNAL OF INFORMATION SCIENCE AND ENGINEERING, 2017, 33 (01) : 143 - 158