Frequent Itemset Mining for Big Data

被引:0
|
作者
Moens, Sandy [1 ]
Aksehirli, Emin [1 ]
Goethals, Bart [1 ]
机构
[1] Univ Antwerp, Antwerp, Belgium
关键词
distributed data mining; mapreduce; hadoop; eclat; PARALLEL; ALGORITHMS;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Frequent Itemset Mining (FIM) is one of the most well known techniques to extract knowledge from data. The combinatorial explosion of FIM methods become even more problematic when they are applied to Big Data. Fortunately, recent improvements in the field of parallel programming already provide good tools to tackle this problem. However, these tools come with their own technical challenges, e.g. balanced data distribution and inter-communication costs. In this paper, we investigate the applicability of FIM techniques on the MapReduce platform. We introduce two new methods for mining large datasets: Dist-Eclat focuses on speed while BigFIM is optimized to run on really large datasets. In our experiments we show the scalability of our methods.
引用
收藏
页数:8
相关论文
共 50 条
  • [1] Frequent Itemset Mining for Big Data
    Chavan, Kiran
    Kulkarni, Priyanka
    Ghodekar, Pooja
    Patil, S. N.
    [J]. 2015 International Conference on Green Computing and Internet of Things (ICGCIoT), 2015, : 1365 - 1368
  • [2] Recommendation using Frequent Itemset Mining in Big Data
    Kunjachan, Honeytta
    Hareesh, M. J.
    Sreedevi, K. M.
    [J]. PROCEEDINGS OF THE 2018 SECOND INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING AND CONTROL SYSTEMS (ICICCS), 2018, : 561 - 566
  • [3] Iterative sampling based frequent itemset mining for big data
    Wu, Xian
    Fan, Wei
    Peng, Jing
    Zhang, Kun
    Yu, Yong
    [J]. INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2015, 6 (06) : 875 - 882
  • [4] MrFIM: A MapReduce Approach for Frequent Itemset Mining in Big Data
    Rahman, Abdul
    Manjaramkar, Arati
    [J]. 2018 4TH INTERNATIONAL CONFERENCE FOR CONVERGENCE IN TECHNOLOGY (I2CT), 2018,
  • [5] Iterative sampling based frequent itemset mining for big data
    Xian Wu
    Wei Fan
    Jing Peng
    Kun Zhang
    Yong Yu
    [J]. International Journal of Machine Learning and Cybernetics, 2015, 6 : 875 - 882
  • [6] Finding tendencies in streaming data using Big Data frequent itemset mining
    Fernandez-Basso, Carlos
    Francisco-Agra, Abel J.
    Martin-Bautista, Maria J.
    Dolores Ruiz, M.
    [J]. KNOWLEDGE-BASED SYSTEMS, 2019, 163 : 666 - 674
  • [7] Efficient Probabilistic Frequent Itemset Mining in Big Sparse Uncertain Data
    Xu, Jing
    Li, Ning
    Mao, Xiao-Jiao
    Yang, Yu-Bin
    [J]. PRICAI 2014: TRENDS IN ARTIFICIAL INTELLIGENCE, 2014, 8862 : 235 - 247
  • [8] Review of Apriori based Frequent Itemset Mining Solutions on Big Data
    Fard, Mohammad Javad Shayegan
    Namin, Parsa Asgari
    [J]. 2020 6TH INTERNATIONAL CONFERENCE ON WEB RESEARCH (ICWR), 2020, : 157 - 164
  • [9] Frequent Itemset Mining in Big Data With Effective Single Scan Algorithms
    Djenouri, Youcef
    Djenouri, Djamel
    Lin, Jerry Chun-Wei
    Belhadi, Asma
    [J]. IEEE ACCESS, 2018, 6 : 68013 - 68026
  • [10] A distributed frequent itemset mining algorithm using Spark for Big Data analytics
    Feng Zhang
    Min Liu
    Feng Gui
    Weiming Shen
    Abdallah Shami
    Yunlong Ma
    [J]. Cluster Computing, 2015, 18 : 1493 - 1501