Parallelized Frequent Item Set Mining Using a Tall and Skinny Matrix

Cited by: 0
Authors
Janakiram, D. Pooja [1]
Affiliations
[1] Indian Inst Technol Madras, Madras, Tamil Nadu, India
DOI
10.1109/ICDMW.2016.198
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology];
Discipline classification code
0812 ;
Abstract
Big data applications consist of very large collections of small records, for example data from retail websites, movie streaming services, sensor applications, and many other sources. Frequent itemset mining is a common tool in these applications, used to generate recommendations that improve the user experience of a website. It is also used to find interesting patterns in scientific databases such as gene expression databases. One interesting way to represent such data is to transform it into tall and skinny matrices. In this paper we explore the concept of tall and skinny matrices to generate frequent itemsets. The proposed algorithm is implemented on a map-reduce based framework (Apache Spark), and experiments are performed to test the scalability of the algorithm on a cloud platform.
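The abstract does not spell out the algorithm, so the following is only an illustrative sketch of the general idea, not the authors' method. It assumes each transaction becomes one row of a binary matrix A with one column per item (many rows, few columns, hence "tall and skinny"). Under that assumption the product A^T A holds item supports on its diagonal and pair co-occurrence counts off the diagonal, and it can be accumulated one row at a time, which is what makes the computation a natural fit for a map-reduce style sum. All function names here are hypothetical.

```python
from itertools import combinations


def to_tall_skinny(transactions):
    """Encode transactions as a binary matrix: one row per transaction,
    one column per distinct item (assumed representation)."""
    items = sorted({item for t in transactions for item in t})
    col = {item: j for j, item in enumerate(items)}
    rows = []
    for t in transactions:
        row = [0] * len(items)
        for item in t:
            row[col[item]] = 1
        rows.append(row)
    return items, rows


def gram_counts(rows, n_items):
    """Accumulate A^T A row by row. Each row contributes independently,
    so the per-row work could be a map step and the sum a reduce step."""
    gram = [[0] * n_items for _ in range(n_items)]
    for row in rows:
        present = [j for j, v in enumerate(row) if v]
        for a in present:
            for b in present:
                gram[a][b] += 1
    return gram


def frequent_singles_and_pairs(transactions, min_support):
    """Read frequent 1-itemsets off the diagonal and frequent 2-itemsets
    off the upper triangle of A^T A."""
    items, rows = to_tall_skinny(transactions)
    n = len(items)
    gram = gram_counts(rows, n)
    singles = {items[j]: gram[j][j] for j in range(n)
               if gram[j][j] >= min_support}
    pairs = {(items[a], items[b]): gram[a][b]
             for a, b in combinations(range(n), 2)
             if gram[a][b] >= min_support}
    return singles, pairs


if __name__ == "__main__":
    baskets = [
        {"bread", "milk"},
        {"bread", "butter"},
        {"bread", "milk", "butter"},
        {"milk"},
    ]
    print(frequent_singles_and_pairs(baskets, min_support=2))
```

This sketch stops at itemsets of size two; larger itemsets would need a further candidate-generation step that the abstract does not describe.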
Pages: 8-13
Number of pages: 6
Related papers
50 in total
  • [31] An Efficient Vertical-Apriori Mapreduce Algorithm for Frequent Item-set Mining
    Sun, Dawei
    Lee, Vincent C. S.
    Burstein, Frada
    Haghighi, Pari Delir
    PROCEEDINGS OF THE 2015 10TH IEEE CONFERENCE ON INDUSTRIAL ELECTRONICS AND APPLICATIONS, 2015, : 108 - 112
  • [32] Frequent item-set mining and clustering based ranked biomedical text summarization
    Supriya Gupta
    Aakanksha Sharaff
    Naresh Kumar Nagwani
    The Journal of Supercomputing, 2023, 79 : 139 - 159
  • [33] Frequent Pattern Network Mining Algorithm Based on Transaction-item Association Matrix
    Wang, Yi-Jun
    Sun, Wet-Qing
    She, Jin-Tao
    Wei, Sheng-Biao
    Wang, Cheng-Min
    PROCEEDINGS OF THE 13TH WSEAS INTERNATIONAL CONFERENCE ON COMPUTERS, 2009, : 498 - +
  • [34] Network Intrusion Detection Systems Analysis using Frequent Item Set Mining Algorithm FP-Max and Apriori
    Hidayanto, Bekti Cahyo
    Muhammad, Rowi Fajar
    Kusumawardani, Renny P.
    Syafaat, Achmad
    4TH INFORMATION SYSTEMS INTERNATIONAL CONFERENCE (ISICO 2017), 2017, 124 : 751 - 758
  • [35] Efficient and accurate personalized product recommendations through frequent item set mining fusion algorithm
    Kang, Lifeng
    Wang, Yankun
    HELIYON, 2024, 10 (03)
  • [36] Incremental Technique with Set of Frequent Word Item sets for Mining Large Indonesian Text Data
    Maylawati, Dian Sa'adillah
    Ramdhani, Muhammad Ali
    Rahman, Ali
    Darmalaksana, Wahyudin
    2017 5TH INTERNATIONAL CONFERENCE ON CYBER AND IT SERVICE MANAGEMENT (CITSM 2017), 2017, : 12 - 17
  • [37] Efficiently Using Matrix in Mining Maximum Frequent Itemset
    Liu Zhen-yu
    Xu Wei-xiang
    Liu Xumin
    THIRD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING: WKDD 2010, PROCEEDINGS, 2010, : 50 - 54
  • [38] Proposed algorithm for frequent item set generation
    Singh, Archana
    Agarwal, Jyoti
    2014 SEVENTH INTERNATIONAL CONFERENCE ON CONTEMPORARY COMPUTING (IC3), 2014, : 160 - 165
  • [40] Using Frequent Item Set Mining and Feature Selection Methods to Identify Interacted Risk Factors - The Atrial Fibrillation Case Study
    Li, Xiang
    Liu, Haifeng
    Du, Xin
    Hu, Gang
    Xie, Guotong
    Zhang, Ping
    EXPLORING COMPLEXITY IN HEALTH: AN INTERDISCIPLINARY SYSTEMS APPROACH, 2016, 228 : 562 - 566