Parallelized Frequent Item Set Mining Using a Tall and Skinny Matrix

Cited by: 0
Authors
Janakiram, D. Pooja [1]
Affiliation
[1] Indian Inst Technol Madras, Madras, Tamil Nadu, India
Source
2016 IEEE 16TH INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS (ICDMW) | 2016
DOI
10.1109/ICDMW.2016.198
Chinese Library Classification
TP [Automation Technology, Computer Technology];
Discipline classification code
0812;
Abstract
Big data applications consist of very large collections of small records, for example data from retail websites, movie streaming services, and sensor applications. Frequent item set mining is a common tool in these applications, used for instance to generate recommendations that improve the user experience of a website. It is also used to find interesting patterns in scientific databases such as gene expression databases. One interesting way to represent such data is to transform it into tall and skinny matrices. In this paper we explore the use of tall and skinny matrices to generate frequent item sets. The proposed algorithm is implemented on a map-reduce based framework such as Apache Spark, and experiments are performed to test the scalability of the algorithm on a cloud platform.
Pages: 8 - 13
Page count: 6
Related papers
50 in total
  • [21] FIMSIM: Discovering Communities by Frequent Item-Set Mining and Similarity Search
    Peschel, Jakub
    Batko, Michal
    Valcik, Jakub
    Sedmidubsky, Jan
    Zezula, Pavel
    SIMILARITY SEARCH AND APPLICATIONS, SISAP 2021, 2021, 13058 : 372 - 383
  • [22] Collaborative privacy preserving frequent item set mining in vertically partitioned databases
    Gudes, E
    Rozenberg, B
    DATA AND APPLICATIONS SECURITY XVII: STATUS AND PROSPECTS, 2004, 142 : 91 - 104
  • [23] Optimization of frequent item set mining parallelization algorithm based on spark platform
    Deng, Fan
    Wang, Jiabin
    Lv, Sheng
    DISCOVER COMPUTING, 2024, 27 (01)
  • [24] Survey on Frequent Item-Set Mining Approaches in Market Basket Analysis
    Maske, Anisha
    Joglekar, Bela
    2018 FOURTH INTERNATIONAL CONFERENCE ON COMPUTING COMMUNICATION CONTROL AND AUTOMATION (ICCUBEA), 2018,
  • [25] Application of Frequent Item Set Mining Algorithm in IDS Based on Hadoop Framework
    Tong, Zhang
    Ying, Hou
    PROCEEDINGS OF THE 30TH CHINESE CONTROL AND DECISION CONFERENCE (2018 CCDC), 2018, : 1908 - 1911
  • [26] Mining frequent pattern using item-transformation method
    Chu, TP
    Wu, F
    Chiang, SW
    FOURTH ANNUAL ACIS INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION SCIENCE, PROCEEDINGS, 2005, : 698 - 706
  • [27] AN EFFICIENT ALGORITHM FOR DETECTING OUTLIERS IN A DISTRIBUTED ENVIRONMENT USING MINIMAL IN-FREQUENT ITEM SET PATTERN MINING
    Chandran, Chandra Ravi
    Padmanabhan, Ajitha
    IIOAB JOURNAL, 2016, 7 (09) : 22 - 25
  • [28] Maximal Frequent Item Sequences Mining
    Zhou Lijuan
    Zhang Zhang
    PROGRESS IN MEASUREMENT AND TESTING, PTS 1 AND 2, 2010, 108-111 : 1211 - 1216
  • [29] Frequent item-set mining and clustering based ranked biomedical text summarization
    Gupta, Supriya
    Sharaff, Aakanksha
    Nagwani, Naresh Kumar
    JOURNAL OF SUPERCOMPUTING, 2023, 79 (01) : 139 - 159
  • [30] Machine Learning based Network Intrusion Detection with Hybrid Frequent Item Set Mining
    Firat, Murat
    Bakal, Gokhan
    Akbas, Ayhan
    JOURNAL OF POLYTECHNIC-POLITEKNIK DERGISI, 2024, 27 (05):