Parallelized Frequent Item Set Mining Using a Tall and Skinny Matrix

被引:0
|
作者
Janakiram, D. Pooja [1 ]
机构
[1] Indian Inst Technol Madras, Madras, Tamil Nadu, India
关键词
D O I
10.1109/ICDMW.2016.198
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Big data applications consist of very large collection of small records, for example data from a retail website, data from movie streaming services, sensor data applications and many other such applications. Frequent item set mining is one of the common tools used for all these applications to generate recommendations to improve user experience of the website. Frequent itemset mining is also used to find interesting patterns on scientific databases such as gene expression database. One interesting way to represent such big data applications is by transforming them into tall and skinny matrices. In this paper we explore the concept of tall and skinny matrices to generate frequent item sets. The proposed algorithm is implemented on a map-reduce based framework such as Apache Spark and experiments are performed to test the scalability of the algorithm on a cloud platform.
引用
收藏
页码:8 / 13
页数:6
相关论文
共 50 条
  • [41] Frequent Item set Using Abundant Data on Hadoop Clusters in Big Data
    Danapaquiame, N.
    Balaji, V.
    Gayathri, R.
    Kodhai, E.
    Sambasivam, G.
    BIOSCIENCE BIOTECHNOLOGY RESEARCH COMMUNICATIONS, 2018, 11 (01): : 104 - 112
  • [42] Mining Compressed Frequent Subtrees Set
    ZHAO Chuanshen1
    2. School of Computer Science and Engineering
    3. Department of Computer
    WuhanUniversityJournalofNaturalSciences, 2009, 14 (01) : 29 - 34
  • [43] A Parallelized Frequent Temporal Pattern Mining Algorithm on a Time Series Database
    Nguyen Thanh Vu
    Chau Vo
    INTELLIGENT INFORMATION AND DATABASE SYSTEMS, ACIIDS 2021, 2021, 12672 : 78 - 91
  • [44] TSM2: Optimizing Tall-and-Skinny Matrix-Matrix Multiplication on GPUs
    Chen, Jieyang
    Xiong, Nan
    Liang, Xin
    Tao, Dingwen
    Li, Sihuan
    Ouyang, Kaiming
    Zhao, Kai
    DeBardeleben, Nathan
    Guan, Qiang
    Chen, Zizhong
    INTERNATIONAL CONFERENCE ON SUPERCOMPUTING (ICS 2019), 2019, : 106 - 116
  • [45] Performance engineering for real and complex tall & skinny matrix multiplication kernels on GPUs
    Ernst, Dominik
    Hager, Georg
    Thies, Jonas
    Wellein, Gerhard
    INTERNATIONAL JOURNAL OF HIGH PERFORMANCE COMPUTING APPLICATIONS, 2021, 35 (01): : 5 - 19
  • [46] A vertical format algorithm for mining frequent item sets
    Guo Yi-ming
    Wang Zhi-jun
    2ND IEEE INTERNATIONAL CONFERENCE ON ADVANCED COMPUTER CONTROL (ICACC 2010), VOL. 4, 2010, : 11 - 13
  • [47] Frequent Item Mining When Obtaining Support Is Costly
    Lin, Joe Wing-Ho
    Wong, Raymond Chi-Wing
    BIG DATA ANALYTICS AND KNOWLEDGE DISCOVERY, DAWAK 2019, 2019, 11708 : 37 - 56
  • [48] Bayesian classifier based on frequent item sets mining
    Xu, Junming
    Jiang, Yuan
    Zhou, Zhihua
    Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2007, 44 (08): : 1293 - 1300
  • [49] Research on the Application of Frequent Item Mining in Credit Risks
    Zhang, Wenchao
    2016 INTERNATIONAL CONGRESS ON COMPUTATION ALGORITHMS IN ENGINEERING (ICCAE 2016), 2016, : 146 - 151
  • [50] A new mining algorithm based on frequent item sets
    Wen Yun
    FIRST INTERNATIONAL WORKSHOP ON KNOWLEDGE DISCOVERY AND DATA MINING, PROCEEDINGS, 2007, : 410 - 413