A Highly Parallel Algorithm for Frequent Itemset Mining

被引:0
|
作者
Mesa, Alejandro [1 ,2 ]
Feregrino-Uribe, Claudia [2 ]
Cumplido, Rene [2 ]
Hernandez-Palancar, Jose [1 ]
机构
[1] CENATAV, Adv Technol Applicat Ctr, Havana, Cuba
[2] Natl Inst Astrophys Opt & Elect, Puebla, Mexico
来源
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Mining frequent itemsets in large databases is a widely used technique in Data Mining. Several sequential and parallel algorithms have been developed, although, when dealing with high data volumes, the execution of those algorithms takes more time and resources than expected. Because of this, finding alternatives to speed up the execution time of those algorithms is an active topic of research. Previous attempts of acceleration using custom architectures have been limited because of the nature of the algorithms that have been conceived sequentially and do riot exploit the intrinsic parallelism that the hardware provides. The innovation in this paper is a highly parallel algorithm that utilizes a vertical bit vector (VBV) data layout and its feasibility for making support counting. Our results show that for dense databases a custom architecture for this algorithm can perform faster than the fastest architecture reported in previous works by one order of magnitude.
引用
收藏
页码:291 / +
页数:2
相关论文
共 50 条
  • [31] A highly scalable parallel algorithm for maximally informative k-itemset mining
    Saber Salah
    Reza Akbarinia
    Florent Masseglia
    [J]. Knowledge and Information Systems, 2017, 50 : 1 - 26
  • [32] Frequent Itemset Mining Algorithm Based on Linear Table
    Lu, Jun
    Xu, Wenhe
    Zhou, Kailong
    Guo, Zhicong
    [J]. JOURNAL OF DATABASE MANAGEMENT, 2023, 34 (01)
  • [33] A highly scalable parallel algorithm for maximally informative k-itemset mining
    Salah, Saber
    Akbarinia, Reza
    Masseglia, Florent
    [J]. KNOWLEDGE AND INFORMATION SYSTEMS, 2017, 50 (01) : 1 - 26
  • [34] A novel algorithm for frequent itemset mining in data warehouses
    徐利军
    谢康林
    [J]. Journal of Zhejiang University-Science A(Applied Physics & Engineering), 2006, (02) : 216 - 224
  • [35] A Distributed Frequent Itemset Mining Algorithm Based on Spark
    Gui, Feng
    Ma, Yunlong
    Zhang, Feng
    Liu, Min
    Li, Fei
    Shen, Weiming
    Bai, Hua
    [J]. PROCEEDINGS OF THE 2015 IEEE 19TH INTERNATIONAL CONFERENCE ON COMPUTER SUPPORTED COOPERATIVE WORK IN DESIGN (CSCWD), 2015, : 271 - 275
  • [36] Accelerating Parallel Frequent Itemset Mining on Graphics Processors with Sorting
    Huang, Yuan-Shao
    Yu, Kun-Ming
    Zhou, Li-Wei
    Hsu, Ching-Hsien
    Liu, Sheng-Hui
    [J]. NETWORK AND PARALLEL COMPUTING, NPC 2013, 2013, 8147 : 245 - 256
  • [37] An efficient polynomial delay algorithm for pseudo frequent itemset mining
    Uno, Takeaki
    Arimura, Hiroki
    [J]. DISCOVERY SCIENCE, PROCEEDINGS, 2007, 4755 : 219 - +
  • [38] Frequent itemset mining-based spatial subclustering algorithm
    Wang, Qian
    Gao, Zhi-Peng
    Qiu, Xue-Song
    Wang, Xing-Bin
    [J]. Beijing Youdian Daxue Xuebao/Journal of Beijing University of Posts and Telecommunications, 2015, 38 : 20 - 23
  • [39] AnyFI: An Anytime Frequent Itemset Mining Algorithm for Data Streams
    Goyal, Poonam
    Challa, Jagat Sesh
    Shrivastava, Shivin
    Goyal, Navneet
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2017, : 942 - 947
  • [40] A frequent itemset mining algorithm based on composite granular computing
    Wu, Hongjuan
    Liu, Yulu
    Yan, Pei
    Fang, Gang
    Zhong, Jing
    [J]. JOURNAL OF COMPUTATIONAL METHODS IN SCIENCES AND ENGINEERING, 2018, 18 (01) : 247 - 257