A Heuristic Rule based Approximate Frequent Itemset Mining Algorithm

被引:3
|
作者
Li, Haifeng [1 ]
Zhang, Yuejin [1 ]
Zhang, Ning [1 ]
Jia, Hengyue [1 ]
机构
[1] Cent Univ Finance & Econ, Sch Informat, Beijing, Peoples R China
关键词
Frequent Itemset; Sampling; Data Mining; Heuristic Rule;
D O I
10.1016/j.procs.2016.07.087
中图分类号
F [经济];
学科分类号
02 ;
摘要
In this paper, we focus on the problem of mining the approximate frequent itemsets. To improve the performance, we employ a sampling method, in which a heuristic rule is used to dynamically determine the sampling rate. Two parameters are introduced to implement the rule. Also, we maintain the data synopsis in an in-memory data structure named SFIHtree to speed up the runtime. Our proposed algorithm SFIH can be efficiently performed over this tree. We conducted extensive experiments and showed that the mining performance can be improved significantly with a high accuracy when we used reasonable parameters. (C) 2016 The Authors. Published by Elsevier B.V.
引用
收藏
页码:324 / 333
页数:10
相关论文
共 50 条
  • [1] A Parallel Algorithm for Approximate Frequent Itemset Mining using MapReduce
    Fumarola, Fabio
    Malerba, Donato
    [J]. 2014 INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING & SIMULATION (HPCS), 2014, : 335 - 342
  • [2] An approximate approach to frequent itemset mining
    Zhang, Chunkai
    Zhang, Xudong
    Tian, Panbo
    [J]. 2017 IEEE SECOND INTERNATIONAL CONFERENCE ON DATA SCIENCE IN CYBERSPACE (DSC), 2017, : 68 - 73
  • [3] Incremental association rule mining using promising frequent itemset algorithm
    Amornchewin, Ratchadaporn
    Kreesuradej, Worapoj
    [J]. 2007 6TH INTERNATIONAL CONFERENCE ON INFORMATION, COMMUNICATIONS & SIGNAL PROCESSING, VOLS 1-4, 2007, : 780 - 784
  • [4] Fast Mining Algorithm of Frequent Itemset Based on Spark
    Ding, Jia-Man
    Li, Hai-Bin
    Deng, Bin
    Jia, Lian-Yin
    You, Jin-Guo
    [J]. Ruan Jian Xue Bao/Journal of Software, 2023, 34 (05): : 2446 - 2464
  • [5] Frequent Itemset Mining Algorithm based on Sampling Method
    Li, Haifeng
    Zhang, Ning
    Zhang, Yuejin
    [J]. PROCEEDINGS OF THE 2015 5TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCES AND AUTOMATION ENGINEERING, 2016, 42 : 852 - 855
  • [6] Frequent Itemset Mining Algorithm Based on Linear Table
    Lu, Jun
    Xu, Wenhe
    Zhou, Kailong
    Guo, Zhicong
    [J]. JOURNAL OF DATABASE MANAGEMENT, 2023, 34 (01)
  • [7] A Distributed Frequent Itemset Mining Algorithm Based on Spark
    Gui, Feng
    Ma, Yunlong
    Zhang, Feng
    Liu, Min
    Li, Fei
    Shen, Weiming
    Bai, Hua
    [J]. PROCEEDINGS OF THE 2015 IEEE 19TH INTERNATIONAL CONFERENCE ON COMPUTER SUPPORTED COOPERATIVE WORK IN DESIGN (CSCWD), 2015, : 271 - 275
  • [8] Frequent itemset mining based on Heuristic Two Level-Counting
    Liu, Feng
    Tian, FengZhan
    Zhu, QiLiang
    [J]. ICICIC 2006: FIRST INTERNATIONAL CONFERENCE ON INNOVATIVE COMPUTING, INFORMATION AND CONTROL, VOL 2, PROCEEDINGS, 2006, : 640 - +
  • [9] An efficient frequent itemset mining algorithm
    Luo, Ke
    Zhang, Xue-Mao
    [J]. PROCEEDINGS OF 2007 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2007, : 756 - 761
  • [10] A parallel algorithm for frequent itemset mining
    Li, L
    Zhai, DH
    Fan, J
    [J]. PARALLEL AND DISTRIBUTED COMPUTING, APPLICATIONS AND TECHNOLOGIES, PDCAT'2003, PROCEEDINGS, 2003, : 868 - 871