A novel algorithm for frequent itemset mining in data warehouses

被引:3
|
作者
徐利军
谢康林
机构
[1] Department of Computer Science and Engineering Shanghai Jiao Tong University Shanghai 200030 China
[2] Department of Computer Science and Engineering Shanghai Jiao Tong University Shanghai 200030 China
关键词
Frequent itemset; Close itemset; Star schema; Dimension table; Fact table;
D O I
暂无
中图分类号
TP311.13 [];
学科分类号
1201 ;
摘要
Current technology for frequent itemset mining mostly applies to the data stored in a single transaction database. This paper presents a novel algorithm MultiClose for frequent itemset mining in data warehouses. MultiClose respectively computes the results in single dimension tables and merges the results with a very efficient approach. Close itemsets technique is used to improve the performance of the algorithm. The authors propose an efficient implementation for star schemas in which their al- gorithm outperforms state-of-the-art single-table algorithms.
引用
收藏
页码:216 / 224
页数:9
相关论文
共 50 条
  • [1] Novel algorithm for frequent itemset mining in data warehouses
    Xu L.-J.
    Xie K.-L.
    [J]. Journal of Zhejiang University-SCIENCE A, 2006, 7 (2): : 216 - 224
  • [2] An efficient algorithm for frequent itemset mining on data streams
    Xie Zhi-Jun
    Chen Hong
    Li, Cuiping
    [J]. ADVANCES IN DATA MINING: APPLICATIONS IN MEDICINE, WEB MINING, MARKETING, IMAGE AND SIGNAL MINING, 2006, 4065 : 474 - 491
  • [3] A novel parallel frequent itemset mining algorithm for automatic enterprise
    Mao, Yimin
    Wu, Bin
    Deng, Qianhu
    Mahmoodi, Soroosh
    Chen, Zhigang
    Chen, Yeh-Cheng
    [J]. ENTERPRISE INFORMATION SYSTEMS, 2023, 17 (10)
  • [4] AnyFI: An Anytime Frequent Itemset Mining Algorithm for Data Streams
    Goyal, Poonam
    Challa, Jagat Sesh
    Shrivastava, Shivin
    Goyal, Navneet
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2017, : 942 - 947
  • [5] An algorithm for in-core frequent itemset mining on streaming data
    Jin, RM
    Agrawal, G
    [J]. FIFTH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2005, : 210 - 217
  • [6] An efficient frequent itemset mining algorithm
    Luo, Ke
    Zhang, Xue-Mao
    [J]. PROCEEDINGS OF 2007 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2007, : 756 - 761
  • [7] A Novel Parallel Algorithm for Frequent Itemset Mining of Incremental Dataset
    Xu, Lijun
    Zhang, Yun
    [J]. 2015 2ND INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE AND CONTROL ENGINEERING ICISCE 2015, 2015, : 41 - 44
  • [8] A parallel algorithm for frequent itemset mining
    Li, L
    Zhai, DH
    Fan, J
    [J]. PARALLEL AND DISTRIBUTED COMPUTING, APPLICATIONS AND TECHNOLOGIES, PDCAT'2003, PROCEEDINGS, 2003, : 868 - 871
  • [9] MapReduce Based Frequent Itemset Mining Algorithm on Stream Data
    Chaudhary, Hemant
    Yadav, Deepak Kumar
    Bhatnagar, Rajat
    Chandrasekhar, Uddagiri
    [J]. 2015 GLOBAL CONFERENCE ON COMMUNICATION TECHNOLOGIES (GCCT), 2015, : 586 - 591
  • [10] Frequent Itemset Mining for Big Data
    Chavan, Kiran
    Kulkarni, Priyanka
    Ghodekar, Pooja
    Patil, S. N.
    [J]. 2015 International Conference on Green Computing and Internet of Things (ICGCIoT), 2015, : 1365 - 1368