Summary queries for frequent itemsets mining

被引:2
|
作者
Zhang, Shichao [1 ,2 ]
Jin, Zhi [3 ]
Lu, Jingli [4 ]
机构
[1] Zhejiang Normal Univ, Dept Comp Sci, Inst Comp Software & Theory, Jinhua, Peoples R China
[2] Nanjing Univ, State Key Lab Novel Software Technol, Nanjing, Peoples R China
[3] Peking Univ, Sch EE & CS, Key Lab High Confidence Software Technol, Beijing 100871, Peoples R China
[4] Massey Univ, Inst Informat Sci & Tech, Palmertson N, New Zealand
基金
澳大利亚研究理事会;
关键词
Frequent pattern discovery; Summary query; Support distribution; ASSOCIATION RULES; SUPPORT; PATTERNS;
D O I
10.1016/j.jss.2009.09.026
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
There are many advanced techniques that can efficiently mine frequent itemsets using a minimum-support. However, the question that remains unanswered is whether the minimum-support can really help decision makers to make decisions. In this paper, we study four summary queries for frequent itemsets mining, namely, (1) finding a support-average of itemsets, (2) finding a support-quantile of itemsets, (3) finding the number of itemsets that greater/less than the support-average, i.e., an approximated distribution of itemsets, and (4) finding the relative frequency of an itemset (compared its frequency with that of other itemsets in the same dataset). With these queries, a decision maker will know whether an itemset in question is greater/less than the support-quantile; the distribution of itemsets; and the frequentness of an itemset. Processing these summary queries is challenging, because the minimum-support constraint cannot be used to prune infrequent itemsets. In this paper, we propose several simple yet effective approximation solutions. We conduct extensive experiments for evaluating Our strategy, and illustrate that the proposed approaches can well model and capture the statistical parameters (summary queries) of itemsets in a database. (C) 2009 Elsevier Inc. All rights reserved.
引用
收藏
页码:405 / 411
页数:7
相关论文
共 50 条
  • [1] Mining maximum length frequent itemsets: A summary of results
    Hu, Tianming
    Fu, Qian
    Wang, Xiaonan
    Sung, Sam Yuan
    [J]. ICTAI-2006: EIGHTEENTH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2006, : 505 - 512
  • [2] Mining frequent itemsets in a stream
    Calders, Toon
    Dexters, Nele
    Gillis, Joris J.M.
    Goethals, Bart
    [J]. Information Systems, 2014, 39 (01) : 233 - 255
  • [3] Mining frequent itemsets in a stream
    Calders, Toon
    Dexters, Nele
    Goethals, Bart
    [J]. ICDM 2007: PROCEEDINGS OF THE SEVENTH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, 2007, : 83 - +
  • [4] An Algorithm for Mining Frequent Itemsets
    Hernandez Leon, Raudel
    Perez Suarez, Airel
    Feregrino Uribe, Claudia
    Guzman Zavaleta, Zobeida Jezabel
    [J]. 2008 5TH INTERNATIONAL CONFERENCE ON ELECTRICAL ENGINEERING, COMPUTING SCIENCE AND AUTOMATIC CONTROL (CCE 2008), 2008, : 236 - +
  • [5] Mining frequent itemsets in a stream
    Calders, Toon
    Dexters, Nele
    Gillis, Joris J. M.
    Goethals, Bart
    [J]. INFORMATION SYSTEMS, 2014, 39 : 233 - 255
  • [6] Mining frequent closed itemsets with the frequent pattern list
    Tseng, FC
    Hsu, CC
    Chen, H
    [J]. 2001 IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2001, : 653 - 654
  • [7] Distributed Frequent Closed Itemsets Mining
    Liu, Chun
    Zheng, Zheng
    Cai, Kai-Yuan
    Zhang, Shichao
    [J]. SITIS 2007: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON SIGNAL IMAGE TECHNOLOGIES & INTERNET BASED SYSTEMS, 2008, : 43 - 50
  • [8] Incremental Frequent Itemsets Mining with MapReduce
    Kandalov, Kirill
    Gudes, Ehud
    [J]. ADVANCES IN DATABASES AND INFORMATION SYSTEMS, ADBIS 2017, 2017, 10509 : 247 - 261
  • [9] Efficiently mining maximal frequent itemsets
    Gouda, K
    Zaki, MJ
    [J]. 2001 IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2001, : 163 - 170
  • [10] Mining Frequent Weighted Closed Itemsets
    Bay Vo
    Nhu-Y Tran
    Duong-Ha Ngo
    [J]. ADVANCED COMPUTATIONAL METHODS FOR KNOWLEDGE ENGINEERING, 2013, 479 : 379 - 390