Efficiently Mining Maximal Diverse Frequent Itemsets

Cited by: 3
Authors
Wu, Dingming [1]
Luo, Dexin [1]
Jensen, Christian S. [2]
Huang, Joshua Zhexue [1]
Affiliations
[1] Shenzhen Univ, Coll Comp Sci & Software Engn, Shenzhen, Peoples R China
[2] Aalborg Univ, Dept Comp Sci, Aalborg, Denmark
Keywords
Frequent itemsets; Diversification; Algorithm; Patterns; MAFIA
DOI
10.1007/978-3-030-18579-4_12
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Given a database of transactions, where each transaction is a set of items, maximal frequent itemset mining aims to find all itemsets that are frequent, meaning that they consist of items that co-occur in transactions more often than a given threshold, and that are maximal, meaning that they are not contained in other frequent itemsets. Such itemsets are the most interesting ones in a meaningful sense. We study the problem of efficiently finding such itemsets with the added constraint that only the top-k most diverse ones should be returned. An itemset is diverse if its items belong to many different categories according to a given hierarchy of item categories. We propose a solution that relies on a purposefully designed index structure called the FP*-tree and an accompanying bound-based algorithm. An extensive experimental study offers insight into the performance of the solution, indicating that it is capable of outperforming an existing method by orders of magnitude and of scaling to large databases of transactions.
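The definitions in the abstract can be illustrated with a small brute-force sketch. It is not the FP*-tree or the bound-based algorithm from the paper, and it makes several assumptions that the abstract does not state: an itemset counts as frequent when its support is at least `min_support`, the category hierarchy is flattened to a single hypothetical `category_of` mapping, and diversity is taken to be the number of distinct categories an itemset covers. All function names and the sample data are illustrative only.

```python
from itertools import combinations


def maximal_frequent_itemsets(transactions, min_support):
    """Brute force: enumerate every itemset, keep the frequent ones,
    then keep only those not strictly contained in another frequent one."""
    items = sorted(set().union(*transactions))
    frequent = []
    for size in range(1, len(items) + 1):
        for candidate in combinations(items, size):
            cset = frozenset(candidate)
            # Support = number of transactions containing every item.
            support = sum(1 for t in transactions if cset <= t)
            if support >= min_support:  # assumed convention: >= threshold
                frequent.append(cset)
    return [s for s in frequent if not any(s < other for other in frequent)]


def diversity(itemset, category_of):
    """Assumed measure: number of distinct categories in the itemset."""
    return len({category_of[item] for item in itemset})


def top_k_diverse(transactions, min_support, category_of, k):
    """Rank maximal frequent itemsets by diversity and return the top k.
    Ties are broken by the sorted item names to keep the result stable."""
    maximal = maximal_frequent_itemsets(transactions, min_support)
    ranked = sorted(maximal, key=lambda s: (-diversity(s, category_of), sorted(s)))
    return ranked[:k]


# Hypothetical sample data.
transactions = [
    {"milk", "bread", "apple"},
    {"milk", "bread"},
    {"milk", "apple", "soap"},
    {"bread", "apple", "soap"},
    {"milk", "bread", "apple", "soap"},
]
category_of = {"milk": "food", "bread": "food", "apple": "food", "soap": "household"}

result = top_k_diverse(transactions, min_support=3, category_of=category_of, k=1)
print([sorted(s) for s in result])  # prints [['apple', 'soap']]
```

With a support threshold of 3, the sample data has four maximal frequent itemsets, all pairs, and only {apple, soap} spans two categories, so it ranks first. The enumeration is exponential in the number of items, which is the cost an index structure and pruning bounds of the kind the abstract describes are meant to avoid.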
Pages: 191-207
Number of pages: 17