Discovery of maximum length frequent itemsets

被引:51
|
作者
Hu, Tianming
Sung, Sam Yuan
Xiong, Hui
Fu, Qian
机构
[1] S Texas Coll, Dept Comp Sci, Houston, TX 77002 USA
[2] Rutgers State Univ, MSIS Dept, Piscataway, NJ 08855 USA
[3] Natl Univ Singapore, Dept Comp Sci, Singapore 117548, Singapore
关键词
association analysis; frequent itemsets; maximum length frequent itemsets; FP-tree; data mining;
D O I
10.1016/j.ins.2007.08.006
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The use of frequent itemsets has been limited by the high computational cost as well as the large number of resulting itemsets. In many real-world scenarios, however, it is often sufficient to mine a small representative subset of frequent itemsets with low computational cost. To that end, in this paper, we define a new problem of finding the frequent itemsets with a maximum length and present a novel algorithm to solve this problem. Indeed, maximum length frequent itemsets can be efficiently identified in very large data sets and are useful in many application domains. Our algorithm generates the maximum length frequent itemsets by adapting a pattern fragment growth methodology based on the FP-tree structure. Also, a number of optimization techniques have been exploited to prune the search space. Finally, extensive experiments on real-world data sets validate the proposed algorithm. (C) 2007 Elsevier Inc. All rights reserved.
引用
收藏
页码:69 / 87
页数:19
相关论文
共 50 条
  • [1] Mining maximum length frequent itemsets: A summary of results
    Hu, Tianming
    Fu, Qian
    Wang, Xiaonan
    Sung, Sam Yuan
    [J]. ICTAI-2006: EIGHTEENTH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2006, : 505 - 512
  • [2] Maximum frequent itemsets discovery algorithm based on granular computing
    Chi, Haicheng
    Liu, Zhen
    [J]. 2018 IEEE SMARTWORLD, UBIQUITOUS INTELLIGENCE & COMPUTING, ADVANCED & TRUSTED COMPUTING, SCALABLE COMPUTING & COMMUNICATIONS, CLOUD & BIG DATA COMPUTING, INTERNET OF PEOPLE AND SMART CITY INNOVATION (SMARTWORLD/SCALCOM/UIC/ATC/CBDCOM/IOP/SCI), 2018, : 156 - 161
  • [3] A novel pruning algorithm for mining long and maximum length frequent itemsets
    Lessanibahri, Sina
    Gastaldi, Luca
    Gonzalez Fernandez, Camino
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2020, 142
  • [4] Automatic discovery of locally frequent itemsets in the presence of highly frequent itemsets
    Bodon, Ferenc
    Kouris, Ioannis N.
    Makris, Christos H.
    Tsakalidis, Athanasios K.
    [J]. INTELLIGENT DATA ANALYSIS, 2005, 9 (01) : 83 - 104
  • [5] Fast mining maximum frequent itemsets
    Lu, S.F.
    Lu, Z.D.
    [J]. Ruan Jian Xue Bao/Journal of Software, 2001, 12 (02): : 293 - 297
  • [6] A new algorithm for discovery of frequent itemsets
    Kotasek, P
    Zendulka, J
    [J]. KNOWLEDGE-BASED SOFTWARE ENGINEERING, 1998, 48 : 183 - 186
  • [7] Mining maximum frequent itemsets based on directed itemsets graph
    Wen Lei
    [J]. PROCEEDINGS OF 2004 CHINESE CONTROL AND DECISION CONFERENCE, 2004, : 681 - 683
  • [8] The Mining Algorithm of Maximum Frequent Itemsets Based on Frequent Pattern Tree
    Mi, Xifeng
    [J]. COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2022, 2022
  • [9] Discovery of Frequent Itemsets: Frequent Item Tree-Based Approach
    Kumar, A. V. Senthil
    Wahidabanu, R. S. D.
    [J]. JOURNAL OF ICT RESEARCH AND APPLICATIONS, 2007, 1 (01) : 42 - 55
  • [10] Fast Mining Algorithm of Global Maximum Frequent Itemsets
    He, Bo
    [J]. ADVANCES IN ELECTRICAL ENGINEERING AND AUTOMATION, 2012, 139 : 199 - 204