Maintenance of Maximal Frequent Itemsets in Large Databases

被引:0
|
作者
Lian, Wang
Cheung, David W.
Yiu, S. M.
机构
关键词
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
There have been many studies on efficient discovery of maximal frequent itemsets in large databases. However, it is nontrivial to maintain such discovered itemsets if more and more data is inserted into the database as the insertions may invalidate some existing maximal frequent itemsets and also create some new ones. In this paper, we clearly address the relationships between old and new maximal frequent itemsets and propose an algorithm IMFI, which is based on these relationships to reuse previously discovered knowledge. The algorithm follows a top-down mechanism rather than traditional bottom-up methods to produce fewer candidates. Moreover, we integrate SG-tree into IMFI to improve the counting efficiency, which is faster than those methods based on vertical bitmap database representation. Evaluations on IMFI have been performed using both synthetic and real databases. Preliminary results show that applying IMFI is always much faster than an available incremental MFI mining algorithm, especially when it is equipped with SG-tree.
引用
收藏
页码:388 / 392
页数:5
相关论文
共 50 条
  • [1] Mining maximal frequent itemsets for large scale transaction databases
    Xia, R
    Yuan, W
    Ding, SC
    Liu, J
    Zhou, HB
    [J]. PROCEEDINGS OF THE 2004 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2004, : 1480 - 1485
  • [2] Incremental Maintenance of Frequent Itemsets in Evidential Databases
    Tobji, Mohamed Anis Bach
    Ben Yaghlane, Boutheina
    Mellouli, Khaled
    [J]. SYMBOLIC AND QUANTITATIVE APPROACHES TO REASONING WITH UNCERTAINTY, PROCEEDINGS, 2009, 5590 : 457 - +
  • [3] Parallel mining of maximal frequent itemsets from databases
    Chung, SM
    Luo, C
    [J]. 15TH IEEE INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2003, : 134 - 139
  • [4] Mining Maximal Frequent Itemsets over Sampling Databases
    Li, Haifeng
    [J]. PROCEEDINGS OF THE 2015 2ND INTERNATIONAL FORUM ON ELECTRICAL ENGINEERING AND AUTOMATION (IFEEA 2015), 2016, 54 : 28 - 31
  • [5] Mining Frequent Gradual Itemsets from Large Databases
    Di-Jorio, Lisa
    Laurent, Anne
    Teisseire, Maguelonne
    [J]. ADVANCES IN INTELLIGENT DATA ANALYSIS VIII, PROCEEDINGS, 2009, 5772 : 297 - +
  • [6] Efficient mining of maximal frequent itemsets from databases on a cluster of workstations
    Soon M. Chung
    Congnan Luo
    [J]. Knowledge and Information Systems, 2008, 16 : 359 - 391
  • [7] Efficient mining of maximal frequent itemsets from databases on a cluster of workstations
    Chung, Soon M.
    Luo, Congnan
    [J]. KNOWLEDGE AND INFORMATION SYSTEMS, 2008, 16 (03) : 359 - 391
  • [8] Incremental mining of weighted maximal frequent itemsets from dynamic databases
    Yun, Unil
    Lee, Gangin
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2016, 54 : 304 - 327
  • [9] Distributed mining of maximal frequent itemsets from Databases on a cluster of workstations
    Chung, SM
    Luo, CN
    [J]. 2004 IEEE INTERNATIONAL SYMPOSIUM ON CLUSTER COMPUTING AND THE GRID - CCGRID 2004, 2004, : 499 - 507
  • [10] Mining frequent closed itemsets in large databases by hierarchical partitioning
    Tseng, Fan-Chen
    [J]. PROCEEDINGS OF 2007 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2007, : 1832 - 1837