Non-redundant sequential rules: Theory and algorithm

Cited by: 42
Authors
Lo, David [1]
Khoo, Siau-Cheng [2]
Wong, Limsoon [2]
Affiliations
[1] Singapore Management Univ, Sch Informat Syst, Singapore, Singapore
[2] Natl Univ Singapore, Dept Comp Sci, Singapore 117548, Singapore
Keywords
Theoretical data mining; Frequent pattern mining; Sequential pattern mining; Sequential rules; Non-redundant rules
DOI
10.1016/j.is.2009.01.002
Chinese Library Classification
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
A sequential rule expresses a relationship between two series of events happening one after another. Sequential rules are potentially useful for analyzing data in sequential format, ranging from purchase histories and network logs to program execution traces. In this work, we investigate and propose a syntactic characterization of a non-redundant set of sequential rules, built upon past work on compact sets of representative patterns. A rule is redundant if it can be inferred from another rule having the same support and confidence. When using the set of mined rules as a composite filter, replacing a full set of rules with a non-redundant subset of the rules does not impact the accuracy of the filter. We consider several rule sets based on the composition of various types of pattern sets: generators, projected-database generators, closed patterns and projected-database closed patterns. We investigate the completeness and tightness of these rule sets. We characterize a tight and complete set of non-redundant rules by defining it based on the composition of two pattern sets. Furthermore, we propose a compressed set of non-redundant rules in a spirit similar to how closed patterns serve as a compressed representation of a full set of patterns. Lastly, we propose an algorithm to mine this compressed set of non-redundant rules. A performance study shows that the proposed algorithm significantly improves both the runtime and compactness of mined rules over mining a full set of sequential rules. © 2009 Elsevier B.V. All rights reserved.
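As an illustration of the support and confidence measures mentioned in the abstract, the following is a minimal sketch rather than the authors' algorithm. It assumes the common subsequence-based definitions: the support of a rule pre -> post is the number of sequences containing pre followed by post as a subsequence, and its confidence is that count divided by the number of sequences containing pre. The function names, the event labels and the toy database are all hypothetical, and the paper's exact definitions may differ.

```python
from typing import List, Sequence, Tuple


def is_subsequence(pattern: Sequence[str], sequence: Sequence[str]) -> bool:
    """True if the events of `pattern` occur in `sequence` in order (gaps allowed)."""
    it = iter(sequence)
    # `event in it` advances the iterator, so the order of events is respected.
    return all(event in it for event in pattern)


def support(pattern: Sequence[str], database: List[Sequence[str]]) -> int:
    """Number of sequences in the database that contain `pattern`."""
    return sum(1 for seq in database if is_subsequence(pattern, seq))


def rule_stats(pre: Sequence[str], post: Sequence[str],
               database: List[Sequence[str]]) -> Tuple[int, float]:
    """Support and confidence of the rule pre -> post (assumed definitions)."""
    sup_pre = support(pre, database)
    sup_rule = support(list(pre) + list(post), database)
    confidence = sup_rule / sup_pre if sup_pre else 0.0
    return sup_rule, confidence


# Hypothetical toy database of event sequences (e.g. user sessions).
db = [
    ["login", "browse", "add_to_cart", "pay"],
    ["login", "browse", "logout"],
    ["login", "add_to_cart", "pay"],
    ["browse", "pay"],
]

print(rule_stats(["login"], ["pay"], db))
print(rule_stats(["login", "add_to_cart"], ["pay"], db))
```

Under these assumed definitions, two rules can be compared on their (support, confidence) pairs, which is the precondition the abstract gives for one rule to make another redundant; the inference step itself is defined in the paper and is not reproduced here.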
Pages: 438-453
Number of pages: 16