A parameterizable enumeration algorithm for sequence mining

被引:0
|
作者
David, J. [1 ]
Nourine, L. [1 ]
机构
[1] LIMOS Univ Blaise Pascal, UMR 6158, F-63173 Aubiere, France
关键词
COMPLEXITY;
D O I
10.1016/j.tcs.2012.11.005
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
In this paper, we introduce an generic framework for the mining of sequences under various constraints. More precisely, we study the enumeration of all partitions of a word w into multisets of subsequences. We show that using additional predicates, this generator can be used for frequent subsequences and substrings mining. We define the transition graph T-w whose vertices are multisets of words and arcs are transitions between multisets. We show that T-w is a directed acyclic graph and it admits a covering tree. We use T-w to propose a generic algorithm that enumerates all multisets that satisfies a set of predicates, without redundancy. (C) 2012 Elsevier B.V. All rights reserved.
引用
收藏
页码:59 / 68
页数:10
相关论文
共 50 条
  • [1] Applying Parameterizable Dynamic Configurations to Sequence Alignment
    Davidson, Tom
    Bruneel, Karel
    Devos, Harald
    Stroobandt, Dirk
    PARALLEL COMPUTING: FROM MULTICORES AND GPU'S TO PETASCALE, 2010, 19 : 616 - 623
  • [2] An incremental sequence pattern mining algorithm
    Fu, Zhongliang
    Chen, Nan
    Wuhan Daxue Xuebao (Xinxi Kexue Ban)/Geomatics and Information Science of Wuhan University, 2010, 35 (07): : 763 - 767
  • [3] An improved parallel algorithm for sequence mining
    She, Chundong
    Tang, Jian
    Li, Lei
    Wang, Hongbing
    Fan, Zhihua
    2005 IEEE International Conference on Mechatronics and Automations, Vols 1-4, Conference Proceedings, 2005, : 1692 - 1696
  • [4] Hierarchical Sequence Clustering Algorithm for Data Mining
    Chezhian, V. Umadevi
    Subash, Thanappan
    Samy, M. Ragavan
    WORLD CONGRESS ON ENGINEERING, WCE 2011, VOL III, 2011, : 1861 - 1864
  • [5] Sustainable evolutionary algorithm based on sequence mining
    Yang, Guanci
    Li, Qin
    Li, Shaobo
    Zhong, Yong
    Guo, Guanqi
    Journal of Computational Information Systems, 2011, 7 (02): : 599 - 606
  • [6] A polynomial space and polynomial delay algorithm for enumeration of maximal motifs in a sequence
    Arimura, H
    Uno, T
    ALGORITHMS AND COMPUTATION, 2005, 3827 : 724 - 737
  • [7] Mining Web Browsing Log by Using Relaxed Biclique Enumeration Algorithm in MapReduce
    Su, Chung-Tsai
    Tsao, Wen-Kwang
    Chu, Wei-Rong
    Liao, Ming-Ray
    2012 IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE AND INTELLIGENT AGENT TECHNOLOGY WORKSHOPS (WI-IAT WORKSHOPS 2012), VOL 3, 2012, : 54 - 58
  • [8] Enhanced sequence identification technique for protein sequence database mining with hybrid frequent pattern mining algorithm
    Jeyabharathi, J.
    Shanthi, D.
    INTERNATIONAL JOURNAL OF DATA MINING AND BIOINFORMATICS, 2016, 16 (03) : 205 - 229
  • [9] An algorithm of association rules mining based on digit sequence
    Fang, Gang
    Wu, Yuan-Bin
    Liu, Yu-Lu
    Xiong, Jiang
    2009 IEEE INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING AND INTELLIGENT SYSTEMS, PROCEEDINGS, VOL 1, 2009, : 532 - 535
  • [10] A AprioriAll Sequence Mining Algorithm Based on Learner Behavior
    Yu, Zhenghong
    Li, Dan
    INTELLIGENT COMPUTING THEORIES AND APPLICATION, PT II, 2018, 10955 : 560 - 569