A Proposition for Sequence Mining Using Pattern Structures

被引:6
|
作者
Codocedo, Victor [1 ,3 ]
Bosc, Guillaume [2 ]
Kaytoue, Mehdi [2 ]
Boulicaut, Jean-Francois [2 ]
Napoli, Amedeo [3 ]
机构
[1] Inria Chile, Las Condes, Chile
[2] Univ Lyon, CNRS, INSA Lyon, LIRIS, Lyon, France
[3] Univ Lorraine, INRIA Nancy Grand Est, CNRS, LORIA, Nancy, France
来源
关键词
D O I
10.1007/978-3-319-59271-8_7
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this article we present a novel approach to rare sequence mining using pattern structures. Particularly, we are interested in mining closed sequences, a type of maximal sub-element which allows providing a succinct description of the patterns in a sequence database. We present and describe a sequence pattern structure model in which rare closed subsequences can be easily encoded. We also propose a discussion and characterization of the search space of closed sequences and, through the notion of sequence alignments, provide an intuitive implementation of a similarity operator for the sequence pattern structure based on directed acyclic graphs. Finally, we provide an experimental evaluation of our approach in comparison with state-of-the-art closed sequence mining algorithms showing that our approach can largely outperform them when dealing with large regions of the search space.
引用
收藏
页码:106 / 121
页数:16
相关论文
共 50 条
  • [11] Mining Preserving Structures in a Graph Sequence
    Uno, Takeaki
    Uno, Yushi
    COMPUTING AND COMBINATORICS, 2015, 9198 : 3 - 15
  • [12] Design Pattern Mining Using Distributed Learning Automata and DNA Sequence Alignment
    Esmaeilpour, Mansour
    Naderifar, Vahideh
    Shukur, Zarina
    PLOS ONE, 2014, 9 (09):
  • [13] Mining Pattern Changes in Sensor Data Streams using Approximate Sequence Alignment
    Meng, Frank
    Nystrom, Donna
    WMSCI 2008: 12TH WORLD MULTI-CONFERENCE ON SYSTEMICS, CYBERNETICS AND INFORMATICS, VOL VII, PROCEEDINGS, 2008, : 87 - 92
  • [14] An Empirical Study on Retrieving Structural Clones Using Sequence Pattern Mining Algorithms
    Udagawa, Yoshihisa
    16TH INTERNATIONAL CONFERENCE ON INFORMATION INTEGRATION AND WEB-BASED APPLICATIONS & SERVICES (IIWAS 2014), 2014, : 270 - 276
  • [15] Sequence Pattern Mining based on Markov Chain
    Zhang Junyan
    Yang Chenhui
    2015 7TH INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY IN MEDICINE AND EDUCATION (ITME), 2015, : 234 - 238
  • [16] Frequent Sequence Pattern Mining with Differential Privacy
    Zhou, Fengli
    Lin, Xiaoli
    INTELLIGENT COMPUTING THEORIES AND APPLICATION, PT I, 2018, 10954 : 454 - 466
  • [17] The Maximal Frequent Pattern Mining of DNA Sequence
    Bai, Shuang
    Bai, Si-Xue
    2009 IEEE INTERNATIONAL CONFERENCE ON GRANULAR COMPUTING ( GRC 2009), 2009, : 23 - 26
  • [18] Closed inter-sequence pattern mining
    Wang, Chun-Sheng
    Liu, Ying-Ho
    Chu, Kuo-Chung
    JOURNAL OF SYSTEMS AND SOFTWARE, 2013, 86 (06) : 1603 - 1612
  • [19] AN ATTACK PATTERN MINING ALGORITHM BASED ON FUZZY LOGIC AND SEQUENCE PATTERN
    Li, Yang
    Xue, Ying
    Yao, Yuangang
    Zhao, Xianghui
    Liu, Jianyi
    Zhang, Ru
    PROCEEDINGS OF 2016 4TH IEEE INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND INTELLIGENCE SYSTEMS (IEEE CCIS 2016), 2016, : 234 - 238
  • [20] Enhanced sequence identification technique for protein sequence database mining with hybrid frequent pattern mining algorithm
    Jeyabharathi, J.
    Shanthi, D.
    INTERNATIONAL JOURNAL OF DATA MINING AND BIOINFORMATICS, 2016, 16 (03) : 205 - 229