Benchmarking the effectiveness of sequential pattern mining methods

被引:11
|
作者
Kum, Hye-Chung [1 ]
Chang, Joong Hyuk
Wang, Wei
机构
[1] Univ N Carolina, Dept Comp Sci, Chapel Hill, NC 27599 USA
[2] Yonsei Univ, Dept Comp Sci, Seoul 120749, South Korea
关键词
benchmarking effectiveness; evaluating quality of results; sequential pattern mining;
D O I
10.1016/j.datak.2006.01.004
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recently, there is an increasing interest in new intelligent mining methods to find more meaningful and compact results. In intelligent data mining research, accessing the quality and usefulness of the results from different mining methods is essential. However, there is no general benchmarking criteria to evaluate whether these new methods are indeed more effective compared to the traditional methods. Here we propose a novel benchmarking criteria that can systematically evaluate the effectiveness of any sequential pattern mining method under a variety of situations. The benchmark evaluates how well a mining method finds known common patterns in synthetic data. Such an evaluation provides a comprehensive understanding of the resulting patterns generated from any mining method empirically. In this paper, the criteria are applied to conduct a detailed comparison study of the support-based sequential pattern model with an approximate pattern model based on sequence alignment. The study suggests that the alignment model will give a good summary of the sequential data in the form of a set of common patterns in the data. In contrast, the support model generates massive amounts of frequent patterns with much redundancy. This suggests that the results of the support model require more post processing before it can be of actual use in real applications. (c) 2006 Elsevier B.V. All rights reserved.
引用
收藏
页码:30 / 50
页数:21
相关论文
共 50 条
  • [31] Prefix and Suffix Sequential Pattern Mining
    Singh, Rina
    Graves, Jeffrey A.
    Talbert, Douglas A.
    Eberle, William
    ADVANCES IN DATA MINING: APPLICATIONS AND THEORETICAL ASPECTS (ICDM 2018), 2018, 10933 : 309 - 324
  • [32] Keyphrase Extraction with Sequential Pattern Mining
    Wang, Qingren
    Sheng, Victor S.
    Wu, Xindong
    THIRTY-FIRST AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 5003 - 5004
  • [33] SQUIRE: Sequential pattern mining with quantities
    Kim, C
    Lim, JH
    Ng, R
    Shim, K
    20TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, PROCEEDINGS, 2004, : 827 - 827
  • [34] Sequential Pattern Mining: A Survey on Approaches
    Boghey, Rajesh
    Singh, Shailendra
    2013 INTERNATIONAL CONFERENCE ON COMMUNICATION SYSTEMS AND NETWORK TECHNOLOGIES (CSNT 2013), 2013, : 670 - 674
  • [35] Sequential pattern mining in multiple streams
    Chen, G
    Wu, XD
    Zhu, XQ
    Fifth IEEE International Conference on Data Mining, Proceedings, 2005, : 585 - 588
  • [36] Mining probabilistic automata: a statistical view of sequential pattern mining
    Jacquemont, Stephanie
    Jacquenet, Francois
    Sebban, Marc
    MACHINE LEARNING, 2009, 75 (01) : 91 - 127
  • [37] Mining probabilistic automata: a statistical view of sequential pattern mining
    Stéphanie Jacquemont
    François Jacquenet
    Marc Sebban
    Machine Learning, 2009, 75 : 91 - 127
  • [38] Generalized Net of the Process of Sequential Pattern Mining by Generalized Sequential Pattern Algorithm (GSP)
    Bureva, Veselina
    Sotirova, Evdokia
    Chountas, Panagiotis
    INTELLIGENT SYSTEMS'2014, VOL 2: TOOLS, ARCHITECTURES, SYSTEMS, APPLICATIONS, 2015, 323 : 831 - 838
  • [39] A Review on Sequential Pattern Mining using Pattern Growth Approach
    Patel, Roshani
    Chaudhari, Tarunika
    PROCEEDINGS OF THE 2016 IEEE INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS, SIGNAL PROCESSING AND NETWORKING (WISPNET), 2016, : 1424 - 1427
  • [40] Sequential pattern mining: Optimum maximum sequential patterns and consistent sequential patterns
    Wang, Xilu
    Ya, Weili
    2007 IEEE INTERNATIONAL CONFERENCE ON INTEGRATION TECHNOLOGY, PROCEEDINGS, 2007, : 365 - +