Benchmarking the effectiveness of sequential pattern mining methods

被引:11
|
作者
Kum, Hye-Chung [1 ]
Chang, Joong Hyuk
Wang, Wei
机构
[1] Univ N Carolina, Dept Comp Sci, Chapel Hill, NC 27599 USA
[2] Yonsei Univ, Dept Comp Sci, Seoul 120749, South Korea
关键词
benchmarking effectiveness; evaluating quality of results; sequential pattern mining;
D O I
10.1016/j.datak.2006.01.004
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recently, there is an increasing interest in new intelligent mining methods to find more meaningful and compact results. In intelligent data mining research, accessing the quality and usefulness of the results from different mining methods is essential. However, there is no general benchmarking criteria to evaluate whether these new methods are indeed more effective compared to the traditional methods. Here we propose a novel benchmarking criteria that can systematically evaluate the effectiveness of any sequential pattern mining method under a variety of situations. The benchmark evaluates how well a mining method finds known common patterns in synthetic data. Such an evaluation provides a comprehensive understanding of the resulting patterns generated from any mining method empirically. In this paper, the criteria are applied to conduct a detailed comparison study of the support-based sequential pattern model with an approximate pattern model based on sequence alignment. The study suggests that the alignment model will give a good summary of the sequential data in the form of a set of common patterns in the data. In contrast, the support model generates massive amounts of frequent patterns with much redundancy. This suggests that the results of the support model require more post processing before it can be of actual use in real applications. (c) 2006 Elsevier B.V. All rights reserved.
引用
收藏
页码:30 / 50
页数:21
相关论文
共 50 条
  • [41] On Probabilistic Models for Uncertain Sequential Pattern Mining
    Muzammal, Muhammad
    Raman, Rajeev
    ADVANCED DATA MINING AND APPLICATIONS, ADMA 2010, PT I, 2010, 6440 : 60 - 72
  • [42] Declarative Sequential Pattern Mining of Care Pathways
    Guyet, Thomas
    Happe, Andre
    Dauxais, Yann
    ARTIFICIAL INTELLIGENCE IN MEDICINE, AIME 2017, 2017, 10259 : 261 - 266
  • [43] A Subsequence Interleaving Model for Sequential Pattern Mining
    Fowkes, Jaroslav
    Sutton, Charles
    KDD'16: PROCEEDINGS OF THE 22ND ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2016, : 835 - 844
  • [44] A Improved PrefixSpan Algorithm For Sequential Pattern Mining
    Dong, Liang
    Hong, Wang
    2014 5TH IEEE INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING AND SERVICE SCIENCE (ICSESS), 2014, : 377 - 380
  • [45] Discretized sequential pattern mining for behaviour classification
    Scott Buffett
    Granular Computing, 2021, 6 : 853 - 866
  • [46] Fault diagnosis based on sequential pattern mining
    Hu, Rui-Fei
    Wang, Ling
    Mei, Xiao-Qin
    Luo, Yang
    Jisuanji Jicheng Zhizao Xitong/Computer Integrated Manufacturing Systems, CIMS, 2010, 16 (07): : 1412 - 1418
  • [47] Sequential Pattern Mining with Multidimensional Interval Items
    Chen, Bob
    Peng, Weiming
    Song, Jihua
    TEHNICKI VJESNIK-TECHNICAL GAZETTE, 2022, 29 (04): : 1220 - 1229
  • [48] CLOSED SEQUENTIAL PATTERN MINING IN BIOLOGICAL DATA
    Jawahar, S.
    Harishchander, A.
    Devaraju, S.
    Ali, S. Ahamed Johnsha
    Manivasagan, C.
    Sumathi, P.
    INTERNATIONAL JOURNAL OF LIFE SCIENCE AND PHARMA RESEARCH, 2020, : 9 - 13
  • [49] NetNMSP: Nonoverlapping maximal sequential pattern mining
    Li, Yan
    Zhang, Shuai
    Guo, Lei
    Liu, Jing
    Wu, Youxi
    Wu, Xindong
    APPLIED INTELLIGENCE, 2022, 52 (09) : 9861 - 9884
  • [50] Sequential Pattern Mining Algorithm Based on Interestingness
    Li, Tao
    Zhang, Shuaichi
    Chen, Hui
    Ren, Yongjun
    Li, Xiang
    Ren, Yongzhen
    2018 FIRST INTERNATIONAL COGNITIVE CITIES CONFERENCE (IC3 2018), 2018, : 69 - 74