Benchmarking the effectiveness of sequential pattern mining methods

被引:11
|
作者
Kum, Hye-Chung [1 ]
Chang, Joong Hyuk
Wang, Wei
机构
[1] Univ N Carolina, Dept Comp Sci, Chapel Hill, NC 27599 USA
[2] Yonsei Univ, Dept Comp Sci, Seoul 120749, South Korea
关键词
benchmarking effectiveness; evaluating quality of results; sequential pattern mining;
D O I
10.1016/j.datak.2006.01.004
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recently, there is an increasing interest in new intelligent mining methods to find more meaningful and compact results. In intelligent data mining research, accessing the quality and usefulness of the results from different mining methods is essential. However, there is no general benchmarking criteria to evaluate whether these new methods are indeed more effective compared to the traditional methods. Here we propose a novel benchmarking criteria that can systematically evaluate the effectiveness of any sequential pattern mining method under a variety of situations. The benchmark evaluates how well a mining method finds known common patterns in synthetic data. Such an evaluation provides a comprehensive understanding of the resulting patterns generated from any mining method empirically. In this paper, the criteria are applied to conduct a detailed comparison study of the support-based sequential pattern model with an approximate pattern model based on sequence alignment. The study suggests that the alignment model will give a good summary of the sequential data in the form of a set of common patterns in the data. In contrast, the support model generates massive amounts of frequent patterns with much redundancy. This suggests that the results of the support model require more post processing before it can be of actual use in real applications. (c) 2006 Elsevier B.V. All rights reserved.
引用
收藏
页码:30 / 50
页数:21
相关论文
共 50 条
  • [21] A Survey on Closed Sequential Pattern Mining
    Raju, V. Purushothama
    Varma, G. P. Saradhi
    2014 INTERNATIONAL CONFERENCE ON INFORMATION COMMUNICATION AND EMBEDDED SYSTEMS (ICICES), 2014,
  • [22] A Taxonomy of Sequential Pattern Mining Algorithms
    Mabroukeh, Nizar R.
    Ezeife, C. I.
    ACM COMPUTING SURVEYS, 2010, 43 (01)
  • [23] TaSPM: Targeted Sequential Pattern Mining
    Huang, Gengsen
    Gan, Wensheng
    Yu, Philip S.
    ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2024, 18 (05)
  • [24] Weighted frequent sequential pattern mining
    Md Ashraful Islam
    Mahfuzur Rahman Rafi
    Al-amin Azad
    Jesan Ahammed Ovi
    Applied Intelligence, 2022, 52 : 254 - 281
  • [25] A Survey of Parallel Sequential Pattern Mining
    Gan, Wensheng
    Lin, Jerry Chun-Wei
    Fournier-Viger, Philippe
    Chao, Han-Chieh
    Yu, Philip S.
    ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2019, 13 (03)
  • [26] Weighted frequent sequential pattern mining
    Islam, Md Ashraful
    Rafi, Mahfuzur Rahman
    Azad, Al-amin
    Ovi, Jesan Ahammed
    APPLIED INTELLIGENCE, 2022, 52 (01) : 254 - 281
  • [27] Fast Weighted Sequential Pattern Mining
    Ye, Zhenqiang
    Li, Ziyang
    Guo, Weibin
    Gan, Wensheng
    Wan, Shicheng
    Chen, Jiahui
    ADVANCES AND TRENDS IN ARTIFICIAL INTELLIGENCE: THEORY AND PRACTICES IN ARTIFICIAL INTELLIGENCE, 2022, 13343 : 807 - 818
  • [28] A Stream Sequential Pattern Mining Model
    Li, Haifeng
    2011 INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND NETWORK TECHNOLOGY (ICCSNT), VOLS 1-4, 2012, : 704 - 707
  • [29] Sequential Pattern Mining - Approaches and Algorithms
    Mooney, Carl H.
    Roddick, John F.
    ACM COMPUTING SURVEYS, 2013, 45 (02)
  • [30] SQUIRE: Sequential pattern mining with quantities
    Kim, Chulyun
    Lim, Jong-Hwa
    Ng, Raymond T.
    Shim, Kyuseok
    JOURNAL OF SYSTEMS AND SOFTWARE, 2007, 80 (10) : 1726 - 1745