Significance-based discriminative sequential pattern mining

被引:25
|
作者
He, Zengyou [1 ,2 ]
Zhang, Simeng [1 ]
Wu, Jun [3 ]
机构
[1] Dalian Univ Technol, Sch Software, Tuqiang Rd, Dalian, Peoples R China
[2] Key Lab Ubiquitous Network & Serv Software Liaoni, Tuqiang Rd 321, Dalian 116600, Peoples R China
[3] Zunyi Normal Univ, Sch Informat Engn, Zunyi, Peoples R China
关键词
Sequential pattern; Discriminative pattern; Multiple hypothesis testing; Family-wise error rate; False discovery rate;
D O I
10.1016/j.eswa.2018.12.046
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Discriminative sequential patterns are sub-sequences whose occurrences exhibit significant differences across sequential data sets with different class labels. The discovery of such types of patterns has many practical applications in different fields. To date, various algorithms for mining discriminative sequential patterns have been proposed. However, the reported patterns from these methods usually contain many false positives that only hold in the sample data by chance. To alleviate this issue, we put forward the concept of significance-based discriminative sequential pattern mining and a corresponding algorithm DSPM-MTC (Discriminative Sequential Pattern Mining with Multiple Testing Correction). The key idea of DSPM-MTC is to integrate the multiple hypothesis testing correction procedure into the pattern mining process to generate a pattern set with error rate control. To demonstrate the effectiveness of DSPM-MTC, we conduct a series of experiments on real sequential data sets and simulation data sets. The experimental results show that DSPM-MTC can effectively recognize false discoveries to generate a pattern set with statistical quality control. (C) 2018 Elsevier Ltd. All rights reserved.
引用
收藏
页码:54 / 64
页数:11
相关论文
共 50 条
  • [1] Discriminative Sequential Pattern Mining for Software Failure Detection
    Du, Hao
    Su, Yongchi
    Li, Chunping
    [J]. INTERNATIONAL CONFERENCE ON INFORMATICS AND SYSTEMS (INFOS 2016), 2016, : 153 - 158
  • [2] Significance-Based Essential Protein Discovery
    Liu, Yan
    Liang, Hao
    Zou, Quan
    He, Zengyou
    [J]. IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2022, 19 (01) : 633 - 642
  • [3] Mining conditional discriminative sequential patterns
    He, Zengyou
    Zhang, Simeng
    Gu, Feiyang
    Wu, Jun
    [J]. INFORMATION SCIENCES, 2019, 478 : 524 - 539
  • [4] Significance-Based Estimation-of-Distribution Algorithms
    Doerr, Benjamin
    Krejca, Martin S.
    [J]. IEEE TRANSACTIONS ON EVOLUTIONARY COMPUTATION, 2020, 24 (06) : 1025 - 1034
  • [5] Fault diagnosis based on sequential pattern mining
    Hu, Rui-Fei
    Wang, Ling
    Mei, Xiao-Qin
    Luo, Yang
    [J]. Jisuanji Jicheng Zhizao Xitong/Computer Integrated Manufacturing Systems, CIMS, 2010, 16 (07): : 1412 - 1418
  • [6] Sequential Pattern Mining Algorithm Based on Interestingness
    Li, Tao
    Zhang, Shuaichi
    Chen, Hui
    Ren, Yongjun
    Li, Xiang
    Ren, Yongzhen
    [J]. 2018 FIRST INTERNATIONAL COGNITIVE CITIES CONFERENCE (IC3 2018), 2018, : 69 - 74
  • [7] Algorithms for Context Based Sequential Pattern Mining
    Ziembinski, Radoslaw
    [J]. FUNDAMENTA INFORMATICAE, 2007, 76 (04) : 495 - 510
  • [8] Discriminative learning in sequential pattern recognition
    He, Xiaodong
    Deng, Li
    Chou, Wu
    [J]. IEEE SIGNAL PROCESSING MAGAZINE, 2008, 25 (05) : 14 - 36
  • [9] Energy Efficiency through Significance-Based Computing
    Nikolopoulos, Dimitrios S.
    Vandierendonck, Hans
    Bellas, Nikolaos
    Antonopoulos, Christos D.
    Lalis, Spyros
    Karakonstantis, Georgios
    Burg, Andreas
    Naumann, Uwe
    [J]. COMPUTER, 2014, 47 (07) : 82 - 85
  • [10] Significance-based Estimation-of-Distribution Algorithms
    Doerr, Benjamin
    Krejca, Martin S.
    [J]. GECCO'18: PROCEEDINGS OF THE 2018 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE, 2018, : 1483 - 1490