Sequential Pattern Mining from Stream Data

Cited by: 0
Authors
Koper, Adam [1 ]
Hung Son Nguyen [1 ]
Affiliations
[1] Univ Warsaw, Inst Math, PL-02097 Warsaw, Poland
Keywords
Stream Sequential Patterns Mining; SS-BE; Prefix Span; support measure;
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Sequential Pattern Mining (SPM) is a Data Mining problem that can be applied to temporal or time series data. This paper concerns SPM algorithms that can work with stream data. We present three new stream SPM methods, called SS-BE2, SS-LC and SS-LC2, which are extensions of SS-BE. Like SS-BE, the proposed methods process fixed-size batches using the Prefix Span algorithm; the critical problems in each step are how to store the huge number of candidate patterns and how to select the frequent patterns properly. The main idea is to improve the tree pruning method of the original SS-BE in order to guarantee high completeness and correctness of the result. In all experiments performed on benchmark data, the proposed solutions outperform the original SS-BE algorithm. Moreover, the proposed algorithms seem to be scalable, as memory usage depends linearly on the number of patterns and the size of the buffer.
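The batch-wise scheme described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' SS-BE, SS-BE2, SS-LC or SS-LC2: the function names `prefixspan` and `mine_stream`, the parameters `batch_min_count` and `prune_per_batch`, the flat dictionary used in place of a pattern tree, and the pruning rule (drop a pattern whose accumulated count falls below `prune_per_batch` times the number of batches seen) are all assumptions made for illustration. Sequences are also simplified to lists of single items rather than itemsets.

```python
from collections import defaultdict


def prefixspan(sequences, min_count):
    """Simplified PrefixSpan: return {pattern: support} for one batch.

    A pattern is a tuple of items; its support is the number of
    sequences that contain it as a subsequence.
    """
    results = {}

    def mine(prefix, projected):
        # projected holds (sequence index, start offset) pairs, i.e. the
        # suffixes that remain after matching the current prefix.
        counts = defaultdict(int)
        for sid, start in projected:
            for item in set(sequences[sid][start:]):
                counts[item] += 1
        for item, count in sorted(counts.items()):
            if count < min_count:
                continue
            pattern = prefix + (item,)
            results[pattern] = count
            new_projected = []
            for sid, start in projected:
                try:
                    pos = sequences[sid].index(item, start)
                except ValueError:
                    continue  # item does not occur in this suffix
                new_projected.append((sid, pos + 1))
            mine(pattern, new_projected)

    mine((), [(i, 0) for i in range(len(sequences))])
    return results


def mine_stream(batches, batch_min_count, prune_per_batch):
    """Accumulate per-batch pattern counts and prune after every batch.

    Assumed pruning rule (hypothetical, not taken from the paper):
    after b batches, drop patterns with total count < prune_per_batch * b.
    """
    totals = defaultdict(int)
    for batch_number, batch in enumerate(batches, start=1):
        for pattern, count in prefixspan(batch, batch_min_count).items():
            totals[pattern] += count
        threshold = prune_per_batch * batch_number
        for pattern in [p for p, c in totals.items() if c < threshold]:
            del totals[pattern]
    return dict(totals)


batches = [
    [["a", "b", "c"], ["a", "c"], ["b", "c"]],
    [["a", "c"], ["a", "c", "b"], ["c"]],
]
frequent = mine_stream(batches, batch_min_count=2, prune_per_batch=2)
```

In this toy run the patterns `("b",)` and `("b", "c")` are frequent only in the first batch and are pruned after the second. Because the dictionary only grows with the number of retained patterns, the sketch also hints at why memory use could scale with the pattern count, although it says nothing about the error guarantees of the published methods.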
Pages: 278-291
Number of pages: 14