Mining sequential patterns from data streams: a centroid approach

Cited by: 0
Authors
Alice Marascu
Florent Masseglia
Affiliation
[1] INRIA Sophia Antipolis,
Keywords
Data streams; Sequential patterns; Web usage mining; Clustering; Sequences alignment;
DOI
Not available
Abstract
In recent years, emerging applications have introduced new constraints for data mining methods. These constraints are typical of a new kind of data: data streams. In data stream processing, memory usage is restricted, new elements are generated continuously and must be processed in linear time, no blocking operator can be used, and the data can be examined only once. So far, only a few methods have been proposed for mining sequential patterns in data streams. We argue that the main reason is the combinatorial explosion associated with sequential pattern mining. In this paper, we propose an algorithm based on sequence alignment for mining approximate sequential patterns in Web usage data streams. To meet the one-scan constraint, we propose a greedy clustering algorithm combined with an alignment method. We show that our proposal is able to extract relevant sequences with very low thresholds.
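The abstract does not spell out the clustering or alignment steps, so the following is only a minimal sketch of the general idea of one-pass greedy clustering of sequences. Everything in it is an assumption made for illustration: the function names (`lcs_length`, `similarity`, `greedy_cluster`), the similarity measure based on the longest common subsequence, the `threshold` parameter, and the use of a cluster's first sequence as its representative in place of a centroid built by alignment.

```python
from typing import Dict, Iterable, List, Sequence


def lcs_length(a: Sequence[str], b: Sequence[str]) -> int:
    """Length of the longest common subsequence of a and b."""
    prev = [0] * (len(b) + 1)
    for x in a:
        curr = [0]
        for j, y in enumerate(b, start=1):
            if x == y:
                curr.append(prev[j - 1] + 1)
            else:
                curr.append(max(prev[j], curr[j - 1]))
        prev = curr
    return prev[-1]


def similarity(a: Sequence[str], b: Sequence[str]) -> float:
    """Assumed similarity in [0, 1]: normalized LCS length."""
    if not a or not b:
        return 0.0
    return 2 * lcs_length(a, b) / (len(a) + len(b))


def greedy_cluster(stream: Iterable[Sequence[str]],
                   threshold: float = 0.5) -> List[Dict]:
    """Assign each sequence to a cluster in a single pass over the stream.

    Each sequence is compared with the representative of every existing
    cluster and joins the most similar one if the similarity reaches
    `threshold`; otherwise it starts a new cluster. The representative is
    simply the first sequence of the cluster (a simplification, not an
    aligned centroid).
    """
    clusters: List[Dict] = []
    for seq in stream:
        best, best_sim = None, -1.0
        for cluster in clusters:
            sim = similarity(seq, cluster["representative"])
            if sim > best_sim:  # strict: ties go to the earliest cluster
                best, best_sim = cluster, sim
        if best is not None and best_sim >= threshold:
            best["members"].append(list(seq))
        else:
            clusters.append({"representative": list(seq),
                             "members": [list(seq)]})
    return clusters


# Hypothetical navigation sequences (page identifiers are made up).
sessions = [["a", "b", "c"], ["a", "b", "d"], ["x", "y"],
            ["a", "c"], ["x", "y", "z"]]
result = greedy_cluster(sessions, threshold=0.5)
```

Each sequence is read once and compared only with the current cluster representatives, which is what keeps the sketch compatible with a one-scan constraint; the quality of the clusters depends on the arrival order, as with any greedy scheme.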
Pages: 291-307
Page count: 16