Mining sequential patterns from data streams: a centroid approach

Authors
Alice Marascu
Florent Masseglia
Affiliation
[1] INRIA Sophia Antipolis
Keywords
Data streams; Sequential patterns; Web usage mining; Clustering; Sequences alignment
Abstract
In recent years, emerging applications have introduced new constraints for data mining methods. These constraints are typical of a new kind of data: data streams. In data stream processing, memory usage is restricted, new elements are generated continuously and must be processed in linear time, no blocking operator can be used, and the data can be examined only once. So far, only a few methods have been proposed for mining sequential patterns in data streams. We argue that the main reason is the combinatorial explosion inherent in sequential pattern mining. In this paper, we propose an algorithm based on sequence alignment for mining approximate sequential patterns in Web usage data streams. To meet the one-scan constraint, we propose a greedy clustering algorithm coupled with an alignment method. We show that our proposal is able to extract relevant sequences with very low thresholds.
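To make the one-scan idea concrete, here is a minimal Python sketch of greedy single-pass clustering over a stream of sequences. It is not the authors' algorithm: the abstract does not specify the similarity measure or how a cluster's centroid is built, so this sketch assumes a normalized longest-common-subsequence score as a stand-in for alignment and uses the first sequence of each cluster as its representative. The names `lcs_length`, `similarity`, `greedy_cluster`, and the `threshold` parameter are all hypothetical.

```python
from typing import Dict, Iterable, List, Sequence


def lcs_length(a: Sequence[str], b: Sequence[str]) -> int:
    """Length of the longest common subsequence of a and b (row-by-row DP)."""
    prev = [0] * (len(b) + 1)
    for x in a:
        curr = [0]
        for j, y in enumerate(b, start=1):
            if x == y:
                curr.append(prev[j - 1] + 1)
            else:
                curr.append(max(prev[j], curr[j - 1]))
        prev = curr
    return prev[-1]


def similarity(a: Sequence[str], b: Sequence[str]) -> float:
    """Assumed similarity: 2 * LCS / (len(a) + len(b)), a value in [0, 1]."""
    if not a and not b:
        return 1.0
    return 2 * lcs_length(a, b) / (len(a) + len(b))


def greedy_cluster(stream: Iterable[Sequence[str]],
                   threshold: float = 0.5) -> List[Dict]:
    """Assign each sequence to a cluster in a single pass over the stream.

    Only a representative and a counter are kept per cluster, so each
    sequence is examined once and then discarded.
    """
    clusters: List[Dict] = []
    for seq in stream:
        best, best_sim = None, threshold
        for cluster in clusters:
            sim = similarity(seq, cluster["rep"])
            if sim >= best_sim:
                best, best_sim = cluster, sim
        if best is None:
            # No cluster is similar enough: open a new one (assumption:
            # the first sequence serves as the cluster representative).
            clusters.append({"rep": list(seq), "count": 1})
        else:
            best["count"] += 1
    return clusters


if __name__ == "__main__":
    # Hypothetical navigation sequences (page identifiers).
    sessions = [["a", "b", "c"], ["a", "b", "d"], ["x", "y"],
                ["a", "c"], ["x", "y", "z"]]
    for cluster in greedy_cluster(sessions, threshold=0.5):
        print(cluster["rep"], cluster["count"])
```

In a centroid-based approach, the fixed representative would presumably be replaced by a summary updated through alignment as sequences join the cluster; that update step is left out here because the abstract does not describe it.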
Pages: 291-307
Page count: 16