Discovering Patterns from Large and Dynamic Sequential Data

被引:5
|
作者
Wang K. [1 ]
机构
[1] Dept. of Info. Syst. and Comp. Sci., National University of Singapore, Singapore, 119260, Lower Kent Ridge Road
关键词
Combinatorial pattern matching; Data mining; Sequential pattern; Suffix tree; Update;
D O I
10.1023/A:1008689103430
中图分类号
学科分类号
摘要
Most daily and scientific data are sequential in nature. Discovering important patterns from such data can benefit the user and scientist by predicting coming activities, interpreting recurring phenomena, extracting outstanding similarities and differences for close attention, compressing data, and detecting intrusion. We consider the following incremental discovery problem for large and dynamic sequential data. Suppose that patterns were previously discovered and materialized. An update is made to the sequential database. An incremental discovery will take advantage of discovered patterns and compute only the change by accessing the affected part of the database and data structures. In addition to patterns, the statistics and position information of patterns need to be updated to allow further analysis and processing on patterns. We present an efficient algorithm for the incremental discovery problem. The algorithm is applied to sequential data that honors several sequential patterns modeling weather changes in Singapore. The algorithm finds what it is supposed to find. Experiments show that for small updates and large databases, the incremental discovery algorithm runs in time independent of the data size.
引用
收藏
页码:33 / 56
页数:23
相关论文
共 50 条
  • [11] Discovering recency, frequency, and monetary (RFM) sequential patterns from customers' purchasing data
    Chen, Yen-Liang
    Kuo, Mi-Hao
    Wu, Shin-Yi
    Tang, Kwei
    ELECTRONIC COMMERCE RESEARCH AND APPLICATIONS, 2009, 8 (05) : 241 - 251
  • [12] Discovering Sequential Patterns by Neural Networks
    Nowak, Jakub
    Korytkowski, Marcin
    Scherer, Rafal
    2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
  • [13] Discovering maximum sequential patterns efficiently
    Lu, S
    Li, CP
    DMIN '05: PROCEEDINGS OF THE 2005 INTERNATIONAL CONFERENCE ON DATA MINING, 2005, : 271 - 274
  • [14] Efficiently Discovering Most-Specific Mixed Patterns from Large Data Trees
    Wu, Xiaoying
    Theodoratos, Dimitri
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS (DASFAA 2017), PT I, 2017, 10177 : 279 - 294
  • [15] Discovering α-patterns from gene expression data
    Rodriguez Baena, Domingo S.
    Diaz Diaz, Norberto
    Aguilar Ruiz, Jesus S.
    Nepomuceno Chamorro, Isabel
    INTELLIGENT DATA ENGINEERING AND AUTOMATED LEARNING - IDEAL 2007, 2007, 4881 : 831 - +
  • [16] Discovering Novelty in Sequential Patterns: application for analysis of microarray data on Alzheimer disease
    Sandra, Bringay
    Mathieu, Roche
    Maguelonne, Teisseire
    Pascal, Poncelet
    Ronza, Abdel Rassoul
    Jean-Michel, Verdier
    Gina, Devau
    MEDINFO 2010, PTS I AND II, 2010, 160 : 1314 - 1318
  • [17] An Efficient Approach for Discovering and Maintaining Sequential Patterns
    Yen, Show - Jane
    Lee, Yue -shi
    JOURNAL OF INFORMATION SCIENCE AND ENGINEERING, 2024, 40 (01) : 201 - 213
  • [18] Discovering Sequential Rental Patterns by Fleet Tracking
    Jiang, Xinxin
    Peng, Xueping
    Long, Guodong
    DATA SCIENCE, 2015, 9208 : 42 - 49
  • [19] Constraint relaxations for discovering unknown sequential patterns
    Antunes, C
    Oliveira, AL
    KNOWLEDGE DISCOVERY IN INDUCTIVE DATABASES, 2005, 3377 : 11 - 32
  • [20] Discovering sequential patterns with various time constraints
    Song, Wei
    Yang, Kai
    Journal of Computational Information Systems, 2013, 9 (15): : 6047 - 6054