Discovering Patterns from Large and Dynamic Sequential Data

Cited by: 7
Author
Wang K. [1]
Affiliation
[1] Dept. of Info. Syst. and Comp. Sci., National University of Singapore, Singapore, 119260, Lower Kent Ridge Road
Keywords
Combinatorial pattern matching; Data mining; Sequential pattern; Suffix tree; Update
DOI
10.1023/A:1008689103430
Abstract
Most daily and scientific data are sequential in nature. Discovering important patterns from such data can benefit the user and scientist by predicting coming activities, interpreting recurring phenomena, extracting outstanding similarities and differences for close attention, compressing data, and detecting intrusion. We consider the following incremental discovery problem for large and dynamic sequential data. Suppose that patterns were previously discovered and materialized. An update is made to the sequential database. An incremental discovery will take advantage of discovered patterns and compute only the change by accessing the affected part of the database and data structures. In addition to patterns, the statistics and position information of patterns need to be updated to allow further analysis and processing on patterns. We present an efficient algorithm for the incremental discovery problem. The algorithm is applied to sequential data that honors several sequential patterns modeling weather changes in Singapore. The algorithm finds what it is supposed to find. Experiments show that for small updates and large databases, the incremental discovery algorithm runs in time independent of the data size.
Pages: 33-56
Page count: 23
Related papers
50 in total
  • [1] Fast algorithm to discovering sequential patterns from large databases
    Hu Huirong
    [J]. PROCEEDINGS OF THE 24TH CHINESE CONTROL CONFERENCE, VOLS 1 AND 2, 2005, : 1352 - 1355
  • [2] Discovering Novelty in Gene Data: From Sequential Patterns to Visualization
    Sallaberry, Arnaud
    Pecheur, Nicolas
    Bringay, Sandra
    Roche, Mathieu
    Teisseire, Maguelonne
    [J]. ADVANCES IN VISUAL COMPUTING, PT III, 2010, 6455 : 534 - +
  • [3] DISCOVERING SEQUENTIAL PATTERNS FROM SMART PHONE PERSONAL HEALTH DATA
    Guo, S. H.
    Cui, W.
    [J]. BASIC & CLINICAL PHARMACOLOGY & TOXICOLOGY, 2016, 118 : 95 - 95
  • [4] An Efficient Approach to Discovering Sequential Patterns in Large Databases
    Yen, Show-Jane
    Cho, Chung-Wen
    [J]. LECTURE NOTES IN COMPUTER SCIENCE, 2000, 1910 : 685 - 690
  • [5] A data mining approach to discovering reliable sequential patterns
    Shyur, Huan-Jyh
    Jou, Chichang
    Chang, Keng
    [J]. JOURNAL OF SYSTEMS AND SOFTWARE, 2013, 86 (08) : 2196 - 2203
  • [6] Discovering Sequential Patterns from Medical Datasets
    AlZahrani, Mohammed Y.
    Mazarbhuiya, Fokrul Alom
    [J]. 2016 INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE & COMPUTATIONAL INTELLIGENCE (CSCI), 2016, : 70 - 74
  • [7] Discovering closed and maximal embedded patterns from large tree data
    Wu, Xiaoying
    Theodoratos, Dimitri
    Mamoulis, Nikos
    [J]. DATA & KNOWLEDGE ENGINEERING, 2021, 133
  • [8] Discovering dynamic patterns from infectious disease data using dynamic mode decomposition
    Proctor, Joshua L.
    Eckhoff, Philip A.
    [J]. INTERNATIONAL HEALTH, 2015, 7 (02) : 139 - 145
  • [9] An efficient data mining technique for discovering interesting sequential patterns
    Yen, SJ
    Lee, YS
    [J]. 2001 IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2001, : 663 - 664
  • [10] Discovering recency, frequency, and monetary (RFM) sequential patterns from customers' purchasing data
    Chen, Yen-Liang
    Kuo, Mi-Hao
    Wu, Shin-Yi
    Tang, Kwei
    [J]. ELECTRONIC COMMERCE RESEARCH AND APPLICATIONS, 2009, 8 (05) : 241 - 251