PSP-AMS: Progressive Mining of Sequential Patterns Across Multiple Streams

被引:12
|
作者
Jaysawal, Bijay Prasad [1 ]
Huang, Jen-Wei [1 ]
机构
[1] Natl Cheng Kung Univ, Inst Comp & Commun Engn, 1 Univ Rd, Tainan 701, Taiwan
关键词
Progressive mining; sequential patterns; multiple data streams; across data streams; across-streams sequential patterns; SLIDING WINDOW;
D O I
10.1145/3281632
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Sequential pattern mining is used to find frequent data sequences over time. When sequential patterns are generated, the newly arriving patterns may not be identified as frequent sequential patterns due to the existence of old data and sequences. Progressive sequential pattern mining aims to find the most up-to-date sequential patterns given that obsolete items will be deleted from the sequences. When sequences come with multiple data streams, it is difficult to maintain and update the current sequential patterns. Even worse, when we consider the sequences across multiple streams, previous methods cannot efficiently compute the frequent sequential patterns. In this work, we propose an efficient algorithm PSP-AMS to address this problem. PSP-AMS uses a novel data structure PSP-MS-tree to insert new items, update current items, and delete obsolete items. By maintaining a PSP-MS-tree, PSP-AMS efficiently finds the frequent sequential patterns across multiple streams. The experimental results show that PSP-AMS significantly outperforms previous algorithms for mining of progressive sequential patterns across multiple streams on synthetic data as well as real data.
引用
收藏
页数:23
相关论文
共 50 条
  • [1] Incremental Mining of Across-streams Sequential Patterns in Multiple Data Streams
    Yang, Shih-Yang
    Chao, Ching-Ming
    Chen, Po-Zung
    Sun, Chu-Hao
    [J]. JOURNAL OF COMPUTERS, 2011, 6 (03) : 449 - 457
  • [2] The PSP approach for mining sequential patterns
    Masseglia, F
    Cathala, F
    Poncelet, P
    [J]. PRINCIPLES OF DATA MINING AND KNOWLEDGE DISCOVERY, 1998, 1510 : 176 - 184
  • [3] Incremental mining of closed sequential patterns in multiple data streams
    Yang S.-Y.
    Chao C.-M.
    Chen P.-Z.
    Sun C.-H.
    [J]. Journal of Networks, 2011, 6 (05) : 728 - 735
  • [4] Mining sequential patterns across multiple sequence databases
    Peng, Wen-Chih
    Liao, Zhung-Xun
    [J]. DATA & KNOWLEDGE ENGINEERING, 2009, 68 (10) : 1014 - 1033
  • [5] PTree: Mining sequential patterns efficiently in multiple data streams environment
    [J]. 1600, Institute of Information Science (29):
  • [6] PTree: Mining Sequential Patterns Efficiently in Multiple Data Streams Environment
    Lee, Guanling
    Chen, Yi-Chun
    Hung, Kuo-Che
    [J]. JOURNAL OF INFORMATION SCIENCE AND ENGINEERING, 2013, 29 (06) : 1151 - 1169
  • [7] Sequential pattern mining in multiple streams
    Chen, G
    Wu, XD
    Zhu, XQ
    [J]. Fifth IEEE International Conference on Data Mining, Proceedings, 2005, : 585 - 588
  • [9] Mining Closed Sequential Patterns in Progressive Databases
    Subramanyam, R. B. V.
    Rao, A. Suresh
    Karnati, Ramesh
    Suvvari, Somaraju
    Somayajulu, D. V. L. N.
    [J]. JOURNAL OF INFORMATION & KNOWLEDGE MANAGEMENT, 2013, 12 (03)
  • [10] Mining multidimensional sequential patterns over data streams
    Raissi, Chedy
    Plantevit, Marc
    [J]. DATA WAREHOUSING AND KNOWLEDGE DISCOVERY, PROCEEDINGS, 2008, 5182 : 263 - 272