A fast algorithm for mining sequential patterns from large databases

被引:3
|
作者
Chen, N [1 ]
Chen, A
Zhou, LX
Liu, L
机构
[1] Chinese Acad Sci, Acad Math & Syst Sci, Beijing 100080, Peoples R China
[2] Chinese Acad Sci, Inst Policy & Management, Beijing 100080, Peoples R China
关键词
data mining; knowledge discovery; sequential pattern; set operation;
D O I
10.1007/BF02948984
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Mining sequential patterns from large databases has been recognized by many researchers as an attractive task of data mining and knowledge discovery. Previous algorithms scan the databases for many times, which is often unendurable due to the very large amount of databases. In this paper, the authors introduce an effective algorithm for mining sequential patterns from large databases. In the algorithm, the original database is not used at all for counting the support of sequences after the first pass. Rather, a tidlist; structure generated in the previous pass is employed for the purpose based on set intersection operations, avoiding the multiple scans of the databases.
引用
收藏
页码:359 / 370
页数:12
相关论文
共 50 条
  • [31] Using a genetic algorithm for mining patterns from Endgame Databases
    Raboanary, Heriniaina Andry
    Raboanary, Julien Amedee
    Raboanary, Toky Hajatiana
    [J]. AFRICAN CONFERENCE ON SOFTWARE ENGINEERING AND APPLIED COMPUTING (ACSEAC 2012), 2012, : 94 - 100
  • [32] Mining sequential patterns across multiple sequence databases
    Peng, Wen-Chih
    Liao, Zhung-Xun
    [J]. DATA & KNOWLEDGE ENGINEERING, 2009, 68 (10) : 1014 - 1033
  • [33] The MineSP operator for mining sequential patterns in inductive databases
    Benitez-Guerrero, Edgard
    Hernandez-Lopez, Alma-Rosa
    [J]. MICAI 2006: ADVANCES IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2006, 4293 : 684 - +
  • [34] Mining weighted sequential patterns in incremental uncertain databases
    Roy, Kashob Kumar
    Moon, Md Hasibul Haque
    Rahman, Md Mahmudur
    Ahmed, Chowdhury Farhan
    Leung, Carson Kai-Sang
    [J]. INFORMATION SCIENCES, 2022, 582 : 865 - 896
  • [35] Fast discovery of sequential patterns in large databases using effective time-indexing
    Lin, Ming-Yen
    Hsueh, Sue-Chen
    Chang, Chia-Wen
    [J]. INFORMATION SCIENCES, 2008, 178 (22) : 4228 - 4245
  • [36] Incremental update on sequential patterns in large databases
    Lin, MY
    Lee, SY
    [J]. TENTH IEEE INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 1998, : 24 - 31
  • [37] A New Algorithm for Mining Sequential Patterns
    Zhang, Zhuo
    Zhang, Lu
    Zhong, Shaochun
    Guan, Jiwen
    [J]. FIFTH INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY, VOL 2, PROCEEDINGS, 2008, : 625 - +
  • [38] An algorithm for mining generalized sequential patterns
    Ren, JD
    Cheng, YB
    Yang, LL
    [J]. PROCEEDINGS OF THE 2004 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2004, : 1288 - 1292
  • [39] Mining of high utility-probability sequential patterns from uncertain databases
    Zhang, Binbin
    Lin, Jerry Chun-Wei
    Fournier-Viger, Philippe
    Lie, Ting
    [J]. PLOS ONE, 2017, 12 (07):
  • [40] Mining frequent δ-free patterns in large databases
    Hébert, C
    Crémilleux, B
    [J]. DISCOVERY SCIENCE, PROCEEDINGS, 2005, 3735 : 124 - 136