Processing sequential patterns in relational databases

被引:0
|
作者
Shang, XQ
Sattler, KU
机构
[1] Univ Magdeburg, Dept Comp Sci, D-39106 Magdeburg, Germany
[2] Tech Univ Ilmenau, Dept Comp Sci & Automat, D-98684 Ilmenau, Germany
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Database integration of data mining has gained popularity and its significance is well recognized. However, the performance of SQL based data mining is known to fall behind specialized implementation since the prohibitive nature of the cost associated with extracting knowledge, as well as the lack of suitable declarative query language support. Recent studies have found that for association rule mining and sequential pattern mining with carefully tuned SQL formulations it is possible to achieve performance comparable to systems that cache the data in files outside the DBMS. However most of the previous pattern mining methods follow the method of Apriori which still encounters problems when a sequential database is large and/or when sequential patterns to be mined axe numerous and long. In this paper, we present a novel SQL based approach that we recently proposed, called Prospad (PROjection Sequential PAttern Discovery). Prospad fundamentally differs from an Apriori-like candidate set generation-and-test approach. This approach is a pattern growth-based approach without candidate generation. It grows longer patterns from shorter ones by successively projecting the sequential table into subsequential tables. Since a projected table for a sequential pattern i contains all and only necessary information for mining the sequential patterns that can grow from i, the size of the projected table usually reduces quickly as mining proceeds to longer patterns. Moreover, avoiding creating and dropping cost of some temporary tables, depth first approach is used to facilitate the projecting process.
引用
收藏
页码:438 / 447
页数:10
相关论文
共 50 条
  • [1] JOIN PROCESSING IN RELATIONAL DATABASES
    MISHRA, P
    EICH, MH
    [J]. COMPUTING SURVEYS, 1992, 24 (01) : 63 - 113
  • [2] Processing XPath expressions in relational databases
    Pankowski, T
    [J]. SOFSEM 2004: THEORY AND PRACTICE OF COMPUTER SCIENCE, PROCEEDINGS, 2004, 2932 : 265 - 276
  • [3] CONCEPTUAL DESIGN PATTERNS FOR RELATIONAL DATABASES
    Vitacolonna, Nicola
    [J]. INFORMATION TECHNOLOGIES' 2011, 2011, : 239 - 246
  • [4] Optimizing runtime XML processing in relational databases
    Kogan, E
    Schaller, G
    Rys, M
    Huu, HH
    Krishnaswamy, B
    [J]. DATABASE AND XML TECHNOLOGIES, PROCEEDINGS, 2005, 3671 : 222 - 236
  • [5] Detecting anomalous access patterns in relational databases
    Kamra, Ashish
    Terzi, Evimaria
    Bertino, Elisa
    [J]. VLDB JOURNAL, 2008, 17 (05): : 1063 - 1077
  • [6] Discovering relational patterns across multiple databases
    Zhu, Xingquan
    Wu, Xindong
    [J]. 2007 IEEE 23RD INTERNATIONAL CONFERENCE ON DATA ENGINEERING, VOLS 1-3, 2007, : 701 - +
  • [7] Detecting anomalous access patterns in relational databases
    Ashish Kamra
    Evimaria Terzi
    Elisa Bertino
    [J]. The VLDB Journal, 2008, 17 : 1063 - 1077
  • [8] Analyzing sequential patterns in retail databases
    Yun, Unil
    [J]. JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2007, 22 (02) : 287 - 296
  • [9] Analyzing Sequential Patterns in Retail Databases
    Unil Yun
    [J]. Journal of Computer Science and Technology, 2007, 22 : 287 - 296
  • [10] A self-processing network model for relational databases
    De-Medonsa, E
    Kraus, S
    Shiftan, Y
    [J]. IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 1999, 29 (02): : 198 - 225