An efficient approach to discovering knowledge from large databases

被引:0
|
作者
Yen, SJ
Chen, ALP
机构
关键词
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we study two problems: mining association rules and mining sequential patterns in a large database of customer transactions. The problem of mining association rules focuses on discovering large itemsets where a large itemset is a group of items which appear together in a sufficient number of transactions; while the problem of mining sequential patterns focuses on discovering large sequences where a large sequence is an ordered list of sets of items which appear in a sufficient number of transactions. We present efficient graph-based algorithms to solve these problems. The algorithms construct an association graph to indicate the associations between items and then traverse the graph to generate large itemsets and large sequences, respectively. Our algorithms need to scan the database only once. Empirical evaluations show that our algorithms outperform other algorithms which need to make multiple passes over the database.
引用
收藏
页码:8 / 18
页数:11
相关论文
共 50 条
  • [31] Discovering representative models in large time series databases
    Rombo, S
    Terracina, G
    [J]. FLEXIBLE QUERY ANSWERING SYSTEMS, PROCEEDINGS, 2004, 3055 : 84 - 97
  • [32] Discovering and updating rules from databases
    Faye, A
    Giacometti, A
    Laurent, D
    Spyratos, N
    [J]. COMPUTING ANTICIPATORY SYSTEMS, 1999, 465 : 231 - 243
  • [33] Knowledge discovery in large spatial databases: Focusing techniques for efficient class identification
    Ester, M
    Kriegel, HP
    Xu, XW
    [J]. ADVANCES IN SPATIAL DATABASES, 1995, 951 : 67 - 82
  • [34] EISA: An Efficient Information Theoretical Approach to Value Segmentation in Large Databases
    Wang, Weiqing
    Sadiq, Shazia
    Zhou, Xiaofang
    [J]. WEB TECHNOLOGIES AND APPLICATIONS, APWEB 2014, 2014, 8709 : 224 - 235
  • [35] An efficient sampling approach for mining all association rules in large databases
    Department of Computer Science and Engineering, Shiraz University, Shiraz, Iran
    [J]. Iran. J. Electr. Comput. Eng., 2008, 1 (73-78):
  • [36] Developing an efficient knowledge discovering model for mining fuzzy multi-level sequential patterns in sequence databases
    Huang, Tony Cheng-Kui
    [J]. FUZZY SETS AND SYSTEMS, 2009, 160 (23) : 3359 - 3381
  • [37] Developing an Efficient Knowledge Discovering Model for Mining Fuzzy Multi-level Sequential Patterns in Sequence Databases
    Huang, Tony Cheng-kui
    [J]. 2009 INTERNATIONAL CONFERENCE ON NEW TRENDS IN INFORMATION AND SERVICE SCIENCE (NISS 2009), VOLS 1 AND 2, 2009, : 362 - 371
  • [38] Discovering fuzzy clusters in databases using an evolutionary approach
    Chung, LLH
    Chan, KCC
    Leung, H
    [J]. DATA MINING AND KNOWLEDGE DISCOVERY: THEORY, TOOLS, AND TECHNOLOGY II, 2000, 4057 : 11 - 21
  • [39] Discovering patterns of medical practice in large administrative health databases
    Semenova, T
    [J]. DATA & KNOWLEDGE ENGINEERING, 2004, 51 (02) : 149 - 160
  • [40] FIT: A fast algorithm for discovering frequent itemsets in large databases
    Luo, J
    Rajasekaran, S
    [J]. KNOWLEDGE-BASED INTELLIGENT INFORMATION AND ENGINEERING SYSTEMS, PT 1, PROCEEDINGS, 2004, 3213 : 189 - 195