An efficient approach to discovering knowledge from large databases

被引：0

作者：

Yen, SJ

Chen, ALP

机构：

来源：

PROCEEDINGS OF THE FOURTH INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED INFORMATION SYSTEMS | 1996年

关键词：

D O I：

暂无

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

In this paper, we study two problems: mining association rules and mining sequential patterns in a large database of customer transactions. The problem of mining association rules focuses on discovering large itemsets where a large itemset is a group of items which appear together in a sufficient number of transactions; while the problem of mining sequential patterns focuses on discovering large sequences where a large sequence is an ordered list of sets of items which appear in a sufficient number of transactions. We present efficient graph-based algorithms to solve these problems. The algorithms construct an association graph to indicate the associations between items and then traverse the graph to generate large itemsets and large sequences, respectively. Our algorithms need to scan the database only once. Empirical evaluations show that our algorithms outperform other algorithms which need to make multiple passes over the database.

引用

页码：8 / 18

页数：11

共 50 条

[31] Discovering representative models in large time series databases
Rombo, S
Terracina, G
[J]. FLEXIBLE QUERY ANSWERING SYSTEMS, PROCEEDINGS, 2004, 3055 : 84 - 97
[32] Discovering and updating rules from databases
Faye, A
Giacometti, A
Laurent, D
Spyratos, N
[J]. COMPUTING ANTICIPATORY SYSTEMS, 1999, 465 : 231 - 243
[33] Knowledge discovery in large spatial databases: Focusing techniques for efficient class identification
Ester, M
Kriegel, HP
Xu, XW
[J]. ADVANCES IN SPATIAL DATABASES, 1995, 951 : 67 - 82
[34] EISA: An Efficient Information Theoretical Approach to Value Segmentation in Large Databases
Wang, Weiqing
Sadiq, Shazia
Zhou, Xiaofang
[J]. WEB TECHNOLOGIES AND APPLICATIONS, APWEB 2014, 2014, 8709 : 224 - 235
[35] An efficient sampling approach for mining all association rules in large databases
Department of Computer Science and Engineering, Shiraz University, Shiraz, Iran
[J]. Iran. J. Electr. Comput. Eng., 2008, 1 (73-78):
[36] Developing an efficient knowledge discovering model for mining fuzzy multi-level sequential patterns in sequence databases
Huang, Tony Cheng-Kui
[J]. FUZZY SETS AND SYSTEMS, 2009, 160 (23) : 3359 - 3381
[37] Developing an Efficient Knowledge Discovering Model for Mining Fuzzy Multi-level Sequential Patterns in Sequence Databases
Huang, Tony Cheng-kui
[J]. 2009 INTERNATIONAL CONFERENCE ON NEW TRENDS IN INFORMATION AND SERVICE SCIENCE (NISS 2009), VOLS 1 AND 2, 2009, : 362 - 371
[38] Discovering fuzzy clusters in databases using an evolutionary approach
Chung, LLH
Chan, KCC
Leung, H
[J]. DATA MINING AND KNOWLEDGE DISCOVERY: THEORY, TOOLS, AND TECHNOLOGY II, 2000, 4057 : 11 - 21
[39] Discovering patterns of medical practice in large administrative health databases
Semenova, T
[J]. DATA & KNOWLEDGE ENGINEERING, 2004, 51 (02) : 149 - 160
[40] FIT: A fast algorithm for discovering frequent itemsets in large databases
Luo, J
Rajasekaran, S
[J]. KNOWLEDGE-BASED INTELLIGENT INFORMATION AND ENGINEERING SYSTEMS, PT 1, PROCEEDINGS, 2004, 3213 : 189 - 195

← 1 2 3 4 5 →