Discovering knowledge from large databases using prestored information

被引:8
|
作者
Tsai, PSM
Chen, CM
机构
[1] Ming Hsin Inst Technol, Dept Informat Management, Hsinchu 304, Taiwan
[2] Silicon Integrated Syst Corp, Hsinchu, Taiwan
关键词
knowledge discovery; data mining; association rules; sequential patterns;
D O I
10.1016/S0306-4379(01)00006-0
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we examine the two issues of mining association rules and mining sequential patterns in a large database of sales transactions. The problems of mining association rules and mining sequential patterns focus on discovering large itemsets and large sequences, respectively. We present PSI and PSI-seq for efficient large itemsets generation and large sequences generation, respectively. The main ideas of these two algorithms are using prestored information to minimize the numbers of candidate itemsets and candidate sequences counted in each database scan. The prestored informations for PSI and PSI-seq include the itemsets and the sequences along with their support counts found in the last mining, respectively. Typically a user may require to tune the value of the minimum support many times before a set of useful association rules can be obtained from the transaction database. Using prestored information, the total computation time will be reduced effectively. Empirical results show that our approaches outperform previous methods by an order of magnitude, using little storage space for the prestored information.(C) 2001 Published by Elsevier Science Ltd.
引用
收藏
页码:1 / 14
页数:14
相关论文
共 50 条
  • [11] Discovering knowledge in corporate databases
    Yoon, Y
    INFORMATION SYSTEMS MANAGEMENT, 1999, 16 (02) : 64 - 71
  • [12] Discovering knowledge in corporate databases
    Department of Information Systems, Virginia Commonwealth University, Richmond, VA, United States
    Inf. Syst. Manage., 2 (64-71):
  • [13] Discovering causality in large databases
    Zhang, SC
    Zhang, ZG
    APPLIED ARTIFICIAL INTELLIGENCE, 2002, 16 (05) : 333 - 358
  • [14] Discovering Association Rules Change from Large Databases
    Ye, Feiyue
    Liu, Jixue
    Qian, Jin
    Shi, Yuxi
    ARTIFICIAL INTELLIGENCE AND COMPUTATIONAL INTELLIGENCE, PT I, 2011, 7002 : 388 - +
  • [15] Foreword to articles on - "From information to knowledge using astronomical databases"
    Murtagh, F
    Egret, D
    COMPUTER PHYSICS COMMUNICATIONS, 2000, 127 (2-3) : 175 - 176
  • [16] Fast algorithm to discovering sequential patterns from large databases
    Hu Huirong
    PROCEEDINGS OF THE 24TH CHINESE CONTROL CONFERENCE, VOLS 1 AND 2, 2005, : 1352 - 1355
  • [17] Combining experts' knowledge with information from databases
    Low, BT
    Wang, PZ
    IC-AI'2000: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 1-III, 2000, : 399 - 404
  • [18] Discovering Association Rules in Large, Dense Databases
    Teusan, Tudor
    Nachouki, Gilles
    Briand, Henri
    Philippe, Jacques
    LECTURE NOTES IN COMPUTER SCIENCE <D>, 2000, 1910 : 638 - 645
  • [19] RETRIEVAL OF REACTION INFORMATION FROM LARGE DATABASES
    GRETHE, G
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 1995, 210 : 66 - CINF
  • [20] Knowledge discovery from very large databases using frequent concept lattices
    Waiyamai, K
    Lakhal, L
    MACHINE LEARNING: ECML 2000, 2000, 1810 : 437 - 445