Discovering knowledge from large databases using prestored information

被引:8
|
作者
Tsai, PSM
Chen, CM
机构
[1] Ming Hsin Inst Technol, Dept Informat Management, Hsinchu 304, Taiwan
[2] Silicon Integrated Syst Corp, Hsinchu, Taiwan
关键词
knowledge discovery; data mining; association rules; sequential patterns;
D O I
10.1016/S0306-4379(01)00006-0
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we examine the two issues of mining association rules and mining sequential patterns in a large database of sales transactions. The problems of mining association rules and mining sequential patterns focus on discovering large itemsets and large sequences, respectively. We present PSI and PSI-seq for efficient large itemsets generation and large sequences generation, respectively. The main ideas of these two algorithms are using prestored information to minimize the numbers of candidate itemsets and candidate sequences counted in each database scan. The prestored informations for PSI and PSI-seq include the itemsets and the sequences along with their support counts found in the last mining, respectively. Typically a user may require to tune the value of the minimum support many times before a set of useful association rules can be obtained from the transaction database. Using prestored information, the total computation time will be reduced effectively. Empirical results show that our approaches outperform previous methods by an order of magnitude, using little storage space for the prestored information.(C) 2001 Published by Elsevier Science Ltd.
引用
收藏
页码:1 / 14
页数:14
相关论文
共 50 条
  • [1] An efficient approach to discovering knowledge from large databases
    Yen, SJ
    Chen, ALP
    [J]. PROCEEDINGS OF THE FOURTH INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED INFORMATION SYSTEMS, 1996, : 8 - 18
  • [2] Hierarchical analysis for discovering knowledge in large databases
    Pai, WC
    [J]. INFORMATION SYSTEMS MANAGEMENT, 2003, 21 (01) : 81 - 88
  • [3] Discovering knowledge from medical databases
    Wong, ML
    Lam, W
    Leung, KS
    Cheng, JCY
    [J]. WORLD MULTICONFERENCE ON SYSTEMICS, CYBERNETICS AND INFORMATICS, VOL 8, PROCEEDINGS: CONCEPTS AND APPLICATIONS OF SYSTEMICS, CYBERNETICS AND INFORMATICS, 1999, : 241 - 246
  • [4] Discovering knowledge from noisy databases using genetic programming
    Wong, Man Leung
    Leung, Kwong Sak
    Cheng, Jack C. Y.
    [J]. Journal of the American Society for Information Science and Technology, 2000, 51 (09): : 870 - 881
  • [5] Discovering knowledge from medical databases using evolutionary algorithms
    Wong, ML
    Lam, W
    Leung, KS
    Ngan, PS
    Cheng, JCY
    [J]. IEEE ENGINEERING IN MEDICINE AND BIOLOGY MAGAZINE, 2000, 19 (04): : 45 - 55
  • [6] Discovering knowledge from noisy databases using genetic programming
    Wong, ML
    Leung, KS
    Cheng, JCY
    [J]. JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE, 2000, 51 (09): : 870 - 881
  • [7] A novel approach for discovering retail knowledge with price information from transaction databases
    Chen, Yen-Liang
    Huang, Tony Cheng-Kul
    Chang, Sih-Kai
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2008, 34 (04) : 2350 - 2359
  • [8] Discovering Robust Knowledge from Databases that Change
    Chun-Nan Hsu
    Craig A. Knoblock
    [J]. Data Mining and Knowledge Discovery, 1998, 2 : 69 - 95
  • [9] Discovering robust knowledge from databases that change
    Hsu, CN
    Knoblock, CA
    [J]. DATA MINING AND KNOWLEDGE DISCOVERY, 1998, 2 (01) : 69 - 95
  • [10] Discovering fuzzy functional dependencies as semantic knowledge in large databases
    Wang, X
    Chen, GQ
    [J]. SHAPING BUSINESS STRATEGY IN A NETWORKED WORLD, VOLS 1 AND 2, PROCEEDINGS, 2004, : 1136 - 1139