Perfect hashing schemes for mining traversal patterns

Authors
Chang, CC
Lin, CY
Chou, H
Affiliations
[1] Feng Chia Univ, Dept Informat Engn & Comp Sci, Taichung 40724, Taiwan
[2] Natl Chung Cheng Univ, Dept Comp Sci & Informat Engn, Chiayi 621, Taiwan
Keywords
data mining; traversal patterns; perfect hashing; performance analysis
DOI
Not available
Chinese Library Classification
TP31 [Computer Software]
Discipline classification code
081202; 0835
Abstract
Hashing schemes are a common technique for improving performance when mining not only association rules but also sequential patterns and traversal patterns. However, collisions in hash schemes can cause severe performance degradation. In this paper, we propose perfect hashing schemes for mining traversal patterns that avoid collisions in the hash table. The main idea is to transform each large itemset into one large 2-itemset by employing a delicate encoding scheme. Perfect hash schemes designed only for itemsets of length two, rather than for varied lengths, are then applied. The experimental results show that our method is more than twice as fast as the FS algorithm and that it scales well with database size. One variant of our perfect hash scheme, called partial hash, is proposed to cope with the enormous memory space required by typical perfect hash functions. We also compare the performance of the different perfect hash variants and investigate their properties.
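The abstract does not spell out the encoding, so the following is only a minimal sketch of the general idea under stated assumptions, not the authors' scheme. It assumes that each large (k-1)-itemset receives an integer ID, that a candidate k-itemset is encoded as the pair of IDs of the two (k-1)-subsets sharing its first k-2 items (the usual Apriori-style join), and that the pair is then placed with a triangular index, which is a standard collision-free mapping for unordered pairs. The names `pair_index` and `encode_candidate` are hypothetical, and the sketch treats patterns as unordered itemsets, following the abstract's wording.

```python
def pair_index(i: int, j: int, n: int) -> int:
    """Collision-free slot for the unordered pair {i, j} with 0 <= i < j < n.

    Triangular indexing: the n*(n-1)/2 possible pairs map one-to-one onto
    the slots 0 .. n*(n-1)/2 - 1, so no two pairs share a slot.
    """
    if i > j:
        i, j = j, i
    if not (0 <= i < j < n):
        raise ValueError("need two distinct IDs in range(n)")
    return i * (2 * n - i - 1) // 2 + (j - i - 1)


def encode_candidate(candidate, ids):
    """Encode a k-itemset (k >= 3 here) as a 2-itemset of (k-1)-itemset IDs.

    Assumption (not from the paper): the two generating subsets are the
    ones that share the first k-2 items of the sorted candidate.
    """
    c = tuple(sorted(candidate))
    left = c[:-1]               # drop the last item
    right = c[:-2] + c[-1:]     # drop the second-to-last item
    return ids[left], ids[right]


# Hypothetical large 2-itemsets; IDs follow sorted order.
large_2 = [("a", "b"), ("a", "c"), ("a", "d"), ("b", "c")]
ids = {s: k for k, s in enumerate(sorted(large_2))}
n = len(ids)

# One counter per possible ID pair; no collision handling is needed.
table = [0] * (n * (n - 1) // 2)
for cand in [("a", "b", "c"), ("a", "b", "d"), ("a", "c", "d")]:
    i, j = encode_candidate(cand, ids)
    table[pair_index(i, j, n)] += 1
```

The table holds one slot for every possible ID pair, so its size grows quadratically with the number of large (k-1)-itemsets. This is the kind of memory cost the abstract says the partial hash variant is meant to reduce.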
Pages: 185-202
Page count: 18