Efficient mining of temporal traversal patterns from very large Web logs

被引:0
|
作者
Chen, ZX [1 ]
机构
[1] Univ Texas Pan Amer, Dept Comp Sci, Edinburg, TX 78539 USA
关键词
web mining; access session; temporal content page; temporal traversal pattern; suffix tree;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A Web page in a Web access session is considered as a temporal content page, if its access time is greater than the average access time of all the pages in the session. A maximal temporal reference of a Web user in an access session is a longest consecutive sequence of Web pages in the session which ends at a temporal content page and has no other temporal content pages in the sequence. The problem of efficient mining of frequent temporal traversal patterns, i.e., large temporal reference sequences of maximal temporal references, from very large Web logs is important in Web mining. This paper aims for algorithmic solutions to the problem with best possible efficiency. We first design linear time algorithms for finding maximal temporal references from Web logs. We then devise a linear time algorithm for mining frequent temporal traversal patterns, utilizing the technique developed in [8, 9] for fast construction of "shallow" generalized suffix trees over a very large alphabet.
引用
收藏
页码:10 / 16
页数:7
相关论文
共 50 条
  • [1] Linear and sublinear time algorithms for mining frequent traversal path patterns from very large web logs
    Chen, ZX
    Fowler, RH
    Fu, AWC
    Wang, CY
    [J]. SEVENTH INTERNATIONAL DATABASE ENGINEERING AND APPLICATIONS SYMPOSIUM, PROCEEDINGS, 2003, : 117 - 122
  • [2] An efficient incremental algorithm for mining web traversal patterns
    Yen, SJ
    Lee, YS
    Hsieh, MC
    [J]. ICEBE 2005: IEEE INTERNATIONAL CONFERENCE ON E-BUSINESS ENGINEERING, PROCEEDINGS, 2005, : 274 - 281
  • [3] Efficient approach for interactively mining web traversal patterns
    Lee, YS
    Hsieh, MC
    Yen, SJ
    [J]. COMPUTATIONAL SCIENCE AND ITS APPLICATIONS - ICCSA 2005, PT 2, 2005, 3481 : 1055 - 1065
  • [4] Efficient mining of traversal patterns
    Xiao, YQ
    Dunham, MH
    [J]. DATA & KNOWLEDGE ENGINEERING, 2001, 39 (02) : 191 - 214
  • [5] Efficient Mining of Utility-Based Web Path Traversal Patterns
    Ahmed, Chowdhury Farhan
    Tanbeer, Syed Khairuzzaman
    Jeong, Byeong-Soo
    Lee, Young-Koo
    [J]. 11TH INTERNATIONAL CONFERENCE ON ADVANCED COMMUNICATION TECHNOLOGY, VOLS I-III, PROCEEDINGS,: UBIQUITOUS ICT CONVERGENCE MAKES LIFE BETTER!, 2009, : 2215 - 2218
  • [6] Mining access patterns efficiently from Web logs
    Pei, J
    Han, JW
    Mortazavi-asl, B
    Zhu, H
    [J]. KNOWLEDGE DISCOVERY AND DATA MINING, PROCEEDINGS: CURRENT ISSUES AND NEW APPLICATIONS, 2000, 1805 : 396 - 407
  • [7] Incremental and interactive mining of web traversal patterns
    Lee, Yue-Shi
    Yen, Show-Jane
    [J]. INFORMATION SCIENCES, 2008, 178 (02) : 287 - 306
  • [8] Efficient data mining for path traversal patterns
    Chen, MS
    Park, JS
    Yu, PS
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 1998, 10 (02) : 209 - 221
  • [9] Efficient Incremental Mining of Qualified Web Traversal Patterns without Scanning Original Databases
    Ying, Jia-Ching
    Tseng, Vincent S.
    Yu, Philip S.
    [J]. 2009 IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS (ICDMW 2009), 2009, : 338 - +
  • [10] Web mining of preferred traversal patterns in fuzzy environments
    Wu, R
    Tang, WS
    Zhao, RQ
    [J]. ROUGH SETS, FUZZY SETS, DATA MINING, AND GRANULAR COMPUTING, PT 2, PROCEEDINGS, 2005, 3642 : 456 - 465