Efficient mining of temporal traversal patterns from very large Web logs

被引:0
|
作者
Chen, ZX [1 ]
机构
[1] Univ Texas Pan Amer, Dept Comp Sci, Edinburg, TX 78539 USA
关键词
web mining; access session; temporal content page; temporal traversal pattern; suffix tree;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A Web page in a Web access session is considered as a temporal content page, if its access time is greater than the average access time of all the pages in the session. A maximal temporal reference of a Web user in an access session is a longest consecutive sequence of Web pages in the session which ends at a temporal content page and has no other temporal content pages in the sequence. The problem of efficient mining of frequent temporal traversal patterns, i.e., large temporal reference sequences of maximal temporal references, from very large Web logs is important in Web mining. This paper aims for algorithmic solutions to the problem with best possible efficiency. We first design linear time algorithms for finding maximal temporal references from Web logs. We then devise a linear time algorithm for mining frequent temporal traversal patterns, utilizing the technique developed in [8, 9] for fast construction of "shallow" generalized suffix trees over a very large alphabet.
引用
收藏
页码:10 / 16
页数:7
相关论文
共 50 条
  • [21] Mining temporal web interesting patterns
    Hu, Xianwei
    Yin, Ying
    Zhang, Bin
    [J]. CIS: 2007 INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND SECURITY, PROCEEDINGS, 2007, : 227 - +
  • [22] Design and Implementation of an algorithm for finding frequent sequential traversal patterns from web logs based on weight constraint
    Sisodia, Mahendra Singh
    Pathak, Mayank
    Verma, Bhupendra
    Nigam, Rajesh K.
    [J]. 2009 SECOND INTERNATIONAL CONFERENCE ON EMERGING TRENDS IN ENGINEERING AND TECHNOLOGY (ICETET 2009), 2009, : 592 - +
  • [23] An efficient web traversal pattern mining algorithm based on suffix array
    Jing, T
    Zuo, WL
    Zhang, BZ
    [J]. PROCEEDINGS OF THE 2004 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2004, : 1535 - 1539
  • [24] Optimal algorithms for finding user access sessions from very large web logs
    Chen, ZX
    Fu, AWC
    Tong, FCH
    [J]. WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS, 2003, 6 (03): : 259 - 279
  • [25] Optimal Algorithms for Finding User Access Sessions from Very Large Web Logs
    Zhixiang Chen
    Ada Wai-Chee Fu
    Frank Chi-Hung Tong
    [J]. World Wide Web, 2003, 6 : 259 - 279
  • [26] Mining user access patterns with traversal constraint for predicting web page requests
    Mei-Ling Shyu
    Choochart Haruechaiyasak
    Shu-Ching Chen
    [J]. Knowledge and Information Systems, 2006, 10 : 515 - 528
  • [27] Mining user access patterns with traversal constraint for predicting web page requests
    Shyu, Mei-Ling
    Haruechaiyasak, Choochart
    Chen, Shu-Ching
    [J]. KNOWLEDGE AND INFORMATION SYSTEMS, 2006, 10 (04) : 515 - 528
  • [28] A Lattice-Based Framework for Interactively and Incrementally Mining Web Traversal Patterns
    Lee, Yue-Shi
    Yen, Show-Jane
    Hsieh, Min-Chi
    [J]. INTERNATIONAL JOURNAL OF WEB INFORMATION SYSTEMS, 2005, 1 (04) : 197 - +
  • [29] Mining temporal patterns in popularity of web items
    Loh, Woong-Kee
    Mane, Sandeep
    Srivastava, Jaideep
    [J]. INFORMATION SCIENCES, 2011, 181 (22) : 5010 - 5028
  • [30] Efficient mining of cross-transaction web usage patterns in large database
    Chen, J
    Ou, LY
    Yin, J
    Huang, J
    [J]. NETWORKING AND MOBILE COMPUTING, PROCEEDINGS, 2005, 3619 : 519 - 528