Mining Interesting Knowledge from Web-Log

被引:1
|
作者
ZHOU Hong-fang 1
2. School of Computer Science and Engineering
机构
关键词
Web Mining; user preferred path; Web-log; support-interest; personalized services;
D O I
暂无
中图分类号
TP393 [计算机网络];
学科分类号
081201 ; 1201 ;
摘要
Web-log contains a lot of information related with user activities on the Internet. How to mine user browsing interest patterns effectively is an important and challengeable research topic. On the analysis of the present algorithm’s advantages and disadvantages, we propose a new concept: support-interest. Its key insight is that visitor will backtrack if they do not find the information where they expect. And the point from where they backtrack is the expected location for the page. We present User Access Matrix and the corresponding algorithm for discovering such expected locations that can handle page caching by the browser. Since the URL-URL matrix is a sparse matrix which can be represented by List of 3-tuples, we can mine user preferred sub-paths from the computation of this matrix. Accordingly, all the sub-paths are merged, and user preferred paths are formed. Experiments showed that it was accurate and scalable. It’s suitable for website based application, such as to optimize website’s topological structure or to design personalized services.
引用
收藏
页码:569 / 574
页数:6
相关论文
共 50 条
  • [1] Mining user preferred knowledge from web-log
    Zhou Hong-fang
    Feng Bo-qin
    Yue Hui
    Lv Lin-tao
    [J]. 2006 INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND SECURITY, PTS 1 AND 2, PROCEEDINGS, 2006, : 121 - 124
  • [2] Web-log mining for predictive Web caching
    Yang, Q
    Zhang, HH
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2003, 15 (04) : 1050 - 1053
  • [3] 高效的WEB-Log Mining算法
    马涛
    刘昱琨
    王希雷
    [J]. 计算机技术与发展, 2003, (04) : 71 - 72
  • [4] Spatial Mining of Web-Log to observe reachability of Website
    Kamakshi
    Mehrotra, Deepti
    Deep, Vikas
    [J]. 2018 4TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE & COMMUNICATION TECHNOLOGY (CICT), 2018,
  • [5] 基于Web-Log Mining的Web文档聚类
    苏中
    马少平
    杨强
    张宏江
    [J]. 软件学报, 2002, (01) : 99 - 104
  • [6] A novel datacube model supporting interactive web-log mining
    Ohmori, T
    Tsutatani, Y
    Hoshi, M
    [J]. FIRST INTERNATIONAL SYMPOSIUM ON CYBER WORLDS, PROCEEDINGS, 2002, : 419 - 427
  • [7] 基于Web-Log Mining的N元预测模型
    苏中
    马少平
    杨强
    张宏江
    [J]. 软件学报, 2002, (01) : 136 - 141
  • [8] Research of Weighted Frequent Patterns Algorithm Based on Web-Log Mining
    Zou, Lixia
    Xue, Haiyan
    [J]. APPLIED INFORMATICS AND COMMUNICATION, PT I, 2011, 224 : 241 - +
  • [9] Research of Weighted Frequent Patterns Algorithm based on Web-log mining
    Zou, Lixia
    Xue, Haiyan
    [J]. 2010 THE 3RD INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND INDUSTRIAL APPLICATION (PACIIA2010), VOL I, 2010, : 147 - 150
  • [10] Web-log cleaning for constructing sequential classifiers
    Yang, Q
    Li, TI
    Wang, K
    [J]. APPLIED ARTIFICIAL INTELLIGENCE, 2003, 17 (5-6) : 431 - 441