Data mining of web access logs from an academic web site

Cited by: 0
Authors
Ciesielski, V
Lalani, A
DOI
Not available
Chinese Library Classification
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
We used a general-purpose data mining tool to determine whether any 'golden nuggets' could be found in the web access logs of a large academic web site. Our goal was to apply general-purpose data mining algorithms to characterise visitors to the site and to distinguish between groups of them. We used two web access logs, one from 2001 and one from 2003. We extracted four feature sets from the logs and applied algorithms for classification (1R, J48/C4.5), clustering (EM), association finding (Apriori) and feature selection (correlation-based subset evaluation with best-first search). We discovered several nuggets. The most significant is a major difference between visitors from within Australia and those from outside: visitors from outside Australia generally arrive via search engines and are interested in information about postgraduate courses.
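As an illustration of the simplest classifier named in the abstract, the following is a minimal sketch of the 1R idea: for each feature, build a rule that maps every feature value to its most frequent class, then keep the feature whose rule makes the fewest training errors. The function name `one_r`, the feature names `referrer` and `section`, and the visitor records are hypothetical and hand-made for this example; they are not the feature sets or data used in the paper.

```python
from collections import Counter, defaultdict


def one_r(rows, features, target):
    """Return (best_feature, rule, errors) chosen by the 1R method."""
    best = None
    for feature in features:
        # Count how often each target class occurs for each feature value.
        counts = defaultdict(Counter)
        for row in rows:
            counts[row[feature]][row[target]] += 1
        # The rule predicts the most frequent class for each feature value.
        rule = {value: c.most_common(1)[0][0] for value, c in counts.items()}
        # Training errors: records that are not in the majority class
        # of their feature value.
        errors = sum(
            sum(c.values()) - c.most_common(1)[0][1] for c in counts.values()
        )
        if best is None or errors < best[2]:
            best = (feature, rule, errors)
    return best


# Hypothetical visitor records (assumed for illustration only).
visitors = [
    {"referrer": "search", "section": "postgrad", "origin": "overseas"},
    {"referrer": "search", "section": "postgrad", "origin": "overseas"},
    {"referrer": "search", "section": "staff", "origin": "overseas"},
    {"referrer": "direct", "section": "postgrad", "origin": "australia"},
    {"referrer": "direct", "section": "staff", "origin": "australia"},
    {"referrer": "direct", "section": "undergrad", "origin": "australia"},
]

best_feature, rule, errors = one_r(visitors, ["referrer", "section"], "origin")
print(best_feature, rule, errors)
```

On this toy data the selected feature is `referrer`, because its rule separates the two classes with no training errors, while the `section` rule misclassifies two records.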
Pages: 1034 - 1043
Page count: 10
Related papers
50 in total
  • [31] Mining web logs for a personalized recommender system
    Puntheeranurak, S
    Tsuji, H
    ITRE 2005: 3RD INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY: RESEARCH AND EDUCATION, PROCEEDINGS, 2005, : 445 - 448
  • [32] Matrix dimensionality reduction for mining Web logs
    Lu, JJ
    Xu, BW
    Yang, HJ
    IEEE/WIC INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE, PROCEEDINGS, 2003, : 405 - 408
  • [33] Mining Frequent Attack Sequence in Web Logs
    Sun, Hui
    Sun, Jianhua
    Chen, Hao
    GREEN, PERVASIVE, AND CLOUD COMPUTING, 2016, 9663 : 243 - 260
  • [34] A unified representation of web logs for mining applications
    Diligenti, Michelangelo
    Gori, Marco
    Maggini, Marco
    INFORMATION RETRIEVAL, 2011, 14 (03): : 215 - 236
  • [35] Mining Web logs for Prediction in Prefetching and Caching
    Songwattana, Areerat
    THIRD 2008 INTERNATIONAL CONFERENCE ON CONVERGENCE AND HYBRID INFORMATION TECHNOLOGY, VOL 2, PROCEEDINGS, 2008, : 1006 - 1011
  • [36] A data mining based method for web site maintenance
    Burn-Thornton, K. E.
    Carrington, M.
    Burman, T.
    INTELLIGENT DATA ANALYSIS, 2006, 10 (06) : 555 - 581
  • [37] Mining web usage data for automatic site personalization
    Mobasher, B
    CLASSIFICATION, AUTOMATION, AND NEW MEDIA, 2002, : 299 - 312
  • [38] Mining maximum frequent access patterns in web logs based on unique labeled tree
    Zhang, Ling
    Yin, Ran-ping
    Zhan, Yu-bin
    WEB INFORMATION SYSTEMS - WISE 2006 WORKSHOPS, PROCEEDINGS, 2006, 4256 : 73 - 82
  • [39] From ERP to data mining on the web
    Ghenea, Șerban
    UPB Scientific Bulletin, Series C: Electrical Engineering, 2011, 73 (04): : 89 - 98
  • [40] FROM ERP TO DATA MINING ON THE WEB
    Ghenea, Serban
    UNIVERSITY POLITEHNICA OF BUCHAREST SCIENTIFIC BULLETIN SERIES C-ELECTRICAL ENGINEERING AND COMPUTER SCIENCE, 2011, 73 (04): : 89 - 98