Data mining of web access logs from an academic web site

被引:0
|
作者
Ciesielski, V
Lalani, A
机构
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We have used a general purpose data mining tool to determine whether we can find any 'golden nuggets' in the web access logs of a large academic web site. Our goal was to use general purpose data mining algorithms to analyse visitors to the website and somehow characterise or distinguish them in some way. We used two web access logs, one from 2001 and one from 2003. We extracted 4 different feature sets from the web logs and used algorithms for classification (1R, J48/C4.5), clustering (EM), association finding (apriori) and feature selection (correlation based subset evaluation with best first search). We discovered several nuggets, the most significant being that a major difference between visitors from within Australia and visitors from outside Australia is that visitors from outside Australia generally arrive via search engines and are interested in information about postgraduate courses.
引用
下载
收藏
页码:1034 / 1043
页数:10
相关论文
共 50 条
  • [21] Ontology-based partitioning of data steam for Web mining: A case study of Web logs
    Jung, JJ
    COMPUTATIONAL SCIENCE - ICCS 2004, PT 1, PROCEEDINGS, 2004, 3036 : 247 - 254
  • [22] Mining and tracking evolving web user trends from large web server logs
    Hawwash B.
    Nasraoui O.
    Statistical Analysis and Data Mining, 2010, 3 (02): : 106 - 125
  • [23] Analysis of Web Site Using Web Log Expert Tool Based on Web Data Mining
    Singh, Satya Prakash
    Meenu
    2017 INTERNATIONAL CONFERENCE ON INNOVATIONS IN INFORMATION, EMBEDDED AND COMMUNICATION SYSTEMS (ICIIECS), 2017,
  • [24] Detection of Malicious Requests on Web Logs Using Data Mining Techniques
    Sahin, Mehmet Emin
    Ozdemir, Suat
    2019 4TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND ENGINEERING (UBMK), 2019, : 463 - 468
  • [25] Discovery of web frequent patterns and user characteristics from web access logs: A framework for dynamic web personalization
    Dua, S
    Cho, EC
    Iyengar, SS
    3RD IEEE SYMPOSIUM ON APPLICATION SPECIFIC SYSTEMS AND SOFTWARE ENGINEERING TECHNOLOGY, PROCEEDINGS, 2000, : 3 - 8
  • [26] Enhancing web access using data mining techniques
    Vaisman, AA
    Dandretta, G
    Sapia, M
    14TH INTERNATIONAL WORKSHOP ON DATABASE AND EXPERT SYSTEMS APPLICATIONS, PROCEEDINGS, 2003, : 327 - 331
  • [27] Characterizing crawler behavior from Web server access logs
    Dikaiakos, M
    Stassopoulou, A
    Papageorgiou, L
    E-COMMERCE AND WEB TECHNOLOGIES, PROCEEDINGS, 2003, 2738 : 369 - 378
  • [28] Efficient frequent pattern mining on web logs
    Sun, LP
    Zhang, XZ
    ADVANCED WEB TECHNOLOGIES AND APPLICATIONS, 2004, 3007 : 533 - 542
  • [29] A unified representation of web logs for mining applications
    Michelangelo Diligenti
    Marco Gori
    Marco Maggini
    Information Retrieval, 2011, 14 : 215 - 236
  • [30] Research on analysis and mining of web query logs
    Fu, B. (bfu@ir.hit.edu.cn), 1800, Chinese Institute of Electronics (41):