Data mining of web access logs from an academic web site

被引:0
|
作者
Ciesielski, V
Lalani, A
机构
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We have used a general purpose data mining tool to determine whether we can find any 'golden nuggets' in the web access logs of a large academic web site. Our goal was to use general purpose data mining algorithms to analyse visitors to the website and somehow characterise or distinguish them in some way. We used two web access logs, one from 2001 and one from 2003. We extracted 4 different feature sets from the web logs and used algorithms for classification (1R, J48/C4.5), clustering (EM), association finding (apriori) and feature selection (correlation based subset evaluation with best first search). We discovered several nuggets, the most significant being that a major difference between visitors from within Australia and visitors from outside Australia is that visitors from outside Australia generally arrive via search engines and are interested in information about postgraduate courses.
引用
下载
收藏
页码:1034 / 1043
页数:10
相关论文
共 50 条
  • [1] Detecting Web Crawlers from Web Server Access Logs with Data Mining Classifiers
    Stevanovic, Dusan
    An, Aijun
    Vlajic, Natalija
    FOUNDATIONS OF INTELLIGENT SYSTEMS, 2011, 6804 : 483 - 489
  • [2] Mining access patterns efficiently from Web logs
    Pei, J
    Han, JW
    Mortazavi-asl, B
    Zhu, H
    KNOWLEDGE DISCOVERY AND DATA MINING, PROCEEDINGS: CURRENT ISSUES AND NEW APPLICATIONS, 2000, 1805 : 396 - 407
  • [3] Discovering web access patterns and trends by applying OLAP and data mining technology on web logs
    Zaiane, OR
    Xin, M
    Han, JW
    IEEE INTERNATIONAL FORUM ON RESEARCH AND TECHNOLOGY ADVANCES IN DIGITAL LIBRARIES -ADL'98-, PROCEEDINGS, 1998, : 19 - 29
  • [4] A top-down algorithm for mining web access patterns from web logs
    Guo, JK
    Ruan, BJ
    Cheng, ZP
    Su, FZ
    Wang, YQ
    Deng, XB
    Shang, N
    Zhu, YY
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PROCEEDINGS, 2005, 3518 : 838 - 843
  • [5] Mining web logs for personalized site maps
    Toolan, F
    Kusmerick, N
    WISE 2002: PROCEEDINGS OF THE THIRD INTERNATIONAL CONFERENCE ON WEB INFORMATION SYSTEMS ENGINEERING (WORKSHOPS), 2002, : 232 - 237
  • [6] Web usage mining: extracting unexpected periods from web logs
    F. Masseglia
    P. Poncelet
    M. Teisseire
    A. Marascu
    Data Mining and Knowledge Discovery, 2008, 16 : 39 - 65
  • [7] Web usage mining: extracting unexpected periods from web logs
    Masseglia, F.
    Poncelet, P.
    Teisseire, M.
    Marascu, A.
    DATA MINING AND KNOWLEDGE DISCOVERY, 2008, 16 (01) : 39 - 65
  • [8] Mining on Web logs for recommendation
    College of Computer Science, Zhejiang University, 38 Zheda Road, Hangzhou 310027 Zhejiang, China
    WSEAS Trans. Comput., 2006, 9 (1818-1822):
  • [9] Mining web logs to locate target web pages
    Guo, Ping
    Yang, Houqun
    Chen, Ting
    Wang, Yanxia
    Journal of Computational Information Systems, 2007, 3 (04): : 1691 - 1698
  • [10] Web Site Auditing Using Web Access Log Data
    He, Si
    Balecel, Nabil
    Hamam, Habib
    Bouslimani, Yassine
    2009 7TH ANNUAL COMMUNICATION NETWORKS AND SERVICES RESEARCH CONFERENCE, 2009, : 94 - +