Online and incremental mining of separately-grouped web access logs

被引:0
|
作者
Woon, YK [1 ]
Ng, WK [1 ]
Lim, EP [1 ]
机构
[1] Nanyang Technol Univ, Singapore 639798, Singapore
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The rising popularity of electronic commerce makes data mining an indispensable technology for business competitiveness. The World Wide Web provides abundant raw data in the form of web access logs, web transaction logs and web user profiles. Without data mining tools, it is impossible to make any sense of such massive data. In this paper we focus on web usage mining because it deals most appropriately with understanding user behavioral patterns which is the key to successful customer relationship management. Previous work deals separately on specific issues of web usage mining and make assumptions without taking a holistic view and thus, have limited practical applicability. We formulate a novel and more holistic version of web usage mining termed TRAnsactionized LOgfile Mining (TRALOM) to effectively and correctly identify transactions as well as to mine useful knowledge from web access logs. We also introduce a new data structure, called the WebTrie, to efficiently hold useful preprocessed data so that TRALOM can be done in an online and incremental fashion. Experiments conducted on real web server logs verify the usefulness and practicality of our proposed techniques.
引用
收藏
页码:53 / 62
页数:10
相关论文
共 50 条
  • [1] Mining access patterns efficiently from Web logs
    Pei, J
    Han, JW
    Mortazavi-asl, B
    Zhu, H
    [J]. KNOWLEDGE DISCOVERY AND DATA MINING, PROCEEDINGS: CURRENT ISSUES AND NEW APPLICATIONS, 2000, 1805 : 396 - 407
  • [2] Data mining of web access logs from an academic web site
    Ciesielski, V
    Lalani, A
    [J]. DESIGN AND APPLICATION OF HYBRID INTELLIGENT SYSTEMS, 2003, 104 : 1034 - 1043
  • [3] Detecting Web Crawlers from Web Server Access Logs with Data Mining Classifiers
    Stevanovic, Dusan
    An, Aijun
    Vlajic, Natalija
    [J]. FOUNDATIONS OF INTELLIGENT SYSTEMS, 2011, 6804 : 483 - 489
  • [4] A top-down algorithm for mining web access patterns from web logs
    Guo, JK
    Ruan, BJ
    Cheng, ZP
    Su, FZ
    Wang, YQ
    Deng, XB
    Shang, N
    Zhu, YY
    [J]. ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PROCEEDINGS, 2005, 3518 : 838 - 843
  • [5] Mining on Web logs for recommendation
    College of Computer Science, Zhejiang University, 38 Zheda Road, Hangzhou 310027 Zhejiang, China
    [J]. WSEAS Trans. Comput., 2006, 9 (1818-1822):
  • [6] Discovering web access patterns and trends by applying OLAP and data mining technology on web logs
    Zaiane, OR
    Xin, M
    Han, JW
    [J]. IEEE INTERNATIONAL FORUM ON RESEARCH AND TECHNOLOGY ADVANCES IN DIGITAL LIBRARIES -ADL'98-, PROCEEDINGS, 1998, : 19 - 29
  • [7] Mining Web Logs - An Automated Approach
    Sudhamathy, G.
    [J]. PROCEEDINGS OF THE FIRST AMRITA ACM-W CELEBRATION OF WOMEN IN COMPUTING IN INDIA (A2WIC), 2010,
  • [8] Mining Online users' access records for web business intelligence
    Fong, S
    Chan, S
    [J]. 2002 IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2002, : 759 - 762
  • [9] Mining maximum frequent access patterns in web logs based on unique labeled tree
    Zhang, Ling
    Yin, Ran-ping
    Zhan, Yu-bin
    [J]. WEB INFORMATION SYSTEMS - WISE 2006 WORKSHOPS, PROCEEDINGS, 2006, 4256 : 73 - 82
  • [10] How People Read Books Online: Mining and Visualizing Web Logs for Use Information
    Chen, Rong
    Rose, Anne
    Bederson, Benjamin B.
    [J]. RESEARCH AND ADVANCED TECHNOLOGY FOR DIGITAL LIBRARIES, PROCEEDINGS, 2009, 5714 : 364 - +