Research on mining user browsing patterns in large web logs based on Poisson sampling and Sequence Alignment Method

被引:0
|
作者
Liu, Peiqian [1 ]
An, Jiyu [1 ]
Guo, Hairu [1 ]
机构
[1] Henan Polytech Univ Jiaozuo, Sch Comp Sci & Technol, Kaifeng 454003, Peoples R China
关键词
data mining; Poisson sampling; Sequence Alignment Method (SAM);
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The continuous growth in the size and use of the Internet is creating large sever logs on web servers and difficulties in the search for information. A sophisticated method to organize the layout of the information and assist user navigation is therefore particularly important. In this paper, we valuate the feasibility of using a Poisson sampling and SAM to mine large web log data. A sample sets selected by Poisson sampling statistically effectively represent the characteristics of the entire dataset. In addition, users are partitioned into clusters using a non-Euclidean distance measure, called Sequence Alignment Method (SAM).
引用
收藏
页码:1119 / 1121
页数:3
相关论文
共 47 条
  • [1] Analysis of large data logs: an application of Poisson sampling on excite web queries
    Ozmutlu, HC
    Spink, A
    Ozmutla, S
    INFORMATION PROCESSING & MANAGEMENT, 2002, 38 (04) : 473 - 490
  • [2] Mining and tracking evolving web user trends from large web server logs
    Hawwash B.
    Nasraoui O.
    Statistical Analysis and Data Mining, 2010, 3 (02): : 106 - 125
  • [3] Mining navigation patterns using a sequence alignment method
    Birgit Hay
    Geert Wets
    Koen Vanhoof
    Knowledge and Information Systems, 2004, 6 (2) : 150 - 163
  • [4] Mining Navigation Patterns Using a Sequence Alignment Method
    Birgit Hay
    Geert Wets
    Koen Vanhoof
    Knowledge and Information Systems, 2004, 6 : 150 - 163
  • [5] Mining navigation patterns using a sequence alignment method
    Hay, B
    Wets, G
    Vanhoof, K
    KNOWLEDGE AND INFORMATION SYSTEMS, 2004, 6 (02) : 150 - 163
  • [6] Web Mining Based on User Access Patterns for Web Personalization
    Wang Xiao-Gang
    Li Yue
    2009 ISECS INTERNATIONAL COLLOQUIUM ON COMPUTING, COMMUNICATION, CONTROL, AND MANAGEMENT, VOL I, 2009, : 194 - 197
  • [7] The characteristic analysis of web user clusters based on frequent browsing patterns
    Zhang, Zhiwang
    Shi, Yong
    COMPUTATIONAL SCIENCE - ICCS 2007, PT 2, PROCEEDINGS, 2007, 4488 : 490 - +
  • [8] Efficient mining of temporal traversal patterns from very large Web logs
    Chen, ZX
    DMIN '05: PROCEEDINGS OF THE 2005 INTERNATIONAL CONFERENCE ON DATA MINING, 2005, : 10 - 16
  • [9] Web user interest mining based on ontology and patterns
    Su, Xue-Yang
    Zuo, Wan-Li
    Wang, Jun-Hua
    Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2014, 42 (08): : 1556 - 1563
  • [10] Future view: Web navigation based on learning user's browsing patterns
    Nagino, N
    Yamada, S
    IEEE/WIC INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE, PROCEEDINGS, 2003, : 541 - 544