Frequent Sequence Mining in Web Log Data

被引:3
|
作者
Weichbroth, Pawel [1 ]
机构
[1] Gdansk Univ Technol, Dept Appl Informat Management, Ul Narutowicza 11-12, PL-80233 Gdansk, Poland
来源
关键词
Sequence; Mining; Web; Usage; ALGORITHM;
D O I
10.1007/978-3-319-67792-7_45
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The amount of information available even on a single web server can be huge. On the other hand, the amount of visitors (users) can often reach a number of at least six digits. Users vary in gender, age and education, and in consequence their information needs are different. Moreover, they subconsciously expect to get more adequate content after visiting the first few pages. The scope of this kind of problem relates to the domain of information filtering, as a method for delivering relevant information. To solve such a problem, different sources of unstructured or structured data can be used, one of the latter type being web server log data. Executed logging processes on the server side can gather valuable data showing requests sent by users to available resources shared on a particular web site. In this paper, we introduce the Apriori-like FWP algorithm for frequent sequence mining in web log data. Discovered sequences present reconstructed navigation paths across shared web pages by a number of users satisfying a defined minimum. Such knowledge can primarily be used for content recommendation, as well as in cross-marketing strategies and email promotion campaigns.
引用
收藏
页码:459 / 467
页数:9
相关论文
共 50 条
  • [1] Frequent Pattern Mining in Web Log Data
    Ivancsy, Renata
    Vajk, Istvan
    [J]. ACTA POLYTECHNICA HUNGARICA, 2006, 3 (01) : 77 - 90
  • [2] Performance Evaluation of Frequent Pattern Mining Algorithms using Web Log Data for Web Usage Mining
    Gashaw, Yonas
    Liu, Fang
    [J]. 2017 10TH INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING, BIOMEDICAL ENGINEERING AND INFORMATICS (CISP-BMEI), 2017,
  • [3] webSPADE: A parallel sequence mining algorithm to analyze web log data
    Demiriz, A
    [J]. 2002 IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2002, : 755 - 758
  • [4] Mining Frequent Attack Sequence in Web Logs
    Sun, Hui
    Sun, Jianhua
    Chen, Hao
    [J]. GREEN, PERVASIVE, AND CLOUD COMPUTING, 2016, 9663 : 243 - 260
  • [5] Web Log Data Analysis and Mining
    Grace, L. K. Joshila
    Maheswari, V.
    Nagamalai, Dhinaharan
    [J]. ADVANCED COMPUTING, PT III, 2011, 133 : 459 - 469
  • [6] Data preparation in web log mining
    Lu, Lina
    Yang, Yiling
    Guan, Xudong
    Wei, Hengyi
    [J]. Jisuanji Gongcheng/Computer Engineering, 2000, 26 (04): : 66 - 67
  • [7] Web log data mining analysis
    Lu Ansheng
    [J]. 2012 INTERNATIONAL CONFERENCE ON INTELLIGENCE SCIENCE AND INFORMATION ENGINEERING, 2012, 20 : 213 - 215
  • [8] User frequent navigation pattern mining model for web log
    Zhou, KJ
    Qu, YF
    Gu, HZ
    [J]. PROCEEDINGS OF THE 2005 INTERNATIONAL CONFERENCE ON MANAGEMENT SCIENCE & ENGINEERING (12TH), VOLS 1- 3, 2005, : 209 - 213
  • [9] Preprocessing and mining web log data for web personalization
    Baglioni, M
    Ferrara, U
    Romei, A
    Ruggieri, S
    Turini, F
    [J]. AI(ASTERISK)IA 2003: ADVANCES IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2003, 2829 : 237 - 249
  • [10] A unified approach to web usage mining based on frequent sequence mining
    Inuzuka, Nobuhiro
    Hayakawa, Jun-ichi
    [J]. KNOWLEDGE-BASED INTELLIGENT INFORMATION AND ENGINEERING SYSTEMS: KES 2007 - WIRN 2007, PT II, PROCEEDINGS, 2007, 4693 : 987 - 994