MapReduce-based web mining for prediction of web-user navigation

Cited by: 11
Authors
Li, Meijing [1]
Yu, Xiuming [1]
Ryu, Keun Ho [1]
Affiliations
[1] Chungbuk Natl Univ, Coll Elect & Comp Engn, Database Bioinformat Lab, Cheongju, South Korea
Funding
National Research Foundation of Singapore
Keywords
Frequent sequence patterns; MapReduce; web-usage mining; web user behaviour
DOI
10.1177/0165551514544096
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Predicting web user behaviour is a typical application of frequent sequence pattern mining. With the rapid growth of the Internet, a large amount of information is stored in web logs, and traditional frequent-sequence-pattern-mining algorithms are hard pressed to analyse datasets of this size. In this paper, we propose an efficient way to predict the navigation patterns of web users by improving frequent-sequence-pattern-mining algorithms using the MapReduce programming model, which can handle huge datasets efficiently. Our experiments show that the proposed MapReduce-based algorithm is more efficient than traditional frequent-sequence-pattern-mining algorithms, and a comparison with existing web-usage-mining algorithms shows that using the MapReduce programming model saves time.
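The general idea of counting frequent navigation sequences in a map step and a reduce step can be illustrated with a minimal single-process sketch. This is not the authors' algorithm, which the abstract does not specify. The session data, the function names (`map_session`, `reduce_counts`, `frequent_patterns`, `predict_next`), the restriction to contiguous page subsequences, and the `min_support` and `max_len` parameters are all assumptions made for illustration.

```python
from collections import defaultdict
from itertools import chain


def map_session(session, max_len=2):
    """Map step (illustrative): emit (pattern, 1) once for each distinct
    contiguous page subsequence of length 1..max_len in one session."""
    seen = set()
    for n in range(1, max_len + 1):
        for i in range(len(session) - n + 1):
            seen.add(tuple(session[i:i + n]))
    return [(pattern, 1) for pattern in seen]


def reduce_counts(pairs, min_support):
    """Reduce step (illustrative): sum the counts per pattern and keep
    only patterns that occur in at least min_support sessions."""
    totals = defaultdict(int)
    for pattern, count in pairs:
        totals[pattern] += count
    return {p: c for p, c in totals.items() if c >= min_support}


def frequent_patterns(sessions, min_support=2, max_len=2):
    """Run the map step over every session, then reduce the emitted pairs.
    In a real MapReduce job the map calls would run on separate workers."""
    pairs = chain.from_iterable(map_session(s, max_len) for s in sessions)
    return reduce_counts(pairs, min_support)


def predict_next(patterns, current_page):
    """Return the page that most often follows current_page among the
    frequent length-2 patterns, or None if no such pattern exists."""
    candidates = {
        p[1]: c
        for p, c in patterns.items()
        if len(p) == 2 and p[0] == current_page
    }
    return max(candidates, key=candidates.get) if candidates else None


# Hypothetical sessions, each an ordered list of visited pages.
sessions = [
    ["home", "products", "cart"],
    ["home", "products", "about"],
    ["home", "about"],
]
patterns = frequent_patterns(sessions, min_support=2)
print(predict_next(patterns, "home"))
```

Because each session is mapped independently and the reduce step only sums counts per key, the same structure could in principle be distributed across workers, which is the property that makes MapReduce attractive for large web logs.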
Pages: 557-567
Number of pages: 11