MapReduce-based web mining for prediction of web-user navigation

被引:11
|
作者
Li, Meijing [1 ]
Yu, Xiuming [1 ]
Ryu, Keun Ho [1 ]
机构
[1] Chungbuk Natl Univ, Coll Elect & Comp Engn, Database Bioinformat Lab, Cheongju, South Korea
基金
新加坡国家研究基金会;
关键词
Frequent sequence patterns; MapReduce; web-usage mining; web user behaviour;
D O I
10.1177/0165551514544096
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Predicting web user behaviour is typically an application for finding frequent sequence patterns. With the rapid growth of the Internet, a large amount of information is stored in web logs. Traditional frequent-sequence-pattern-mining algorithms are hard pressed to analyse information from within big datasets. In this paper, we propose an efficient way to predict navigation patterns of web users by improving frequent-sequence-pattern-mining algorithms based on the programming model of MapReduce, which can handle huge datasets efficiently. During the experiments, we show that our proposed MapReduce-based algorithm is more efficient than traditional frequent-sequence-pattern-mining algorithms, and by comparing our proposed algorithms with current existed algorithms in web-usage mining, we also prove that using the MapReduce programming model saves time.
引用
收藏
页码:557 / 567
页数:11
相关论文
共 50 条
  • [31] Mining web navigation path fragments
    Gaul, W
    Schmidt-Thieme, L
    MEASUREMENT AND MULTIVARIATE ANALYSIS, 2002, : 249 - 260
  • [32] Efficient mining and prediction of user behavior patterns in mobile web systems
    Tseng, Vincent S.
    Lin, Kawuu W.
    INFORMATION AND SOFTWARE TECHNOLOGY, 2006, 48 (06) : 357 - 369
  • [33] Web navigation prediction based on dynamic threshold heuristics
    Jindal, Honey
    Sardana, Neetu
    JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2022, 34 (06) : 2820 - 2830
  • [34] Dynamic mining for web navigation patterns based on Markov model
    Chen, JJ
    Gao, J
    Hu, J
    Liao, BS
    COMPUTATIONAL AND INFORMATION SCIENCE, PROCEEDINGS, 2004, 3314 : 806 - 811
  • [35] Combined mining of Web server logs and web contents for classifying user navigation patterns and predicting users' future requests
    Liu, Haibin
    Keselj, Vlado
    DATA & KNOWLEDGE ENGINEERING, 2007, 61 (02) : 304 - 330
  • [36] Prediction of User's Trustworthiness in Web-based Social Networks via Text Mining
    Mohammadhassanzadeh, Hossein
    Shahriari, Hamid Reza
    ISECURE-ISC INTERNATIONAL JOURNAL OF INFORMATION SECURITY, 2013, 5 (02): : 171 - 187
  • [37] Visually mining web user clickpaths
    Mah, T
    Li, Y
    2002 IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2002, : 771 - 774
  • [38] MapReduce-based Closed Frequent Itemset Mining with Efficient Redundancy Filtering
    Wang, Su-Qi
    Yang, Yu-Bin
    Chen, Guang-Peng
    Gao, Yang
    Zhang, Yao
    12TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS (ICDMW 2012), 2012, : 449 - 453
  • [39] Accelerating text mining workloads in a MapReduce-based distributed GPU environment
    Wittek, Peter
    Daranyi, Sandor
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2013, 73 (02) : 198 - 206
  • [40] IMPROVED USER NAVIGATION PATTERN PREDICTION TECHNIQUE FROM WEB LOG DATA
    Sujatha, V.
    Punithavalli
    INTERNATIONAL CONFERENCE ON COMMUNICATION TECHNOLOGY AND SYSTEM DESIGN 2011, 2012, 30 : 92 - 99