Performance Evaluation of a MapReduce Hadoop-based Implementation for Processing Large Virtual Campus Log Files

Cited by: 4
Authors
Xhafa, Fatos [1]
Garcia, Daniel [1]
Ramirez, Daniel [1]
Caballe, Santi [2]
Affiliations
[1] Univ Politecn Cataluna, Barcelona, Spain
[2] Open Univ Catalonia, Barcelona, Spain
Keywords
Big Data; Massive Processing; Learning Analytics; Mining; Performance; Virtual Campus; Log Files; MapReduce; Hadoop; Cloud Computing; Analytics
DOI
10.1109/3PGCIC.2015.42
Chinese Library Classification number
TP3 [Computing Technology, Computer Technology]
Discipline classification code
0812
Abstract
Cloud computing technologies are bringing new scales of computational processing power and storage capacity to meet the very demanding requirements of today's applications. One such family of applications is analytics based on processing big data. More specifically, there is a large family of analytics applications built on processing log data files of various types. Log data files are commonplace in many Internet-based systems and applications, and include system logs, server logs, application logs, database logs, user activity logs, etc. While log data file processing has recently been investigated by many researchers and developers, the new feature is that of scale: Cloud-based systems can enable processing an unlimited amount of data, either off-line or online in streaming mode. In this work we evaluate the performance of a MapReduce Hadoop-based implementation for processing large log data files of a Virtual Campus. The study aims to reveal the potential of such implementations as a basis for learning analytics for use by a variety of users in a Virtual Campus.
Pages: 200-206
Page count: 7