Performance Evaluation of Hadoop-based Large-scale Network Traffic Analysis Cluster

被引:0
|
作者
Tao, Ran [1 ]
Qiao, Yuanyuan [1 ]
Zhou, Wenli [1 ]
机构
[1] Beijing Univ Posts & Telecommun, Beijing Key Lab Network Syst Architecture & Conve, Beijing 100876, Peoples R China
关键词
D O I
10.1051/matecconf/20165605015
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
As Hadoop has gained popularity in big data era, it is widely used in various fields. The self-design and self-developed large-scale network traffic analysis cluster works well based on Hadoop, with off-line applications running on it to analyze the massive network traffic data. On purpose of scientifically and reasonably evaluating the performance of analysis cluster, we propose a performance evaluation system. Firstly, we set the execution times of three benchmark applications as the benchmark of the performance, and pick 40 metrics of customized statistical resource data. Then we identify the relationship between the resource data and the execution times by a statistic modeling analysis approach, which is composed of principal component analysis and multiple linear regression. After training models by historical data, we can predict the execution times by current resource data. Finally, we evaluate the performance of analysis cluster by the validated predicting of execution times. Experimental results show that the predicted execution times by trained models are within acceptable error range, and the evaluation results of performance are accurate and reliable.
引用
收藏
页数:7
相关论文
共 50 条
  • [1] Hadoop-Based Analysis for Large-Scale Click-Through Patterns in 4G Network
    Wang, Shuqiang
    Shen, Yanyan
    Hu, Jinxing
    Xuan, Zhe
    Lu, Zhe
    [J]. WIRELESS ALGORITHMS, SYSTEMS, AND APPLICATIONS, 2015, 9204 : 829 - 835
  • [2] A Hadoop-Based Output Analyzer for Large-Scale Simulation Data
    Lee, Kangsun
    Park, Joonho
    [J]. 2014 IEEE FOURTH INTERNATIONAL CONFERENCE ON BIG DATA AND CLOUD COMPUTING (BDCLOUD), 2014, : 197 - 200
  • [3] BioPig: a Hadoop-based analytic toolkit for large-scale sequence data
    Nordberg, Henrik
    Bhatia, Karan
    Wang, Kai
    Wang, Zhong
    [J]. BIOINFORMATICS, 2013, 29 (23) : 3014 - 3019
  • [4] HADOOP-BASED NETWORK TRAFFIC ANOMALY DETECTION IN BACKBONE
    Yu, Jishen
    Liu, Feng
    Zhou, Wenli
    Yu, Hua
    [J]. 2014 IEEE 3rd International Conference on Cloud Computing and Intelligence Systems (CCIS), 2014, : 140 - 145
  • [5] Monitoring and Analyzing Big Traffic Data of a Large-Scale Cellular Network with Hadoop
    Liu, Jun
    Liu, Feng
    Ansari, Nirwan
    [J]. IEEE NETWORK, 2014, 28 (04): : 32 - 39
  • [6] An Efficient Hadoop-Based Framework for Data Storage and Fault Recovering in Large-Scale Multimedia Sensor Networks
    Saad, Ghina
    Harb, Hassan
    Abouaissa, Abdelhafid
    Idoumghar, Lhassane
    Charara, Nour
    [J]. 2020 16TH INTERNATIONAL WIRELESS COMMUNICATIONS & MOBILE COMPUTING CONFERENCE, IWCMC, 2020, : 316 - 321
  • [7] A Hadoop-Based Framework for Large-Scale Landmine Detection Using Ubiquitous Big Satellite Imaging Data
    El-Kazzaz, Sahar
    El-Mahdy, Ahmed
    [J]. 23RD EUROMICRO INTERNATIONAL CONFERENCE ON PARALLEL, DISTRIBUTED, AND NETWORK-BASED PROCESSING (PDP 2015), 2015, : 274 - 278
  • [8] Designing a High Performance Cluster for Large-Scale SQL-on-Hadoop Analytics
    Dholakia, Ajay
    Venkatachar, Prasad
    Doshi, Kshitij
    Durgavajhala, Ravikanth
    Tate, Stewart
    Schiefer, Berni
    Sheard, Matthew
    Sagar, Ramnath Sai
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2017, : 1701 - 1703
  • [9] Performance Evaluation of a MapReduce Hadoop-based Implementation for Processing Large Virtual Campus Log Files
    Xhafa, Fatos
    Garcia, Daniel
    Ramirez, Daniel
    Caballe, Santi
    [J]. 2015 10TH INTERNATIONAL CONFERENCE ON P2P, PARALLEL, GRID, CLOUD AND INTERNET COMPUTING (3PGCIC), 2015, : 200 - 206
  • [10] Network Traffic Analysis Based on Hadoop
    Yang, Jie
    He, Haiyang
    Qiao, Yuanyuan
    [J]. 2014 4TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS, VEHICULAR TECHNOLOGY, INFORMATION THEORY AND AEROSPACE & ELECTRONIC SYSTEMS (VITAE), 2014,