Performance Analysis Using Petri Net Based MapReduce Model in Heterogeneous Clusters

被引:2
|
作者
Cheng, Sheng-Tzong [1 ]
Wang, Hsi-Chuan [1 ]
Chen, Yin-Jun [1 ]
Chen, Chen-Fei [2 ]
机构
[1] Natl Cheng Kung Univ, Dept Comp Sci & Informat Engn, Tainan 70101, Taiwan
[2] Shih Hsin Univ, Dept Informat & Communicat, Taipei, Taiwan
来源
关键词
Cloud computing; MapReduce; Petri net; Performance analysis;
D O I
10.1007/978-3-662-46315-4_18
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Currently, big data and large-scale data processing techniques has become an important developing area. MapReduce is an enabling technology of cloud computing. Hadoop is one of the most popular MapReduce implementation, which is the target platform in this paper. When running a MapReduce job, programmers however cannot acquire the information about how to fine-tune the parameters of application. Moreover, programmers need much time on finding the most suitable parameters. This paper evaluates execution processes in MapReduce and form SPN-MR model with Stochastic Petri Net. In order to analyze the performance of SPN-MR, formulas of mean delay time in each time transition are defined. SPN-MR simulates the elapsed time of any MapReduce jobs with known input data sizes and then reduces time cost in performance tuning. SPN-MR carried out several actual test benchmarks. The results showed the average error rate is within 5 percent. Therefore, it can provide effective performance evaluation reports for MapReduce programmers.
引用
收藏
页码:170 / 179
页数:10
相关论文
共 50 条