Performance Improvement of MapReduce Framework by Identifying Slow TaskTrackers in Heterogeneous Hadoop Cluster

被引:0
|
作者
Naik, Nenavath Srinivas [1 ]
Negi, Atul [1 ]
Sastry, V. N. [2 ]
机构
[1] Univ Hyderabad, Sch Comp & Informat Sci, Hyderabad 500046, Andhra Pradesh, India
[2] Inst Dev & Res Banking Technol, Hyderabad 500057, Andhra Pradesh, India
关键词
Hadoop; MapReduce; Job scheduling; TaskTracker; Heterogeneous environments;
D O I
10.1007/978-81-322-2529-4_49
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
MapReduce is presently recognized as a significant parallel and distributed programming model with wide acclaim for large scale computing. MapReduce framework divides a job into map, reduce tasks and schedules these tasks in a distributed manner across the cluster. Scheduling of tasks and identification of "slow TaskTrackers" in heterogeneous Hadoop clusters is the focus of recent research. MapReduce performance is currently limited by its default scheduler, which does not adapt well in heterogeneous environments. In this paper, we propose a scheduling method to identify "slow TaskTrackers" in a heterogeneous Hadoop cluster and implement the proposed method by integrating it with the Hadoop default scheduling algorithm. The performance of this method is compared with the Hadoop default scheduler. We observe that the proposed approach shows modest but consistent improvement against the default Hadoop scheduler in heterogeneous environments. We see that it improves by minimizing the overall job execution time.
引用
收藏
页码:465 / 473
页数:9
相关论文
共 31 条