Improving job scheduling performance with parallel access to replicas in Data Grid environment

被引:2
|
作者
Zhang, Junwei [1 ]
Lee, Bu-Sung [1 ]
Tang, Xueyan [1 ]
Yeo, Chai-Kiat [1 ]
机构
[1] Nanyang Technol Univ, Sch Comp Engn, Singapore, Singapore
来源
JOURNAL OF SUPERCOMPUTING | 2011年 / 56卷 / 03期
关键词
Data Grid; Data Replication; Parallel download; Job scheduling;
D O I
10.1007/s11227-009-0365-7
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Data Grid has evolved to be the solution for data-intensive applications, such as High Energy Physics (HEP), astrophysics, and computational genomics. These applications usually have large input of data to be analyzed and these input data are widely replicated across Data Grid to improve the performance. The job scheduling performance on traditional computing jobs can be studied using queuing theory. However, with the addition of data transfer, the job scheduling performance is too complex to be modeled. In this research, we study the impact of data transfer on the performance of job scheduling in the Data Grid environment. We have proposed a parallel downloading system that supports replicating data fragments and parallel downloading of replicated data fragments, to improve the job scheduling performance. The performance of the parallel downloading system is compared with non-parallel downloading system, using three scheduling heuristics: Shortest Turnaround Time (STT), Least Relative Load (LRL) and Data Present (DP). Our simulation results show that the proposed parallel download approach greatly improves the Data Grid performance for all three scheduling algorithms, in terms of the geometric mean of job turnaround time. The advantage of parallel downloading system is most evident when the Data Grid has relatively low network bandwidth and relatively high computing power.
引用
收藏
页码:245 / 269
页数:25
相关论文
共 50 条
  • [1] Improving job scheduling performance with parallel access to replicas in Data Grid environment
    Junwei Zhang
    Bu-Sung Lee
    Xueyan Tang
    Chai-Kiat Yeo
    [J]. The Journal of Supercomputing, 2011, 56 : 245 - 269
  • [2] Impact of Parallel Download on Job Scheduling in Data Grid Environment
    Zhang, Junwei
    Lee, Bu-Sung
    Tang, Xueyan
    Yeo, Chai-Kiat
    [J]. GCC 2008: SEVENTH INTERNATIONAL CONFERENCE ON GRID AND COOPERATIVE COMPUTING, PROCEEDINGS, 2008, : 102 - 109
  • [3] Improving job scheduling algorithms in a grid environment
    Lee, Yun-Han
    Leu, Seiven
    Chang, Ruay-Shiung
    [J]. FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2011, 27 (08): : 991 - 998
  • [4] Efficient and dynamic Parallel Job Scheduling for bioinformatics Data Management in Data Grid Environment
    Kumar, K. Ashok
    Chandrasekar, C.
    [J]. RESEARCH JOURNAL OF PHARMACEUTICAL BIOLOGICAL AND CHEMICAL SCIENCES, 2015, 6 (03): : 1492 - 1501
  • [5] Job scheduling and dynamic data replication in data grid environment
    Mansouri, Najme
    Dastghaibyfard, Gholam Hosein
    [J]. JOURNAL OF SUPERCOMPUTING, 2013, 64 (01): : 204 - 225
  • [6] Job scheduling and dynamic data replication in data grid environment
    Najme Mansouri
    Gholam Hosein Dastghaibyfard
    [J]. The Journal of Supercomputing, 2013, 64 : 204 - 225
  • [7] An Efficient Evolutionary Scheduling Algorithm for Parallel Job Model in Grid Environment
    Switalski, Piotr
    Seredynski, Franciszek
    [J]. PARALLEL COMPUTING TECHNOLOGIES, 2011, 6873 : 347 - +
  • [8] The impact of data replication on job scheduling performance in the Data Grid
    Tang, M
    Lee, BS
    Tang, XY
    Yeo, CK
    [J]. FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2006, 22 (03): : 254 - 268
  • [9] A Performance Optimization of Job Scheduling Model Based on Grid Environment
    Lee, Chong-Yen
    Lee, Tsang-Yean
    Wu, Homer
    Tsui, Hau-Dong
    Huang, Jiun-Bo
    [J]. ICCIT: 2009 FOURTH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCES AND CONVERGENCE INFORMATION TECHNOLOGY, VOLS 1 AND 2, 2009, : 768 - 773
  • [10] Optimal Number of Replicas in Data Grid Environment
    Mansouri, Yasser
    Garmehi, Mehran
    Sargolzaei, Mahdi
    Shadi, Mahdieh
    [J]. DFMA 2008: FIRST INTERNATIONAL CONFERENCE ON DISTRIBUTED FRAMEWORKS & APPLICATIONS, PROCEEDINGS, 2008, : 96 - +