An efficient grid scheduling strategy for data parallel applications

Cited by: 6
Authors
Khan, Kashif Hesham [1]
Qureshi, Kalim [2]
Abd-El-Barr, Mostafa [2]
Affiliations
[1] COMSATS Inst Informat Technol, Abbottabad, Pakistan
[2] Kuwait Univ, Dept Informat Sci, Kuwait, Kuwait
Source
JOURNAL OF SUPERCOMPUTING | 2014, Vol. 68, No. 3
Keywords
Grid computing; Task partitioning; Task scheduling; Load balancing; Data parallel applications; Grid scheduling
DOI
10.1007/s11227-014-1114-0
Chinese Library Classification
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
Scheduling large-scale applications in heterogeneous grid systems is a fundamental NP-complete problem that is critical to achieving good performance and execution cost. High performance in a grid system requires effective task partitioning, resource management, and load balancing. The heterogeneous and dynamic nature of a grid, together with the diverse demands of the applications running on it, makes grid scheduling a major task. Existing schedulers in wide-area heterogeneous systems require a large amount of information about the application and the grid environment to produce reasonable schedules. However, this information may not be available, may be too expensive to collect, or may increase the scheduler's runtime overhead to the point that the scheduler becomes ineffective. We believe that no single scheduler is appropriate for all grid systems and applications. Data parallel applications, in which the data can be partitioned further, can be improved through efficient resource management, smart resource selection, and load balancing; in functional parallel applications, whose tasks cannot be divided, such partitioning is impossible, difficult, or expensive in terms of performance. In this paper, we propose a scheduler for data parallel applications (SDPA), which offers an efficient task partitioning and load balancing strategy for data parallel applications in a grid environment. The proposed SDPA offers two major features: maintaining job priority even when too few free resources are available, and pre-task assignment to cut the idle time of nodes. The SDPA selects nodes according to the nature of the task and the availability of the nodes' resources. Simulation results show that SDPA improves on strategies reported in the reviewed literature in terms of execution time, throughput, and waiting time.
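The abstract does not describe how SDPA partitions work, so the following is only a minimal sketch of one common approach to data parallel task partitioning, not the authors' algorithm. It splits a job's work units across nodes in proportion to an assumed per-node speed rating, which is one simple way to balance load on heterogeneous nodes. The function name `partition_by_speed`, the node names, and the speed values are all hypothetical.

```python
def partition_by_speed(total_items, speeds):
    """Split total_items work units across nodes in proportion to speed.

    speeds maps a node name to a positive relative speed rating
    (an assumed input; the paper's node model is not given here).
    Returns a dict mapping each node name to its number of work units.
    """
    total_speed = sum(speeds.values())
    # Ideal (possibly fractional) share for each node.
    shares = {node: total_items * s / total_speed for node, s in speeds.items()}
    # Start from the integer part of each share.
    alloc = {node: int(share) for node, share in shares.items()}
    # Hand the remaining units to the nodes with the largest fractional parts.
    leftover = total_items - sum(alloc.values())
    by_remainder = sorted(speeds, key=lambda n: shares[n] - alloc[n], reverse=True)
    for node in by_remainder[:leftover]:
        alloc[node] += 1
    return alloc


if __name__ == "__main__":
    # Hypothetical grid: node "b" is rated three times faster than node "a".
    print(partition_by_speed(100, {"a": 1, "b": 3}))
```

Under this scheme a faster node receives a proportionally larger chunk, so all nodes are expected to finish at roughly the same time if the speed ratings are accurate.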
Pages: 1487-1502
Page count: 16