Fault tolerant job scheduling in computational grid

被引:6
|
作者
Nazir, Babar [1 ]
Khan, Taimoor [1 ]
机构
[1] COMSATS Inst Informat Technol, Dept Comp Sci, Abbottabad, Pakistan
关键词
grid computing; grid scheduling; computational grid; job scheduling; fault tolerance; resource management;
D O I
10.1109/ICET.2006.335930
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In large-scale grids, the probability of a failure is much greater than in traditional parallel systems [1]. Therefore, fault tolerance has become a crucial area in grid computing. In this paper, we address the problem of fault tolerance in term of resource failure. We devise a strategy for fault tolerant job scheduling in computational grid. Proposed strategy maintains history of the fault occurrence of resource in Grid Information Service (GIS). Whenever a resource broker has job to schedule it uses the resource fault occurrence history information from GIS and depending on this information use different intensity of check pointing and replication while scheduling the job on resources which have different tendency towards fault. Using check pointing proposed scheme can make grid scheduling more reliable and efficient. Further, it increases the percentage of jobs executed within specified deadline and allotted budget, hence helping in making grid trustworthy. Through simulation we have evaluated the peformance of the proposed strategy. The experimental results demonstrate that proposed strategy effectively schedule the grid jobs in fault tolerant way in spite of highly dynamic nature of grid.
引用
收藏
页码:708 / +
页数:3
相关论文
共 50 条
  • [1] Towards optimal fault tolerant scheduling in computational grid
    Imran, Muhammad
    Niaz, Iftikhar Azim
    Haider, Sajjad
    Hussain, Naveed
    Ansari, M. A.
    [J]. THIRD INTERNATIONAL CONFERENCE ON EMERGING TECHNOLOGIES 2007, PROCEEDINGS, 2007, : 154 - +
  • [2] Robust Fault Tolerant Job Scheduling Approach In Grid Environment
    Balpande, Mangesh
    Shrawankar, Urmila
    [J]. 2014 INTERNATIONAL CONFERENCE ON CIRCUITS, SYSTEMS, COMMUNICATION AND INFORMATION TECHNOLOGY APPLICATIONS (CSCITA), 2014, : 259 - 264
  • [3] Fault-tolerant scheduling of independent tasks in computational grid
    Zheng, Qin
    Veeravalli, Bharadwaj
    Tham, Chen-Khong
    [J]. 2006 10TH IEEE SINGAPORE INTERNATIONAL CONFERENCE ON COMMUNICATION SYSTEMS, VOLS 1 AND 2, 2006, : 102 - +
  • [4] Component based proactive fault tolerant scheduling in computational grid
    Haider, Sajjad
    Imran, Muhammad
    Niaz, Iftikhar Azim
    Ullah, Saeed
    Ansari, M. A.
    [J]. THIRD INTERNATIONAL CONFERENCE ON EMERGING TECHNOLOGIES 2007, PROCEEDINGS, 2007, : 119 - +
  • [5] Proactive Fault Tolerance Algorithm for Job Scheduling in Computational Grid
    Singh, Sarpreet
    Bawa, R. K.
    [J]. INTERNATIONAL JOURNAL OF GRID AND DISTRIBUTED COMPUTING, 2016, 9 (03): : 135 - 143
  • [6] A fuzzy logic approach for secure and fault tolerant grid job scheduling
    Jiang, Congfeng
    Wang, Cheng
    Liu, Xiaohu
    Zhao, Yinghui
    [J]. AUTONOMIC AND TRUSTED COMPUTING, PROCEEDINGS, 2007, 4610 : 549 - +
  • [7] Dynamic and Adaptive Fault Tolerant Scheduling With QoS Consideration in Computational Grid
    Haider, Sajjad
    Nazir, Babar
    [J]. IEEE ACCESS, 2017, 5 : 7853 - 7873
  • [8] An algorithm for online distributed fault-tolerant job scheduling in grid computing
    Zeng, Jun
    [J]. INTERNATIONAL JOURNAL OF WEB AND GRID SERVICES, 2021, 17 (04) : 389 - 407
  • [9] Replication based fault tolerant job scheduling strategy for economy driven grid
    Babar Nazir
    Kalim Qureshi
    Paul Manuel
    [J]. The Journal of Supercomputing, 2012, 62 : 855 - 873
  • [10] Replication based fault tolerant job scheduling strategy for economy driven grid
    Nazir, Babar
    Qureshi, Kalim
    Manuel, Paul
    [J]. JOURNAL OF SUPERCOMPUTING, 2012, 62 (02): : 855 - 873