Fault tolerant job scheduling in computational grid

Cited by: 6
Authors
Nazir, Babar [1 ]
Khan, Taimoor [1 ]
Affiliations
[1] COMSATS Inst Informat Technol, Dept Comp Sci, Abbottabad, Pakistan
Keywords
grid computing; grid scheduling; computational grid; job scheduling; fault tolerance; resource management;
DOI
10.1109/ICET.2006.335930
CLC number
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
In large-scale grids, the probability of a failure is much greater than in traditional parallel systems [1]. Fault tolerance has therefore become a crucial area in grid computing. In this paper, we address the problem of fault tolerance in terms of resource failure. We devise a strategy for fault-tolerant job scheduling in a computational grid. The proposed strategy maintains a history of each resource's fault occurrences in the Grid Information Service (GIS). Whenever a resource broker has a job to schedule, it uses the fault occurrence history from the GIS and, depending on this information, applies different intensities of checkpointing and replication when scheduling the job on resources that have different tendencies toward faults. Using checkpointing, the proposed scheme can make grid scheduling more reliable and efficient. Further, it increases the percentage of jobs executed within the specified deadline and allotted budget, helping to make the grid trustworthy. We evaluated the performance of the proposed strategy through simulation. The experimental results demonstrate that the proposed strategy effectively schedules grid jobs in a fault-tolerant way despite the highly dynamic nature of the grid.
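The abstract does not give the decision rule itself, so the following is only a minimal sketch of the general idea: a broker reads a resource's fault history and picks a checkpoint interval and replica count accordingly. The `Resource` fields, the `plan_job` function, the fault-rate thresholds (0.1 and 0.3), and the interval values are all hypothetical and are not taken from the paper.

```python
from dataclasses import dataclass


@dataclass
class Resource:
    """Fault history for one grid resource (hypothetical GIS record)."""
    name: str
    jobs_run: int     # jobs previously assigned to this resource
    jobs_failed: int  # of those, how many were lost to a resource fault

    @property
    def fault_rate(self) -> float:
        # Assumption: a resource with no history is treated as fault-free.
        return self.jobs_failed / self.jobs_run if self.jobs_run else 0.0


@dataclass
class Plan:
    """Fault tolerance settings chosen for one job on one resource."""
    resource: str
    checkpoint_interval_s: int  # shorter interval = more intensive checkpointing
    replicas: int               # number of copies of the job to run


def plan_job(resource: Resource, base_interval_s: int = 600) -> Plan:
    """Pick checkpointing and replication intensity from fault history.

    The thresholds below are illustrative assumptions, not the paper's values.
    """
    rate = resource.fault_rate
    if rate < 0.1:
        # Reliable resource: checkpoint rarely, no extra replica.
        return Plan(resource.name, base_interval_s, 1)
    if rate < 0.3:
        # Moderately faulty resource: checkpoint twice as often.
        return Plan(resource.name, base_interval_s // 2, 1)
    # Fault-prone resource: checkpoint four times as often and replicate.
    return Plan(resource.name, base_interval_s // 4, 2)


if __name__ == "__main__":
    for res in (Resource("r1", 100, 2), Resource("r2", 100, 20), Resource("r3", 100, 50)):
        print(plan_job(res))
```

In this sketch a higher observed fault rate shortens the checkpoint interval and, past the upper threshold, adds a second replica, which is one plausible reading of "different intensity of checkpointing and replication".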
Pages: 708 / +
Page count: 3