Fault tolerance using "Parallel shadow image servers (PSIS)" in grid based computing environment

被引:3
|
作者
Hussain, Naveed [1 ]
Ansari, M. A.
Yasin, M. M.
Rauf, Abdul
Haider, Sajjad
机构
[1] Natl Univ Modern Languages, Dept Informat Technol, Islamabad, Pakistan
[2] Fed Urdu Univ Arts Sci & Technol, Dept Comp Sci, Islamabad, Pakistan
[3] COMSATS Inst Informat Technol, Dept Comp Sci, Islamabad, Pakistan
关键词
grid computing; fault tolerance; PSIS; condor; cactus; job scheduling;
D O I
10.1109/ICET.2006.335982
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper will present a critical review, of the existing fault tolerance mechanism in grid computing and the overhead involved in terms of reprocessing or rescheduling of jobs, if in case a fault arisen For this purpose we suggested the Parallel Shadow Image Server (PSIS) copying techniques in parallel to the Resource Manager for having the check points for rescheduling of jobs from the nearest flag, if in case the fault is detected. The job process is to be scheduled from the resource manager node to the worker nodes and then its' submitted back by the worker nodes in serialized form to the Parallel Shadow Image Servers from the worker nodes after the pre-specified amount of time, which we call the recent spawn or the flag check point for rescheduling or reprocessing of job. If the fault is arisen then the rescheduling will be done from the recent check point and will be submitted to the worker rode from where the job was terminated. This will not only save time but will improve the performance up to major extent.
引用
收藏
页码:703 / 707
页数:5
相关论文
共 50 条
  • [31] Epidemic Fault Tolerance for Extreme-Scale Parallel Computing
    Katti, Amogh
    Di Fatta, Giuseppe
    INTERNET AND DISTRIBUTED COMPUTING SYSTEMS, IDCS 2015, 2015, 9258 : 201 - 208
  • [32] Volunteer availability based fault tolerant scheduling mechanism in desktop grid computing environment
    Choi, SJ
    Baik, MS
    Hwang, CS
    Gil, JM
    Yu, HC
    THIRD IEEE INTERNATIONAL SYMPOSIUM ON NETWORK COMPUTING AND APPLICATIONS, PROCEEDINGS, 2004, : 366 - 371
  • [33] Fault Tolerant Scheduling of Workflows in Grid Computing Environment (FTSW)
    Srikala, K.
    Ramachandram, S.
    2015 GLOBAL CONFERENCE ON COMMUNICATION TECHNOLOGIES (GCCT), 2015, : 339 - 343
  • [34] A Survey on QOS and Fault Tolerance based Service Scheduling Techniques in Fog Computing Environment
    Raghavendra, M. Sri
    Chawla, Priyanka
    2018 7TH INTERNATIONAL CONFERENCE ON RELIABILITY, INFOCOM TECHNOLOGIES AND OPTIMIZATION (TRENDS AND FUTURE DIRECTIONS) (ICRITO) (ICRITO), 2018, : 365 - 372
  • [35] Predictive analysis-based load balancing and fault tolerance in fog computing environment
    Vijaita Kashyap
    Rakesh Ahuja
    Ashok Kumar
    Cluster Computing, 2025, 28 (5)
  • [36] Dynamic simulation for spherical parallel manipulators in grid computing environment
    Zhang, Jun-Fu
    Xu, Li-Ju
    Wang, Jie
    Sichuan Daxue Xuebao (Gongcheng Kexue Ban)/Journal of Sichuan University (Engineering Science Edition), 2007, 39 (05): : 164 - 170
  • [37] Rescue Robot Navigation Parallel Algorithm in Grid Computing Environment
    Wang, Wei
    Shan, Xinjian
    Jia, Shenjie
    ADVANCED MATERIALS AND INFORMATION TECHNOLOGY PROCESSING, PTS 1-3, 2011, 271-273 : 114 - +
  • [38] A parallel application programming and processing environment proposal for grid computing
    Gomes Junior, Augusto Mendes
    Sato, Liria Matsumoto
    Massetto, Francisco Isidro
    15TH IEEE INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND ENGINEERING (CSE 2012) / 10TH IEEE/IFIP INTERNATIONAL CONFERENCE ON EMBEDDED AND UBIQUITOUS COMPUTING (EUC 2012), 2012, : 154 - 161
  • [39] A drug discovery grid environment with fault-tolerance support
    Wang, Yongjian
    Ren, Yinan
    Chen, Ting
    Huang, Yuanqiang
    Yu, Kunqian
    Luan, Zhongzhi
    Jiang, Hualiang
    Qian, Depei
    Hsi-An Chiao Tung Ta Hsueh/Journal of Xi'an Jiaotong University, 2009, 43 (12): : 21 - 25
  • [40] Enabling parallel TMA image analysis in a grid environment
    Galizia, Antonella
    D'Agostino, Daniele
    Clematis, Andrea
    Viti, Federica
    Orro, Alessandro
    Merelli, Ivan
    Milanesi, Luciano
    CISIS 2008: THE SECOND INTERNATIONAL CONFERENCE ON COMPLEX, INTELLIGENT AND SOFTWARE INTENSIVE SYSTEMS, PROCEEDINGS, 2008, : 394 - +