An agent oriented proactive fault-tolerant framework for grid computing

被引:7
|
作者
Huda, MT [1 ]
Schmidt, HW [1 ]
Peake, ID [1 ]
机构
[1] Monash Univ, Ctr Distributed Syst & Software Engn, Melbourne, Vic 3004, Australia
关键词
D O I
10.1109/E-SCIENCE.2005.15
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Because of computational grid heterogeneity, scale and complexity, faults become likely. Therefore, grid infrastructure must have mechanisms to deal with faults while also providing efficient and reliable services to its end users. Existing fault-tolerant approaches are inefficient because they are reactive and incomplete. They are reactive because they only deal with faults when they take place; they are incomplete because they only deal with certain types of faults. Proactive approaches increase efficiency by reducing the cost and time of operations and network resource usage by maintaining the state of executing applications and resuming operation when rescheduled. This paper presents an agent oriented, fault-tolerant grid framework where agents deal wit h individual faults proactively. Agents maintain information about hardware conditions, executing process memory consumption, available resources, network conditions and component mean time to failure. Based on this information and critical states, agent can improve the reliability and efficiency of grid services.
引用
下载
收藏
页码:304 / 311
页数:8
相关论文
共 50 条
  • [1] FAULT-TOLERANT COMPUTING
    TOY, WN
    ADVANCES IN COMPUTERS, 1987, 26 : 201 - 279
  • [2] FAULT-TOLERANT COMPUTING
    PRADHAN, DK
    COMPUTER, 1980, 13 (03) : 6 - 7
  • [3] DARX - A framework for the fault-tolerant support of agent software
    Marin, O
    Bertier, M
    Sens, P
    ISSRE 2003: 14TH INTERNATIONAL SYMPOSIUM ON SOFTWARE RELIABILITY ENGINEERING, PROCEEDINGS, 2003, : 406 - 416
  • [4] A fault-tolerant multi-agent development framework
    Wang, L
    Li, HF
    Goswami, D
    Wei, ZC
    PARALLEL AND DISTRIBUTED PROCESSING AND APPLICATIONS, PROCEEDINGS, 2004, 3358 : 126 - 135
  • [5] GRIDTS A new approach for fault-tolerant scheduling in grid computing
    Favarim, Fabio
    Fraga, Joni da Silva
    Lung, Lau Cheuk
    Correia, Miguel
    SIXTH IEEE INTERNATIONAL SYMPOSIUM ON NETWORK COMPUTING AND APPLICATIONS, PROCEEDINGS, 2007, : 187 - +
  • [6] Proactive Fortification of Fault-Tolerant Services
    Ezhilchelvan, Paul
    Clarke, Dylan
    Mitrani, Isi
    Shrivastava, Santosh
    PRINCIPLES OF DISTRIBUTED SYSTEMS, PROCEEDINGS, 2009, 5923 : 330 - 344
  • [7] PreGAN: Preemptive Migration Prediction Network for Proactive Fault-Tolerant Edge Computing
    Tuli, Shreshth
    Casale, Giuliano
    Jennings, Nicholas R.
    IEEE CONFERENCE ON COMPUTER COMMUNICATIONS (IEEE INFOCOM 2022), 2022, : 670 - 679
  • [8] Migol: A fault-tolerant service framework for MPI applications in the grid
    Luckow, A
    Schnor, B
    RECENT ADVANCES IN PARALLEL VIRTUAL MACHINE AND MESSAGE PASSING INTERFACE, PROCEEDINGS, 2005, 3666 : 258 - 267
  • [9] Migol: A fault-tolerant service framework for MPI applications in the grid
    Luckow, Andre
    Schnor, Bettina
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2008, 24 (02): : 142 - 152
  • [10] A framework for ABFT techniques in the design of fault-tolerant computing systems
    Hodjat Hamidi
    Abbas Vafaei
    Seyed Amirhassan Monadjemi
    EURASIP Journal on Advances in Signal Processing, 2011