A resource manager for optimal resource selection and fault tolerance service in grids

被引:0
|
作者
Lee, HM [1 ]
Chin, SH [1 ]
Lee, JH [1 ]
Lee, DW [1 ]
Chung, KS [1 ]
Jung, SY [1 ]
Yu, HC [1 ]
机构
[1] Korea Univ, Dept Comp Sci & Educ, Seoul, South Korea
关键词
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
In this paper, we address the issues of resource management and fault tolerance in Grids. In Grids, the state of the selected resources for job execution is a primary factor that determines the computing performance. Specifically, we propose a resource manager for optimal resource selection. The resource manager automatically selects the optimal resources among candidate resources using a genetic algorithm.. Typically, the probability of failure is higher in the grid computing than in a traditional parallel computing and the failure of resources affects job execution fatally. Therefore, a fault tolerance service is essential in computational grids and grid services are often expected to meet some minimum levels of Quality of Service (QoS) for desirable operation. To address this issue, we also propose fault tolerance service to satisfy QoS requirements. We extend the definition of failures, such as process failure, processor failure, and network failure, and design the fault detector and fault manager. The simulation results indicate that our approaches are promising in that (1) our resource manager finds the optimal set of resources that guarantees the optimal performance, (2) fault detector detects the occurrence of resource failures and (3) fault manager guarantees that the submitted jobs complete and improves the performance of job execution due to job migration even if some failures happen.
引用
收藏
页码:572 / 579
页数:8
相关论文
共 50 条
  • [41] Optimal resource selection by using fuzzy numbers
    Aqeel, Muhammad
    Ansari, M. A.
    Affaan, Muhammad
    2007 INTERNATIONAL CONFERENCE ON ELECTRICAL ENGINEERING, 2007, : 192 - +
  • [42] Resource bartering in grids
    Özturan, C
    CONCURRENT INFORMATION PROCESSING AND COMPUTING, 2005, 195 : 83 - 95
  • [43] A NOVEL RESOURCE ORIENTED SERVICE SELECTION APPROACH FOR SERVICE COMPOSITION
    Sun, Liang
    Yang, Dong
    Liu, Chang
    Qin, Yajuan
    Zhang, Hongke
    2011 4TH IEEE INTERNATIONAL CONFERENCE ON BROADBAND NETWORK AND MULTIMEDIA TECHNOLOGY (4TH IEEE IC-BNMT2011), 2011, : 349 - 354
  • [44] Fault-tolerant resource discovery in peer-to-peer grids
    Merz P.
    Gorunova K.
    Journal of Grid Computing, 2007, 5 (3) : 319 - 335
  • [45] A task replication and fair resource management scheme for fault tolerant grids
    Litke, A
    Tserpes, K
    Dolkas, K
    Varvarigou, T
    ADVANCES IN GRID COMPUTING - EGC 2005, 2005, 3470 : 1022 - 1031
  • [46] A Model for the Storage Resource Manager
    Domenici, Andrea
    Donno, Flavia
    GRID COMPUTING, 2009, : 99 - +
  • [47] THE RESOURCE MANAGER IN PHYSICAL REHABILITATION
    EVANS, RL
    REHABILITATION LITERATURE, 1984, 45 (1-2) : 16 - 18
  • [48] Resource Manager for heterogeneous processors
    Aparicio-Santos, J. A.
    Benitez-Perez, H.
    Alvarez-Icaza, L.
    Mendoza-Rodriguez, L.
    INTERNATIONAL JOURNAL OF COMPUTERS COMMUNICATIONS & CONTROL, 2024, 19 (04)
  • [49] DLFM: A transactional resource manager
    Hsiao, HI
    Narang, I
    SIGMOD RECORD, 2000, 29 (02) : 518 - 528
  • [50] THE FORSTMEISTER - RESOURCE MANAGER EXTRAORDINARY
    SHAFER, EL
    JOURNAL OF SOIL AND WATER CONSERVATION, 1980, 35 (02) : 72 - 75