A resource manager for optimal resource selection and fault tolerance service in grids

Cited by: 0
Authors
Lee, HM [1 ]
Chin, SH [1 ]
Lee, JH [1 ]
Lee, DW [1 ]
Chung, KS [1 ]
Jung, SY [1 ]
Yu, HC [1 ]
Affiliation
[1] Korea Univ, Dept Comp Sci & Educ, Seoul, South Korea
Keywords
DOI
Not available
Chinese Library Classification number
TP301 [Theory and methods];
Discipline classification code
081202 ;
Abstract
In this paper, we address the issues of resource management and fault tolerance in Grids. In Grids, the state of the resources selected for job execution is a primary factor that determines computing performance. Specifically, we propose a resource manager for optimal resource selection. The resource manager automatically selects the optimal resources among candidate resources using a genetic algorithm. Typically, the probability of failure is higher in grid computing than in traditional parallel computing, and the failure of resources can be fatal to job execution. Therefore, a fault tolerance service is essential in computational grids, and grid services are often expected to meet some minimum level of Quality of Service (QoS) for desirable operation. To address this issue, we also propose a fault tolerance service to satisfy QoS requirements. We extend the definition of failures to cover process failure, processor failure, and network failure, and design a fault detector and a fault manager. The simulation results indicate that our approaches are promising in that (1) our resource manager finds the optimal set of resources that guarantees optimal performance, (2) the fault detector detects the occurrence of resource failures, and (3) the fault manager guarantees that submitted jobs complete and, through job migration, improves the performance of job execution even if some failures happen.
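The abstract does not describe the encoding, fitness function, or operators of the genetic algorithm, so the following is only a minimal illustrative sketch of genetic-algorithm-based resource selection in general, not the authors' method. It assumes each candidate resource has a single hypothetical `speeds` score and that a job needs `k` resources; the names `fitness`, `crossover`, `mutate`, and `select_resources`, and all parameter values, are made up for illustration.

```python
import random

def fitness(selection, speeds):
    """Hypothetical fitness: total speed score of the chosen resources."""
    return sum(speeds[i] for i in selection)

def crossover(a, b, k, rng):
    """Child draws k distinct resources from the union of both parents."""
    pool = sorted(set(a) | set(b))
    return rng.sample(pool, k)

def mutate(selection, n, rng, rate=0.2):
    """With probability `rate`, swap one chosen resource for an unchosen one."""
    selection = list(selection)
    unused = [i for i in range(n) if i not in selection]
    if unused and rng.random() < rate:
        selection[rng.randrange(len(selection))] = rng.choice(unused)
    return selection

def select_resources(speeds, k, pop_size=20, generations=50, seed=0):
    """Evolve sets of k resource indices; returns (best set, best-fitness history)."""
    rng = random.Random(seed)
    n = len(speeds)
    # Each individual is a list of k distinct candidate-resource indices.
    population = [rng.sample(range(n), k) for _ in range(pop_size)]
    history = []
    for _ in range(generations):
        population.sort(key=lambda s: fitness(s, speeds), reverse=True)
        history.append(fitness(population[0], speeds))
        # Elitism: the better half survives unchanged, so the best never gets worse.
        elite = population[: pop_size // 2]
        children = [
            mutate(crossover(rng.choice(elite), rng.choice(elite), k, rng), n, rng)
            for _ in range(pop_size - len(elite))
        ]
        population = elite + children
    best = max(population, key=lambda s: fitness(s, speeds))
    history.append(fitness(best, speeds))
    return best, history

# Example with made-up speed scores for eight candidate resources.
speeds = [3.0, 1.5, 4.2, 2.8, 5.1, 0.9, 3.7, 4.8]
best, history = select_resources(speeds, k=3)
print(sorted(best), fitness(best, speeds))
```

A real grid resource manager would presumably score candidates on measured state such as load, bandwidth, or availability rather than a single static number; the single-score fitness here is a simplifying assumption.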
Pages: 572-579
Page count: 8