A resource manager for optimal resource selection and fault tolerance service in grids

被引:0
|
作者
Lee, HM [1 ]
Chin, SH [1 ]
Lee, JH [1 ]
Lee, DW [1 ]
Chung, KS [1 ]
Jung, SY [1 ]
Yu, HC [1 ]
机构
[1] Korea Univ, Dept Comp Sci & Educ, Seoul, South Korea
关键词
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
In this paper, we address the issues of resource management and fault tolerance in Grids. In Grids, the state of the selected resources for job execution is a primary factor that determines the computing performance. Specifically, we propose a resource manager for optimal resource selection. The resource manager automatically selects the optimal resources among candidate resources using a genetic algorithm.. Typically, the probability of failure is higher in the grid computing than in a traditional parallel computing and the failure of resources affects job execution fatally. Therefore, a fault tolerance service is essential in computational grids and grid services are often expected to meet some minimum levels of Quality of Service (QoS) for desirable operation. To address this issue, we also propose fault tolerance service to satisfy QoS requirements. We extend the definition of failures, such as process failure, processor failure, and network failure, and design the fault detector and fault manager. The simulation results indicate that our approaches are promising in that (1) our resource manager finds the optimal set of resources that guarantees the optimal performance, (2) fault detector detects the occurrence of resource failures and (3) fault manager guarantees that the submitted jobs complete and improves the performance of job execution due to job migration even if some failures happen.
引用
收藏
页码:572 / 579
页数:8
相关论文
共 50 条
  • [1] A Guaranteed Service Resource Selection Framework for Computational Grids
    Kavitha, Ganesh
    Sankaranarayanan, V.
    INTERNATIONAL JOURNAL OF GRID AND DISTRIBUTED COMPUTING, 2013, 6 (03): : 29 - 41
  • [2] A Framework for MGrid Resource Service Optimal-selection and Composition
    Tao, F.
    Zhang, L.
    Zhao, D.
    2009 IEEE INTERNATIONAL CONFERENCE ON INDUSTRIAL ENGINEERING AND ENGINEERING MANAGEMENT, VOLS 1-4, 2009, : 110 - 114
  • [3] An Optimal Service-Selection Model Based On Capability and Quality of Resource Service
    Zhang, Xiaodong
    Zhan Dechen
    Nie, Lanshun
    Zhao, Tianqi
    Xiong, Xiao
    PROCEEDINGS 2014 INTERNATIONAL CONFERENCE ON SERVICE SCIENCES (ICSS 2014), 2014, : 47 - 52
  • [4] The XtreemOS Resource Selection Service
    Stratan, Corina
    Sacha, Jan
    Napper, Jeff
    Costa, Paolo
    Pierre, Guillaume
    ACM TRANSACTIONS ON AUTONOMOUS AND ADAPTIVE SYSTEMS, 2012, 7 (04)
  • [5] Resource selection in grids using contract net
    Goswami, Kunal
    Gupta, Arobinda
    PROCEEDINGS OF THE 16TH EUROMICRO CONFERENCE ON PARALLEL, DISTRIBUTED AND NETWORK-BASED PROCESSING, 2008, : 105 - +
  • [6] On Construction of a MultiGrid Resource Selection Strategy on Grids
    Yang, Chao-Tung
    Hu, Wen-Jen
    Lai, Kuan-Chou
    INTERNATIONAL JOURNAL OF GRID AND HIGH PERFORMANCE COMPUTING, 2014, 6 (01) : 38 - 62
  • [7] Standardizing resource selection and access on computational grids
    Huashan, Yu
    Zhuoqun, Xu
    GCC 2005: FIFTH INTERNATIONAL CONFERENCE ON GRID AND COOPERATIVE COMPUTING, PROCEEDINGS, 2006, : 416 - +
  • [8] Large-scale resource selection in grids
    Roumani, AM
    Skillicorn, DB
    ON THE MOVE TO MEANINGFUL INTERNET SYSTEMS 2004: OTM 2004 WORKSHOPS, PROCEEDINGS, 2004, 3292 : 154 - 164
  • [9] Fault Aware Dynamic Resource Manager for Fault Recognition and Avoidance in Cloud
    Mohanram, Nandhini Jembu
    Thangavel, Gnanasekaran
    Swaroopan, N. M. Jothi
    COMPUTER SYSTEMS SCIENCE AND ENGINEERING, 2021, 38 (02): : 215 - 228
  • [10] Fault Tolerant Resource Management Scheme for Computational Grids
    Kumar, Anuj
    Pathak, Heman
    INTERNATIONAL CONFERENCE ON INTELLIGENT DATA COMMUNICATION TECHNOLOGIES AND INTERNET OF THINGS, ICICI 2018, 2019, 26 : 472 - 481