A Fault Avoidance Strategy Improving the Reliability of the EGI Production Grid Infrastructure

被引:0
|
作者
Palmieri, Francesco [1 ]
Pardi, Silvio [2 ]
Veronesi, Paolo [3 ]
机构
[1] Univ Naples Federico II, Via Cinthia 5, I-80126 Naples, Italy
[2] INFN Istituto Nazionale Di Fisica Nucleare, INDAM, I-80126 Naples, Italy
[3] INFN CNAF, I-40127 Bologna, Italy
来源
关键词
Reliability; Fault Avoidance; Monitoring; Resource Management; COMPUTING SYSTEMS;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Reliability is a crucial issue for the development of stable and effective production grid infrastructures. That is, grid users must be able to trust upon the runtime service they request and receive from the underlying grid. Many runtime services and capabilities offered by modern Grid infrastructures are not available in advance to the application developers and dynamically bound only at the execution time, leading to an increased incidence of interaction faults. In this work we propose, implement and evaluate a novel low-impact fault-avoidance scheme, specifically conceived to improve the grid reliability from the user/application point of view, by providing proper service status information to the workload management system. In particular, starting from the EGEE experience, we designed a strategy inhibiting the use of some specific runtime capabilities on the available resources as soon as the monitoring system detect any anomalous behavior associated to these capabilities and re-integrating them when they restart to correctly work again. The results of a significant set of tests ran on the production EGEE infrastructure, have been presented to show the effectiveness of our approach.
引用
收藏
页码:159 / +
页数:3
相关论文
共 50 条
  • [1] The European Grid Infrastructure (EGI): Current Status and Perspectives for Astronomy and Astrophysics within EGI
    Vuerli, Claudio
    Taffoni, Giuliano
    ASTRONOMICAL DATA ANALYSIS SOFTWARE AND SYSTEMS XXI, 2012, 461 : 815 - +
  • [2] Improving the reliability of infrastructure
    Tranfield, D
    Denyer, D
    PROCEEDINGS OF THE INSTITUTION OF CIVIL ENGINEERS-CIVIL ENGINEERING, 2003, 156 (02) : 56 - 56
  • [3] Enabling Large Scale Data Production for OpenDose with GATE on the EGI Infrastructure
    Chauvin, Maxims
    Mathieu, Gilles
    Camarasu-Pop, Sorina
    Bonnet, Axel
    Bardies, Manuel
    Perseil, Isabelle
    2019 19TH IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND GRID COMPUTING (CCGRID), 2019, : 658 - 665
  • [4] Migration to the GLUE 2.0 information schema in the LCG/EGEE/EGI production Grid
    Burke, Stephen
    Field, Laurence
    Horat, David
    INTERNATIONAL CONFERENCE ON COMPUTING IN HIGH ENERGY AND NUCLEAR PHYSICS (CHEP 2010), 2011, 331
  • [5] A Novel Fault Diagnostic Strategy for PV Micro Grid to Achieve Reliability Centered Maintenance
    Rao, K. Uma
    Parvatikar, Akash G.
    Gokul, S.
    Nitish, N.
    Rao, Pramod
    PROCEEDINGS OF THE FIRST IEEE INTERNATIONAL CONFERENCE ON POWER ELECTRONICS, INTELLIGENT CONTROL AND ENERGY SYSTEMS (ICPEICES 2016), 2016,
  • [6] Improving Power Grid Stability With Communication Infrastructure
    Pavlovski, Martin
    Gajduk, Andrej
    Todorovski, Mirko
    Kocarev, Ljupco
    IEEE JOURNAL ON EMERGING AND SELECTED TOPICS IN CIRCUITS AND SYSTEMS, 2017, 7 (03) : 349 - 358
  • [7] ADAPTIVE FAULT DETECTION STRATEGY IN GRID
    Liang, Hong
    Qi, Xin
    Gao, Yuantao
    CIICT 2008: PROCEEDINGS OF CHINA-IRELAND INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATIONS TECHNOLOGIES 2008, 2008, : 245 - 248
  • [8] Smart Grid Reliability Computation - A Solution to Ageing Infrastructure in Power Grid Networks
    Onoshakpor, Rapheal
    Okafor, K. C.
    Gabriel, Modukpe
    2022 IEEE NIGERIA 4TH INTERNATIONAL CONFERENCE ON DISRUPTIVE TECHNOLOGIES FOR SUSTAINABLE DEVELOPMENT (IEEE NIGERCON), 2022, : 626 - 630
  • [9] IMPROVING THE RELIABILITY OF BUS SYSTEMS - FAULT ISOLATION AND FAULT TOLERANCE
    VOGT, R
    MICROPROCESSING AND MICROPROGRAMMING, 1987, 21 (1-5): : 333 - 338
  • [10] The NorduGrid production Grid infrastructure, status and plans
    Eerola, P
    Kónya, B
    Smirnova, O
    Ekelöf, T
    Ellert, M
    Hansen, JR
    Nielsen, JL
    Wäänänen, A
    Konstantinov, A
    Herrala, J
    Tuisku, M
    Myklebust, T
    Ould-Saada, F
    Vinter, B
    FOURTH INTERNATIONAL WORKSHOP ON GRID COMPUTING, PROCEEDINGS, 2003, : 158 - 165