A Framework for Automated Fault Recovery Planning in Large-Scale Virtualized Infrastructures

被引:0
|
作者
Liu, Feng [1 ]
Danciu, Vitalian A. [1 ]
Kerestey, Pavlo [2 ]
机构
[1] Univ Munich, Munich Network Management Team, Munich, Germany
[2] Tech Univ Munich, Munich, Germany
关键词
fault management; AI planning; virtualization; cloud computing;
D O I
暂无
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Multi-layered provisioning architectures such as those in emergent virtualized (e.g. cloud) infrastructures exacerbate the cost of faults to a degree where automation effectively constitutes a prerequisite for operations. The acquisition of management information and the execution of routine tasks have been automated to some degree; however the decision processes behind fault management in large-scale environments have not. This paper addresses automation of such decision processes by proposing a planning-based fault recovery algorithm based on hierarchical task networks and data models for the knowledge necessary to the recovery process. We embed these concepts in a generic architecture and evaluate its prototypical implementation with respect to function and scalability.
引用
收藏
页码:113 / +
页数:2
相关论文
共 50 条
  • [1] A Framework for Automated Collaborative Fault Detection in Large-Scale Vehicle Networks
    Maroli, John
    Ozguner, Umit
    Redmill, Keith
    [J]. 2019 30TH IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV19), 2019, : 1923 - 1927
  • [2] Attack containment framework for large-scale critical infrastructures
    Nguyen, Hoang
    Nahrstedt, Klara
    [J]. PROCEEDINGS - 16TH INTERNATIONAL CONFERENCE ON COMPUTER COMMUNICATIONS AND NETWORKS, VOLS 1-3, 2007, : 442 - 449
  • [3] Automated Dynamic Resource Provisioning and Monitoring in Virtualized Large-scale Datacenter
    Abar, Sameera
    Lemarinier, Pierre
    Theodoropoulos, Georgios K.
    O'Hare, Gregory M. P.
    [J]. 2014 IEEE 28TH INTERNATIONAL CONFERENCE ON ADVANCED INFORMATION NETWORKING AND APPLICATIONS (AINA), 2014, : 961 - 970
  • [4] Virtualized Disaster Recovery Model for Large-Scale Hospital and Healthcare Systems
    Lee, Olivia F.
    Guster, Dennis C.
    [J]. INTERNATIONAL JOURNAL OF HEALTHCARE INFORMATION SYSTEMS AND INFORMATICS, 2010, 5 (03) : 69 - 81
  • [5] DISTRIBUTED AND OPTIMAL RESILIENT PLANNING OF LARGE-SCALE INTERDEPENDENT CRITICAL INFRASTRUCTURES
    Huang, Linan
    Chen, Juntao
    Zhu, Quanyan
    [J]. 2018 WINTER SIMULATION CONFERENCE (WSC), 2018, : 1096 - 1107
  • [6] Recording of multiple videos in a large-scale space for large-scale virtualized reality
    Kitahara, Itaru
    Ohta, Yuichi
    Saito, Hideo
    Akimichi, Shinji
    Ono, Tooru
    Kanade, Takeo
    [J]. Kyokai Joho Imeji Zasshi/Journal of the Institute of Image Information and Television Engineers, 2002, 56 (08): : 1328 - 1333
  • [7] PULSTORE: Automated storage management with QoS guarantee in large-scale virtualized storage systems
    Qiao, L
    Iyer, BR
    Agrawal, D
    El Abbadi, A
    Uttamchandani, S
    [J]. ICAC 2005: Second International Conference on Autonomic Computing, Proceedings, 2005, : 302 - 303
  • [8] Information technology planning framework for large-scale projects
    Peña-Mora, F
    Vadhavkar, S
    Perkins, E
    Weber, T
    [J]. JOURNAL OF COMPUTING IN CIVIL ENGINEERING, 1999, 13 (04) : 226 - 237
  • [9] Fully Automated Cyclic Planning for Large-Scale Manufacturing Domains
    Asai, Masataro
    Fukunaga, Alex
    [J]. TWENTY-FOURTH INTERNATIONAL CONFERENCE ON AUTOMATED PLANNING AND SCHEDULING, 2014, : 20 - 28
  • [10] AUTONOMOUS FAULT DETECTION AND RECOVERY SYSTEM IN LARGE-SCALE NETWORKS
    Memon, Raheel Ahmed
    Li, Jian Ping
    Shah, Fadia
    [J]. 2016 13TH INTERNATIONAL COMPUTER CONFERENCE ON WAVELET ACTIVE MEDIA TECHNOLOGY AND INFORMATION PROCESSING (ICCWAMTIP), 2016, : 285 - 288