Run-Time Root Cause Analysis in Adaptive Distributed Systems

Authors
Raj, Amit [1]
Barrett, Stephen [1]
Clarke, Siobhan [1]
Affiliations
[1] Univ Dublin Trinity Coll, Sch Comp Sci & Stat, Dublin 2, Ireland
Keywords
Distributed systems; Root cause analysis; Fault causal relationship; Adaptive system; Component-based system; Statecharts
DOI
Not available
Chinese Library Classification
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
In a distributed environment, several components collaborate to provide complex functionality. Adaptation is an emerging trend in distributed systems: the system reconfigures itself by adding, removing, or updating components in order to cope with faults. Components are generally interdependent, so a fault propagates from one component to another. Existing root cause analysis techniques generally build a static fault dependency graph to identify the root fault. However, these dependencies change with each adaptation, which makes design-time fault dependencies invalid at run-time. This paper describes the problem of deriving causal relationships between faults in adaptive distributed systems. It then presents a statechart-based solution that statically identifies the sequence of method executions in order to derive the causal relationships of faults at run-time. The evaluation finds the approach to be highly scalable and time-efficient, so it can be used to reduce the Mean Time To Recover (MTTR) of a distributed system.
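The staleness problem the abstract describes can be illustrated with a small Python sketch. This is not the paper's statechart-based method; it is a minimal toy model, assuming that a fault propagates from a called component to its callers and that a root fault is a faulty component with no faulty dependency. The class `DependencyGraph`, the function `root_faults`, and the component names are hypothetical.

```python
from collections import defaultdict


class DependencyGraph:
    """Hypothetical run-time dependency graph: caller -> set of callees."""

    def __init__(self):
        self.depends_on = defaultdict(set)

    def add_dependency(self, caller, callee):
        # Record that `caller` invokes `callee`, so a fault in `callee`
        # may propagate to `caller`.
        self.depends_on[caller].add(callee)

    def remove_component(self, name):
        # Adaptation step: drop the component and every edge pointing to it.
        self.depends_on.pop(name, None)
        for callees in self.depends_on.values():
            callees.discard(name)


def root_faults(graph, faulty):
    """Return faulty components that have no faulty direct dependency.

    Assumption: with cyclic dependencies this simple rule can return an
    empty set; a real analysis would need to handle that case.
    """
    faulty = set(faulty)
    return {
        c for c in faulty
        if not (graph.depends_on.get(c, set()) & faulty)
    }


# Design-time view: A calls B, B calls C.
graph = DependencyGraph()
graph.add_dependency("A", "B")
graph.add_dependency("B", "C")
before = root_faults(graph, {"A", "B", "C"})

# Adaptation: C is removed and B is rewired to a new component D.
graph.remove_component("C")
graph.add_dependency("B", "D")
after = root_faults(graph, {"A", "B"})

print(before, after)
```

In this toy model the root fault moves from `C` to `B` once the adaptation is applied, so a graph fixed at design time would point at a component that no longer exists.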
Pages: 292-301
Page count: 10