Reliable Distributed Real-time and Embedded Systems Through Safe Middleware Adaptation

Cited by: 0
Authors
Dabholkar, Akshay [1 ]
Dubey, Abhishek [1 ]
Gokhale, Aniruddha [1 ]
Karsai, Gabor [1 ]
Mahadevan, Nagabhushan [1 ]
Affiliation
[1] Vanderbilt Univ, Dept EECS, Inst Software Integrated Syst, Nashville, TN 37235 USA
Keywords
Middleware; Adaptation; Fault Tolerance; Real-time; Software Health Management; Profiling
DOI
10.1109/SRDS.2012.59
Chinese Library Classification
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
Distributed real-time and embedded (DRE) systems are a class of real-time systems formed through a composition of predominantly legacy, closed, and statically scheduled real-time subsystems, which comprise over-provisioned resources to deal with worst-case failure scenarios. The formation of the system-of-systems leads to a new range of faults that manifest at different granularities and for which no statically defined fault tolerance scheme applies. Thus, dynamic and adaptive fault tolerance mechanisms are needed, and they must execute within the available resources without compromising the safety and timeliness of existing real-time tasks in the individual subsystems. To address these requirements, this paper describes a middleware solution called Safe Middleware Adaptation for Real-Time Fault Tolerance (SafeMAT), which opportunistically leverages the available slack in the over-provisioned resources of individual subsystems. SafeMAT comprises three primary artifacts: (1) a flexible and configurable distributed runtime resource monitoring framework that can pinpoint in real time the available slack in the system, which is used in making dynamic and adaptive fault tolerance decisions; (2) a safe and resource-aware dynamic failure adaptation algorithm that enables efficient recovery from different granularities of failures within the available slack in the execution schedule while ensuring that real-time constraints are not violated and resources are not overloaded; and (3) a framework that empirically validates the correctness of the dynamic mechanisms and the safety of the DRE system. Experimental results evaluating SafeMAT on an avionics application indicate that SafeMAT incurs only 9-15% runtime failover and 2-6% processor utilization overheads, thereby providing safe and predictable failure adaptability in real time.
Pages: 362-371
Page count: 10
Related papers
50 in total
  • [21] μDDS: A Middleware for Real-time Wireless Embedded Systems
    Gonzalez, Apolinar
    Mata, W.
    Villasenor, L.
    Aquino, R.
    Simo, Jose
    Chavez, M.
    Crespo, A.
    [J]. JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS, 2011, 64 (3-4) : 489 - 503
  • [22] A context-aware reflective middleware framework for distributed real-time and embedded systems
    Liu, Shengpu
    Cheng, Liang
    [J]. JOURNAL OF SYSTEMS AND SOFTWARE, 2011, 84 (02) : 205 - 218
  • [23] Adaptive techniques for minimizing middleware memory footprint for distributed, real-time, embedded systems
    Panahi, M
    Harmon, T
    Klefstad, R
    [J]. CCW 2003: IEEE 18TH ANNUAL WORKSHOP ON COMPUTER COMMUNICATIONS, PROCEEDINGS, 2003, : 54 - 58
  • [24] Model driven middleware: A new paradigm for developing distributed real-time and embedded systems
    Gokhale, Aniruddha
    Balasubramanian, Krishnakumar
    Krishna, Arvind S.
    Balasubramanian, Jaiganesh
    Edwards, George
    Deng, Gan
    Turkay, Emre
    Parsons, Jeffrey
    Schmidt, Douglas C.
    [J]. SCIENCE OF COMPUTER PROGRAMMING, 2008, 73 (01) : 39 - 58
  • [25] MIRTES: MIddleware for Real-time Transactions in Embedded Systems
    Alessandrelli, Daniele
    Pagano, Paolo
    Nastasi, Christian
    Petracca, Matteo
    Dragoni, Aldo Franco
    [J]. 3RD INTERNATIONAL CONFERENCE ON HUMAN SYSTEM INTERACTION, 2010, : 586 - 593
  • [26] Compadres: A lightweight component middleware framework for composing distributed real-time embedded systems with real-time Java
    Hu, Jie
    Gorappa, Shruti
    Colmenares, Juan A.
    Klefstad, Raymond
    [J]. MIDDLEWARE 2007, PROCEEDINGS, 2007, 4834 : 41 - +
  • [27] Strategies for Reliable, Cloud-based Distributed Real-time and Embedded Systems
    An, Kyoungho
    [J]. 2012 31ST INTERNATIONAL SYMPOSIUM ON RELIABLE DISTRIBUTED SYSTEMS (SRDS 2012), 2012, : 483 - 484
  • [28] A real-time distributed scheduling service for middleware systems
    Zhang, JY
    DiPippo, L
    Fay-Wolfe, V
    Bryan, K
    Murphy, M
    [J]. WORDS 2005: 10TH IEEE INTERNATIONAL WORKSHOP ON OBJECT-ORIENTED REAL-TIME DEPENDABLE, PROCEEDINGS, 2005, : 59 - 65
  • [29] Study on the Distributed Real-time and Embedded System Middleware Based on the DDS
    Ren Hao-li
    Jiao Yan-ping
    [J]. MATERIALS SCIENCE AND INFORMATION TECHNOLOGY, PTS 1-8, 2012, 433-440 : 7522 - +
  • [30] Supporting component-based failover units in middleware for distributed real-time and embedded systems
    Wolf, Friedhelm
    Balasubramanian, Jaiganesh
    Tambe, Sumant
    Gokhale, Aniruddha
    Schmidt, Douglas C.
    [J]. JOURNAL OF SYSTEMS ARCHITECTURE, 2011, 57 (06) : 597 - 613