Resilience in Large Scale Distributed Systems

被引:4
|
作者
Matni, Nikolai [1 ]
Leong, Yoke Peng [1 ]
Wang, Yuh Shyang [1 ]
You, Seungil [1 ]
Horowitz, Matanya B. [1 ]
Doyle, John C. [1 ]
机构
[1] CALTECH, Pasadena, CA 91125 USA
关键词
large scale; distributed; resilient; control theory; convex optimization; tradeoffs; fundamental limits; layered architecture;
D O I
10.1016/j.procs.2014.03.036
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Distributed systems are comprised of multiple subsystems that interact in two distinct ways: (1) physical interactions and (2) cyber interactions; i.e. sensors, actuators and computers controlling these subsystems, and the network over which they communicate. A broad class of cyber-physical systems (CPS) are described by such interactions, such as the smart grid, platoons of autonomous vehicles and the sensorimotor system. This paper will survey recent progress in developing a coherent mathematical framework that describes the rich CPS "design space" of fundamental limits and tradeoffs between efficiency, robustness, adaptation, verification and scalability. Whereas most research treats at most one of these issues, we attempt a holistic approach in examining these metrics. In particular, we will argue that a control architecture that emphasizes scalability leads to improvements in robustness, adaptation, and verification, all the while having only minor effects on efficiency i.e. through the choice of a new architecture, we believe that we are able to bring a system closer to the true fundamental hard limits of this complex design space. (C) 2014 The Authors. Published by Elsevier B.V.
引用
收藏
页码:285 / 293
页数:9
相关论文
共 50 条
  • [11] Stability of large-scale distributed parameter systems
    Ladde, GS
    Li, TT
    [J]. DYNAMIC SYSTEMS AND APPLICATIONS, 2002, 11 (03): : 311 - 323
  • [12] Energy efficiency in large-scale distributed systems
    Tuan Anh Trinh
    Hlavacs, Helmut
    Talia, Domenico
    [J]. FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF GRID COMPUTING AND ESCIENCE, 2012, 28 (05): : 743 - 744
  • [13] Monitoring and control of large-scale distributed systems
    Legrand, C.
    [J]. GRID AND CLOUD COMPUTING: CONCEPTS AND PRACTICAL APPLICATIONS, 2016, 192 : 101 - 151
  • [14] Analysis of large-scale distributed information systems
    Hellerstein, JL
    Jayram, TS
    Squillante, MS
    [J]. 8TH INTERNATIONAL SYMPOSIUM ON MODELING, ANALYSIS AND SIMULATION OF COMPUTER AND TELECOMMUNICATION SYSTEMS, PROCEEDINGS, 2000, : 164 - 171
  • [15] Distributed resource discovery in large scale computing systems
    Gupta, A
    Agrawal, D
    El Abbadi, A
    [J]. 2005 SYMPOSIUM ON APPLICATIONS AND THE INTERNET, PROCEEDINGS, 2005, : 320 - 326
  • [16] Robustness of large-scale distributed computer systems
    Khoroshevsky, VG
    [J]. EUROSIM '96 - HPCN CHALLENGES IN TELECOMP AND TELECOM: PARALLEL SIMULATION OF COMPLEX SYSTEMS AND LARGE-SCALE APPLICATIONS, 1996, : 141 - 150
  • [17] Distributed Orchestration in Large-scale IoT Systems
    Yigitoglu, Emre
    Liu, Ling
    Looper, Margaret
    Pu, Calton
    [J]. 2017 IEEE 2ND INTERNATIONAL CONGRESS ON INTERNET OF THINGS (IEEE ICIOT), 2017, : 58 - 65
  • [18] A SECURITY SIMULATION MODEL FOR LARGE SCALE DISTRIBUTED SYSTEMS
    Dobre, Ciprian
    Constantin, Florina
    Pop, Florin
    Cristea, Valentin
    [J]. EUROPEAN SIMULATION AND MODELLING CONFERENCE 2010, 2010, : 45 - 50
  • [19] Effective multicast programming in large scale distributed systems
    Eugster, PT
    Boichat, R
    Guerraoui, R
    Sventek, J
    [J]. CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2001, 13 (06): : 421 - 447
  • [20] Independent recovery in large-scale distributed systems
    Triantafillou, P
    [J]. IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 1996, 22 (11) : 812 - 826