An Adaptive Approach for Online Fault Management in Many-Core Architectures

被引:0
|
作者
Bolchini, Cristiana [1 ]
Miele, Antonio [1 ]
Sciuto, Donatella [1 ]
机构
[1] Politecn Milan, Dip Elettron & Informaz, Pzza L da Vinci 32, I-20133 Milan, Italy
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper presents a dynamic scheduling solution to achieve fault tolerance in many-core architectures. Triple Modular Redundancy is applied on the multi-threaded application to dynamically mitigate the effects of both permanent and transient faults, and to identify and isolate damaged units. The approach targets the best performance, while balancing the use of the healthy resources to limit wear-out and aging effects, which cause permanent damages. Experimental results on synthetic case studies are reported, to validate the ability to tolerate faults while optimizing performance and resource usage.
引用
收藏
页码:1429 / 1432
页数:4
相关论文
共 50 条
  • [21] Graph Reachability on Parallel Many-Core Architectures
    Quer, Stefano
    Calabrese, Andrea
    [J]. COMPUTATION, 2020, 8 (04) : 1 - 26
  • [22] A Compressive Sensing Algorithm for Many-Core Architectures
    Borghi, A.
    Darbon, J.
    Peyronnet, S.
    Chan, T. F.
    Osher, S.
    [J]. ADVANCES IN VISUAL COMPUTING, PT II, 2010, 6454 : 678 - 686
  • [23] Power Gating Clustered Many-Core Architectures
    Musoll, Enric
    [J]. JOURNAL OF LOW POWER ELECTRONICS, 2008, 4 (03) : 290 - 300
  • [24] On the Complexity of Mapping Feasibility in Many-Core Architectures
    Schwarzer, Tobias
    Roloff, Sascha
    Richthammer, Valentina
    Khaldi, Rami
    Wildermann, Stefan
    Glass, Michael
    Teich, Juergen
    [J]. 2018 IEEE 12TH INTERNATIONAL SYMPOSIUM ON EMBEDDED MULTICORE/MANY-CORE SYSTEMS-ON-CHIP (MCSOC 2018), 2018, : 176 - 183
  • [25] Accelerating Dedispersion Using Many-core Architectures
    Novotny, Jan
    Adamek, Karel
    Clark, M. A.
    Giles, Mike
    Armour, Wes
    [J]. ASTROPHYSICAL JOURNAL SUPPLEMENT SERIES, 2023, 269 (01):
  • [26] Fast Convolution Operations on Many-Core Architectures
    Li, Shigang
    Zhang, Yunquan
    Xiang, Chunyang
    Shi, Lei
    [J]. 2015 IEEE 17TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS, 2015 IEEE 7TH INTERNATIONAL SYMPOSIUM ON CYBERSPACE SAFETY AND SECURITY, AND 2015 IEEE 12TH INTERNATIONAL CONFERENCE ON EMBEDDED SOFTWARE AND SYSTEMS (ICESS), 2015, : 316 - 323
  • [27] PoweRock: Power Modeling and Flexible Dynamic Power Management for Many-Core Architectures
    Lai, Zhiquan
    Lam, King Tin
    Wang, Cho-Li
    Su, Jinshu
    [J]. IEEE SYSTEMS JOURNAL, 2017, 11 (02): : 600 - 612
  • [28] IsoNet: Hardware-Based Job Queue Management for Many-Core Architectures
    Lee, Junghee
    Nicopoulos, Chrysostomos
    Lee, Hyung Gyu
    Panth, Shreepad
    Lim, Sung Kyu
    Kim, Jongman
    [J]. IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, 2013, 21 (06) : 1080 - 1093
  • [29] Scalable Thread Scheduling and Global Power Management for Heterogeneous Many-Core Architectures
    Winter, Jonathan A.
    Albonesi, David H.
    Shoemaker, Christine A.
    [J]. PACT 2010: PROCEEDINGS OF THE NINETEENTH INTERNATIONAL CONFERENCE ON PARALLEL ARCHITECTURES AND COMPILATION TECHNIQUES, 2010, : 29 - 39
  • [30] Adaptive Optimization of Sparse Matrix-Vector Multiplication on Emerging Many-Core Architectures
    Chen, Shizhao
    Fang, Jianbin
    Chen, Donglin
    Xu, Chuanfu
    Wang, Zheng
    [J]. IEEE 20TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS / IEEE 16TH INTERNATIONAL CONFERENCE ON SMART CITY / IEEE 4TH INTERNATIONAL CONFERENCE ON DATA SCIENCE AND SYSTEMS (HPCC/SMARTCITY/DSS), 2018, : 649 - 658