SHiFA: System-Level Hierarchy in Run-Time Fault-Aware Management of Many-Core Systems

Cited by: 3
Authors
Fattah, Mohammad [1]
Palesi, Maurizio [2]
Liljeberg, Pasi [1]
Plosila, Juha [1]
Tenhunen, Hannu [1]
Affiliations
[1] Univ Turku, Turku, Finland
[2] Kore Univ Enna, Enna, Italy
Keywords
application mapping; system-level design; hierarchical management
DOI
10.1145/2593069.2593214
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
This paper proposes SHiFA, a system-level approach to fault-aware resource management of many-core systems. SHiFA tolerates run-time faults at the system level without any hardware overhead. In contrast to existing system-level methods, it also treats network resources as potentially faulty. Applications are therefore mapped onto healthy nodes of the system at run-time such that their communication does not require any faulty element. With a simple routing approach, results show 100% utilizability of PEs and a 99.41% successful-mapping rate when up to 8 links are broken. The SHiFA design is based on distributed operating systems, which keeps it scalable for future many-core systems. A significant improvement in scalability is observed compared to state-of-the-art distributed approaches.
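The constraint stated in the abstract, that mapped tasks must be able to communicate without crossing a faulty element, can be illustrated with a small check. This is only a sketch under assumptions that are not taken from the record: a 2D mesh, dimension-ordered (XY) routing standing in for the unspecified "simple routing approach", and bidirectional links that fail as a whole. The names `xy_route`, `route_is_healthy`, and `mapping_is_valid` are hypothetical and are not SHiFA's actual algorithm.

```python
from typing import Dict, FrozenSet, Iterable, List, Set, Tuple

Node = Tuple[int, int]   # (x, y) coordinate in an assumed 2D mesh
Link = FrozenSet[Node]   # assumed bidirectional link between two adjacent nodes


def xy_route(src: Node, dst: Node) -> List[Node]:
    """Nodes visited by XY routing: travel along X first, then along Y."""
    x, y = src
    path = [src]
    while x != dst[0]:
        x += 1 if dst[0] > x else -1
        path.append((x, y))
    while y != dst[1]:
        y += 1 if dst[1] > y else -1
        path.append((x, y))
    return path


def route_is_healthy(src: Node, dst: Node, faulty_links: Set[Link]) -> bool:
    """True if the XY route from src to dst crosses no faulty link."""
    path = xy_route(src, dst)
    return all(frozenset((a, b)) not in faulty_links
               for a, b in zip(path, path[1:]))


def mapping_is_valid(mapping: Dict[str, Node],
                     edges: Iterable[Tuple[str, str]],
                     faulty_links: Set[Link],
                     faulty_nodes: Set[Node] = frozenset()) -> bool:
    """True if every task sits on a healthy node and every (sender, receiver)
    pair in edges can communicate without using a faulty link."""
    if any(node in faulty_nodes for node in mapping.values()):
        return False
    return all(route_is_healthy(mapping[s], mapping[d], faulty_links)
               for s, d in edges)


# Hypothetical example: one broken link between (0, 0) and (1, 0).
broken = {frozenset({(0, 0), (1, 0)})}
placement = {"a": (0, 0), "b": (1, 1)}
print(mapping_is_valid(placement, [("a", "b")], broken))  # a -> b goes X first over the broken link
print(mapping_is_valid(placement, [("b", "a")], broken))  # b -> a detours via (0, 1)
```

Under these assumptions a run-time mapper would reject the first placement and could accept the second, since XY routing is direction-dependent. A real manager would search over candidate placements rather than test a single one.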
Pages: 6
Related papers
50 in total
  • [1] Design Methodology and Run-time Management for Predictable Many-Core Systems
    Wildermann, Stefan
    Weichslgartner, Andreas
    Teich, Juergen
    [J]. 2015 IEEE 18TH INTERNATIONAL SYMPOSIUM ON REAL-TIME DISTRIBUTED COMPUTING WORKSHOPS, 2015, : 103 - 110
  • [2] PARMA: Parallelization-Aware Run-Time Management for Energy-Efficient Many-Core Systems
    Al-hayanni, Mohammed A. Noaman
    Rafiev, Ashur
    Xia, Fei
    Shafik, Rishad
    Romanovsky, Alexander
    Yakovlev, Alex
    [J]. IEEE TRANSACTIONS ON COMPUTERS, 2020, 69 (10) : 1507 - 1518
  • [3] Applying an Integrated Modelling Process to Run-time Management of Many-Core Systems
    Fathabadi, Asieh Salehi
    Snook, Colin
    Butler, Michael
    [J]. INTEGRATED FORMAL METHODS, IFM 2014, 2014, 8739 : 120 - 135
  • [4] Run-time timing prediction for system reconfiguration on many-core embedded systems
    Li, Zheng
    He, Shuibing
    [J]. JOURNAL OF SYSTEMS ARCHITECTURE, 2019, 95 : 47 - 54
  • [5] DRACON: A Dedicated Hardware Infrastructure for Scalable Run-Time Management on Many-Core Systems
    Gregorek, Daniel
    Rust, Jochen
    Garcia-Ortiz, Alberto
    [J]. IEEE ACCESS, 2019, 7 : 121931 - 121948
  • [6] Machine Learning for Run-Time Energy Optimisation in Many-Core Systems
    Biswas, Dwaipayan
    Balagopal, Vibishna
    Shafik, Rishad
    Al-Hashimi, Bashir M.
    Merrett, Geoff V.
    [J]. PROCEEDINGS OF THE 2017 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION (DATE), 2017, : 1588 - 1592
  • [7] Prediction Based Run-Time Reconfiguration on Many-core Embedded Systems
    Li, Zheng
    He, Shuibing
    Wang, Li
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND ENGINEERING (CSE) AND IEEE/IFIP INTERNATIONAL CONFERENCE ON EMBEDDED AND UBIQUITOUS COMPUTING (EUC), VOL 2, 2017, : 140 - 146
  • [8] Artificial bee colony-inspired run-time task management for many-core systems
    Abuassal, Ali
    Tempesti, Gianluca
    Trefzer, Martin A.
    [J]. 2018 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (IEEE SSCI), 2018, : 1084 - 1091
  • [9] Distributed run-time resource management for malleable applications on many-core platforms
    Anagnostopoulos, Iraklis
    Tsoutsouras, Vasileios
    Bartzas, Alexandros
    Soudris, Dimitrios
    [J]. 2013 50TH ACM / EDAC / IEEE DESIGN AUTOMATION CONFERENCE (DAC), 2013,
  • [10] Adjustable Contiguity of Run-Time Task Allocation in Networked Many-Core Systems
    Fattah, Mohammad
    Liljeberg, Pasi
    Plosila, Juha
    Tenhunen, Hannu
    [J]. 2014 19TH ASIA AND SOUTH PACIFIC DESIGN AUTOMATION CONFERENCE (ASP-DAC), 2014, : 349 - 354