Power-Aware Checkpointing for Multicore Embedded Systems

Authors
Ansari, Mohsen [1]
Safari, Sepideh [2]
Khdr, Heba [3]
Gohari-Nazari, Pourya [1]
Henkel, Joerg [3]
Ejlali, Alireza [1]
Hessabi, Shaahin [1]
Affiliations
[1] Sharif Univ Technol, Dept Comp Engn, Tehran 14588, Iran
[2] Inst Res Fundamental Sci IPM, Sch Comp Sci, Tehran 1953833511, Iran
[3] Karlsruhe Inst Technol, D-76131 Karlsruhe, Germany
Keywords
Task analysis; Checkpointing; Timing; Power demand; Real-time systems; Embedded systems; Reliability; Peak power consumption; checkpointing; multicore platforms; embedded systems; FAULT-TOLERANCE; MANAGEMENT; RECOVERY;
DOI
10.1109/TPDS.2022.3188568
Chinese Library Classification (CLC) number
TP301 [Theory, Methods];
Subject classification code
081202;
Abstract
Integrating more cores on a single chip offers great potential for implementing fault-tolerant techniques that achieve high reliability in real-time embedded systems. Checkpointing with rollback recovery is a well-established technique for tolerating transient faults on multicore platforms. To cover the worst-case fault scenario, checkpointing must re-execute some parts of the tasks, which can cause task parts with high power consumption to execute simultaneously and eventually push peak power beyond the thermal design power (TDP). Exceeding TDP can raise on-chip temperatures beyond safe limits, thereby triggering countermeasures that throttle the voltage and frequency levels or power-gate the cores. Such countermeasures might lead to task deadline violations and degrade the system's reliability. To avoid such severe scenarios, it is essential to consider the impact of fault-tolerant techniques on power consumption and to prevent violations of the chip's power constraint, i.e., TDP. This article presents, for the first time, a peak-power-aware checkpointing (PPAC) technique that tolerates a given number of faults, k, while meeting the power constraint in hard real-time embedded systems. To do this, PPAC moves the checkpoints, which consume less power than the tasks, to the execution time points at which power spikes exceed TDP. Moreover, PPAC exploits the available slack time on the cores to delay the execution of some tasks, avoiding the remaining power spikes beyond TDP that could not be mitigated by adjusting checkpoints alone. To evaluate the technique, we extend the state-of-the-art system-level simulator gem5 with the state-of-the-art checkpointing module in Linux. Our experimental results show that the proposed technique tolerates a given number of faults without violating the timing and power constraints of hard real-time embedded systems. Compared to state-of-the-art techniques, it reduces peak power by 23% on average. Moreover, the technique applies dynamic power management (DPM) during the slack time that arises at runtime in fault-free scenarios, yielding energy savings of 17.28% on average and up to 61.1%.
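The idea of delaying a task within its slack so that a power spike falls under TDP can be illustrated with a toy sketch. This is not the PPAC algorithm from the article: the slotted time model, the function names (power_profile, delay_within_slack), the schedule, and all power values are assumptions made up for illustration only.

```python
from typing import Dict, List, Optional, Tuple

# A segment is (start_slot, length_in_slots, power_in_watts).
# All numbers below are invented for illustration, not taken from the article.
Segment = Tuple[int, int, float]
Schedule = Dict[str, List[Segment]]


def power_profile(schedule: Schedule, horizon: int) -> List[float]:
    """Sum per-core power into one chip-level value per time slot."""
    profile = [0.0] * horizon
    for segments in schedule.values():
        for start, length, watts in segments:
            for t in range(start, start + length):
                profile[t] += watts
    return profile


def delay_within_slack(schedule: Schedule, core: str, index: int,
                       slack: int, horizon: int, tdp: float) -> Optional[int]:
    """Return the smallest delay in 0..slack that keeps peak power <= tdp, else None."""
    start, length, watts = schedule[core][index]
    for delay in range(slack + 1):
        # Copy the schedule and shift only the chosen segment.
        trial = {c: list(segs) for c, segs in schedule.items()}
        trial[core][index] = (start + delay, length, watts)
        if max(power_profile(trial, horizon)) <= tdp:
            return delay
    return None


# core0 runs a 3 W task part, then a 1 W checkpoint; core1 runs a 3 W task part.
SCHEDULE: Schedule = {
    "core0": [(0, 4, 3.0), (4, 1, 1.0)],
    "core1": [(2, 3, 3.0)],
}
HORIZON = 8   # number of time slots considered
TDP = 5.0     # assumed chip power budget in watts

print(max(power_profile(SCHEDULE, HORIZON)))                       # peak before any delay
print(delay_within_slack(SCHEDULE, "core1", 0, 3, HORIZON, TDP))   # smallest feasible delay
```

In this toy schedule the two 3 W task parts overlap and exceed the assumed 5 W budget; delaying the task on core1 so that it overlaps the low-power checkpoint on core0 instead brings the peak under the budget. The sketch ignores deadlines, fault recovery, and checkpoint placement, all of which the article's technique must handle.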
Pages: 4410-4424
Number of pages: 15