Fault Detection in Multi-Core Processors Using Chaotic Maps

被引:0
|
作者
Rao, Nageswara S. V. [1 ]
机构
[1] Oak Ridge Natl Lab, Comp Sci & Math Div, Oak Ridge, TN 37831 USA
关键词
Exascale systems; chaotic maps; multi-core processors; fault detection; resilience;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Exascale systems built using multi-core processors are expected to experience several component faults during code executions lasting for hours. It is important to detect faults in processor cores so that faulty cores can be removed from scheduler pools, nodes with high failures can be swapped out, applications can be migrated, and check-point recoveries can be initiated. We propose light-weight codes that utilize chaotic computations and customized threads to detect component faults in multi-core processors. They concurrently execute dedicated threads that implement Poincare and identity maps, which are customized to isolate faults in arithmetic operations, memory elements and interconnects. The instruction execution errors and local memory errors are detected by threads dedicated to processor cores, and errors in inter-processor cross-connects are detected by global-local memory movements. We present preliminary implementation results on 4- and 48-core HP workstations under simulated faults.
引用
收藏
页码:27 / 32
页数:6
相关论文
共 50 条
  • [1] Language identification using multi-core processors
    Hanani, A.
    Carey, M. J.
    Russell, M. J.
    [J]. COMPUTER SPEECH AND LANGUAGE, 2012, 26 (05): : 371 - 383
  • [2] Program Phase Detection in Heterogeneous Multi-Core Processors
    Jooya, A. Z.
    Analoui, M.
    [J]. 2009 14TH INTERNATIONAL COMPUTER CONFERENCE, 2009, : 218 - 223
  • [3] Fault Tolerant Computing for Stream DSP Applications using GALS Multi-core Processors
    Yu, Zhiyi
    Shi, Zewen
    Zeng, Xiaoyang
    [J]. 2011 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2011, : 2305 - 2308
  • [4] A Freespace Crossbar for Multi-core Processors
    Victor, Michel N.
    Silzars, Aris K.
    Davidson, Edward S.
    [J]. ICS'08: PROCEEDINGS OF THE 2008 ACM INTERNATIONAL CONFERENCE ON SUPERCOMPUTING, 2008, : 56 - +
  • [5] Thermal modeling of multi-core processors
    Xu, Guoping
    [J]. 2006 PROCEEDINGS 10TH INTERSOCIETY CONFERENCE ON THERMAL AND THERMOMECHANICAL PHENOMENA IN ELECTRONICS SYSTEMS, VOLS 1 AND 2, 2006, : 96 - 100
  • [6] Power Consumption in Multi-core Processors
    Balakrishnan, M.
    [J]. CONTEMPORARY COMPUTING, 2012, 306 : 3 - 3
  • [7] Using Multi-core Processors to Support Network Security Applications
    Xiang, Yang
    Zhou, Wanlei
    [J]. 12TH IEEE INTERNATIONAL WORKSHOP ON FUTURE TRENDS OF DISTRIBUTED COMPUTING SYSTEMS, PROCEEDINGS, 2008, : 213 - 218
  • [8] Performance Optimization of Multi-Core Processors using Core Hopping - Thermal and Structural
    Lingampalli, Sunil
    Mirza, Fahad
    Raman, Thiagarajan
    Agonafer, Dereje
    [J]. 2012 28TH ANNUAL IEEE SEMICONDUCTOR THERMAL MEASUREMENT AND MANAGEMENT SYMPOSIUM (SEMI-THERM), 2012, : 112 - 117
  • [9] Core Interface Optimization for Multi-core Neuromorphic Processors
    Su, Zhe
    Hwang, Hyunjung
    Torchet, Tristan
    Indiveri, Giacomo
    [J]. 2023 28TH IEEE INTERNATIONAL SYMPOSIUM ON ASYNCHRONOUS CIRCUITS AND SYSTEMS, ASYNC, 2023, : 89 - 98
  • [10] A Task Scheduling Algorithm for Multi-core Processors
    Yao, Xuanxia
    Geng, Peng
    Du, Xiaojiang
    [J]. 2013 INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED COMPUTING, APPLICATIONS AND TECHNOLOGIES (PDCAT), 2013, : 259 - 264