Hardware Performance Counters for System Reliability Monitoring

被引:0
|
作者
Leng, Elena Woo Lai [1 ]
Zwolinski, Mark [1 ]
Halak, Basel [1 ]
机构
[1] Univ Southampton, Dept Elect & Comp Sci, Southampton SO17 1BJ, Hants, England
关键词
VARIABILITY;
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
As technology scaling reaches nanometre scales, the error rate due to variations in temperature and voltage, single event effects and component degradation increases, making components less reliable. In order to ensure a system continues to function correctly while facing known reliability issues, it is imperative that the system should have the means to detect the occurrence of errors due to the presence of faults. A system that behaves normally (no error detected in the system) exhibits a profile, and any deviations from this profile indicate that there is an anomaly in the system. In this paper, we propose to use hardware performance counters (HPCs) to measure events that occur during the execution of the program. We explore the various counters available which could be use to identify the anomalous behaviour in the system and develop a methodology to observe the anomalies using HPCs by creating a fault-free pattern and observing any subsequent changes in that pattern. We evaluate the proposed technique using GemFI, an architectural simulator based on Gem5 with additional fault injection capabilities. We compare the results obtained at the end of the execution with data collected during a time interval. Our results show that HPCs can be used to identify anomalous behaviour in a system that would lead to failure.
引用
下载
收藏
页码:76 / 81
页数:6
相关论文
共 50 条
  • [1] An Automated Approach to Hardware Performance Monitoring Counters
    Tinetti, Fernando G.
    Mendez, Mariano
    2014 INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND COMPUTATIONAL INTELLIGENCE (CSCI), VOL 1, 2014, : 71 - 76
  • [2] A hardware counters based tool for system monitoring
    Ferreto, TC
    DeRose, L
    De Rose, CAF
    EURO-PAR 2003 PARALLEL PROCESSING, PROCEEDINGS, 2003, 2790 : 7 - 16
  • [3] DRAC: Adaptive control system with hardware performance counters
    Pillon, M
    Richard, O
    Da Costa, G
    EURO-PAR 2004 PARALLEL PROCESSING, PROCEEDINGS, 2004, 3149 : 55 - 63
  • [4] Exploiting hardware performance counters
    Uhsadel, Leif
    Georges, Andy
    Verbauwhede, Ingrid
    FDTC 2008: FAULT DIAGNOSIS AND TOLERANCE IN CRYPTOGRAPHY, PROCEEDINGS, 2008, : 59 - +
  • [5] System execution path profiling using hardware performance counters
    Giraldeau, Francis
    Ezzati-Jivan, Naser
    Dagenais, Michel R.
    2021 15TH ANNUAL IEEE INTERNATIONAL SYSTEMS CONFERENCE (SYSCON 2021), 2021,
  • [6] Can Hardware Performance Counters be Trusted?
    Weaver, Vincent M.
    Mckee, Sally A.
    2008 IEEE INTERNATIONAL SYMPOSIUM ON WORKLOAD CHARACTERIZATION, 2008, : 131 - 140
  • [7] Ransomware Classification Using Hardware Performance Counters on a Non-Virtualized System
    Hill, Jennie E.
    Walker, T. Owens
    Blanco, Justin A.
    Ives, Robert W.
    Rakvic, Ryan
    Jacob, Bruce
    IEEE ACCESS, 2024, 12 : 63865 - 63884
  • [8] Malicious Firmware Detection with Hardware Performance Counters
    Wang, Xueyang
    Konstantinou, Charalambos
    Maniatakos, Michail
    Karri, Ramesh
    Lee, Serena
    Robison, Patricia
    Stergiou, Paul
    Kim, Steve
    IEEE TRANSACTIONS ON MULTI-SCALE COMPUTING SYSTEMS, 2016, 2 (03): : 160 - 173
  • [9] Profiling Cloud Applications with Hardware Performance Counters
    Kandalintsev, Alexandre
    Lo Cigno, Renato
    Kliazovich, Dzmitry
    Bouvry, Pascal
    2014 INTERNATIONAL CONFERENCE ON INFORMATION NETWORKING (ICOIN 2014), 2014, : 52 - 57
  • [10] Detection of Cache Side Channel Attacks Using Thread Level Monitoring of Hardware Performance Counters
    Bhade, Pavitra Prakash
    Sinha, Sharad
    2021 IEEE 14TH INTERNATIONAL SYMPOSIUM ON EMBEDDED MULTICORE/MANY-CORE SYSTEMS-ON-CHIP (MCSOC 2021), 2021, : 210 - 217