Memory Controller with Adaptive ECC for Reliable System Operation

被引:1
|
作者
Stefani, Marco [1 ]
Marcon, Cesar [1 ]
Silva, Felipe [2 ]
Silveira, Jarbas [2 ]
机构
[1] Pontif Catholic Univ Rio Grande Do Sul PUCRS, Grad Program Comp Sci PPGCC, Porto Alegre, RS, Brazil
[2] Fed Univ Ceara UFC, Engn & Comp Syst Lab LESC, DETI, Fortaleza, Ceara, Brazil
关键词
Memory Controller; Fault Tolerance; Dynamic Error Correction Code; DRAM ERRORS;
D O I
10.1109/SBCCI60457.2023.10261959
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Memory errors can cause crashes and data loss, which are unacceptable for various computing systems, mainly large servers. Memory controllers can mitigate these errors by employing an Error Correction Code (ECC) in the data write and read flows. This work proposes a fault-tolerant mechanism that acts as a memory controller's encoding and decoding manager. This mechanism adapts the ECC for each memory block based on the efficacies of the ECCs available in the controller and the error rate captured at runtime. Consequently, memory blocks with a high error rate can be recoded to a high efficacy ECC and vice versa. Experimental results show that our proposal achieves high error correction efficacy with high energy efficiency.
引用
收藏
页码:125 / 130
页数:6
相关论文
共 50 条
  • [1] CLEAN-ECC: High Reliability ECC for Adaptive Granularity Memory System
    Gong, Seong-Lyong
    Rhu, Minsoo
    Kim, Jungrae
    Chung, Jinsuk
    Erez, Mattan
    [J]. PROCEEDINGS OF THE 48TH ANNUAL IEEE/ACM INTERNATIONAL SYMPOSIUM ON MICROARCHITECTURE (MICRO-48), 2015, : 611 - 622
  • [2] Compression and Variable-Sized ECC Scheme for the Reliable Flash Memory System
    Kim, Kijin
    Lim, Seung-Ho
    [J]. ADVANCES IN COMPUTER SCIENCE AND UBIQUITOUS COMPUTING, 2018, 474 : 1232 - 1236
  • [3] Reconfigurable ECC for Adaptive Protection of Memory
    Basak, Abhishek
    Paul, Somnath
    Park, Jangwon
    Park, Jongsun
    Bhunia, Swarup
    [J]. 2013 IEEE 56TH INTERNATIONAL MIDWEST SYMPOSIUM ON CIRCUITS AND SYSTEMS (MWSCAS), 2013, : 1085 - 1088
  • [4] An Intelligent Controller based Power Grid Interconnected System for Reliable Operation
    Raaj, Vellarivelli B. Thurai
    Suresh, Krishnan
    [J]. PROCEEDINGS OF THE 2019 3RD INTERNATIONAL CONFERENCE ON COMPUTING METHODOLOGIES AND COMMUNICATION (ICCMC 2019), 2019, : 608 - 613
  • [5] Theory and operation of a robust controller for a compact adaptive optics system
    Frazier, BW
    Tyson, RK
    Smith, M
    Roche, J
    [J]. OPTICAL ENGINEERING, 2004, 43 (12) : 2912 - 2920
  • [6] Adaptive ECC for Tailored Protection of Nanoscale Memory
    Shin, Dongyeob
    Park, Jongsun
    Park, Jangwon
    Paul, Somnath
    Bhunia, Swarup
    [J]. IEEE DESIGN & TEST, 2017, 34 (06) : 84 - 93
  • [7] ECC-Aware Fast and Reliable Pattern Matching Redundancy Analysis for Highly Reliable Memory
    Han, Donghyun
    Lee, Hayoung
    Lee, Seungtaek
    Kang, Sungho
    [J]. IEEE ACCESS, 2021, 9 : 133274 - 133288
  • [8] Bamboo ECC: Strong, Safe, and Flexible Codes for Reliable Computer Memory
    Kim, Jungrae
    Sullivan, Michael
    Erez, Mattan
    [J]. 2015 IEEE 21ST INTERNATIONAL SYMPOSIUM ON HIGH PERFORMANCE COMPUTER ARCHITECTURE (HPCA), 2015, : 101 - 112
  • [9] Adaptive robust reliable controller for uncertain time-delay system with actuator failures
    Qiao, Jun-Li
    Zhang, Jing-Mei
    Jia, Xin-Chun
    [J]. WCICA 2006: SIXTH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION, VOLS 1-12, CONFERENCE PROCEEDINGS, 2006, : 2160 - +
  • [10] An Adaptive Protection Strategy for Reliable Operation of Microgrids
    Sedghisigarchi, Kourosh
    Sardari, Keyvan Talebizadeh
    [J]. 2018 IEEE INTERNATIONAL ENERGY CONFERENCE (ENERGYCON), 2018,