Radiation-Induced Error Criticality in Modern HPC Parallel Accelerators

Cited by: 22
Authors
Oliveira, Daniel [1]
Pilla, Laercio [2]
Hanzich, Mauricio [3]
Fratin, Vinicius [1]
Fernandes, Fernando [1]
Lunardi, Caio [1]
Maria Cela, Jose [3]
Navaux, Philippe [1]
Carro, Luigi [1]
Rech, Paolo [1]
Affiliations
[1] Univ Fed Rio Grande do Sul, Inst Informat, Porto Alegre, RS, Brazil
[2] Univ Fed Santa Catarina, Dept Informat & Stat, Florianopolis, SC, Brazil
[3] Barcelona Supercomp Ctr, CASE Dept, Barcelona, Spain
Keywords
SOFT-ERROR; FAULT-TOLERANCE;
DOI
10.1109/HPCA.2017.41
Chinese Library Classification (CLC) number
TP3 [Computing technology, computer technology];
Discipline classification code
0812;
Abstract
In this paper, we evaluate the error criticality of radiation-induced errors on modern High-Performance Computing (HPC) accelerators (Intel Xeon Phi and NVIDIA K40) through a dedicated set of metrics. We show that, as far as imprecise computing is concerned, simple mismatch detection is not sufficient to evaluate and compare the radiation sensitivity of HPC devices and algorithms. Our analysis quantifies and qualifies radiation effects on applications' output, correlating the number of corrupted elements with their spatial locality. Also, we provide the mean relative error (dataset-wise) to evaluate radiation-induced error magnitude. We apply the selected metrics to experimental results obtained in various radiation test campaigns for a total of more than 400 hours of beam time per device. The amount of data we gathered allows us to evaluate the error criticality of a representative set of algorithms from HPC suites. Additionally, based on the characteristics of the tested algorithms, we draw generic reliability conclusions for broader classes of codes. We show that arithmetic operations are less critical for the K40, while Xeon Phi is more reliable when executing particle interactions solved through Finite Difference Methods. Finally, iterative stencil operations seem the most reliable on both architectures.
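The metrics named in the abstract (number of corrupted elements, their positions, and a dataset-wise mean relative error) can be illustrated with a minimal sketch. This is not the authors' code: the function name `error_criticality`, the `tolerance` parameter, the choice to average only over corrupted elements, and the handling of zero-valued reference elements are all assumptions made for illustration. The abstract does not give the exact definitions.

```python
def error_criticality(golden, observed, tolerance=0.0):
    """Compare one output against a fault-free ("golden") reference.

    Hypothetical helper, not taken from the paper. Both inputs are flat
    sequences of numbers of equal length.
    Returns (corrupted_count, mean_relative_error, corrupted_indices).
    """
    if len(golden) != len(observed):
        raise ValueError("golden and observed outputs must have the same length")

    corrupted_indices = []
    relative_errors = []
    for i, (g, o) in enumerate(zip(golden, observed)):
        # Assumption: when the reference value is zero, fall back to the
        # absolute error so the division is always defined.
        denom = abs(g) if g != 0 else 1.0
        rel = abs(o - g) / denom
        # tolerance=0.0 reproduces plain mismatch detection; a larger value
        # models an application that accepts imprecise results.
        if rel > tolerance:
            corrupted_indices.append(i)
            relative_errors.append(rel)

    # Assumption: the mean is taken over corrupted elements only.
    if relative_errors:
        mean_rel = sum(relative_errors) / len(relative_errors)
    else:
        mean_rel = 0.0
    return len(corrupted_indices), mean_rel, corrupted_indices


golden = [1.0, 2.0, 4.0, 0.0]
observed = [1.0, 2.2, 4.0, 0.0]
count, mean_rel, where = error_criticality(golden, observed)
print(count, where)  # one corrupted element, at index 1
```

With `tolerance=0.0` any mismatch counts as an error. Raising the tolerance shows why mismatch detection alone can overstate criticality: an output that differs from the reference by less than the accepted error is no longer counted. The returned indices are the starting point for a spatial-locality analysis, which is not sketched here.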
Pages: 577-588
Number of pages: 12