Impacts of Three Soft-Fault Models on Hybrid Parallel Asynchronous Iterative Methods

被引:0
|
作者
Coleman, Evan [1 ,2 ]
Jensen, Erik J. [2 ]
Sosonkina, Masha [2 ]
机构
[1] Naval Surface Warfare Ctr, Dahlgren Div, Dahlgren, VA 22448 USA
[2] Old Dominion Univ, Modeling Simulat & Visualizat Engn Dept, Norfolk, VA 23529 USA
来源
2018 30TH INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE AND HIGH PERFORMANCE COMPUTING (SBAC-PAD 2018) | 2018年
关键词
Fault modeling; fault tolerance; hybrid parallelism; asynchronous iterative methods;
D O I
10.1109/SBAC-PAD.2018.00076
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
This study seeks to understand the soft error vulnerability of asynchronous iterative methods, with a focus on stationary iterative solvers such as Jacobi. The implementations make use of hybrid parallelism where the computational work is distributed over multiple nodes using MPI and parallelized on each node using OpenMP. A series of experiments is conducted to measure the impact of an undetected soft fault on an asynchronous iterative method, and to compare and contrast several techniques for simulating the occurrence of a fault and then recovering from the effects of the faults. The data shows that the two numerical soft-fault models tested here more consistently than a "bit-flip" model produce bad enough behavior to test a variety of recovery strategies, such as those based on partial checkpointing.
引用
收藏
页码:458 / 465
页数:8
相关论文
共 40 条
  • [21] A survey of decision making methods based on two classes of hybrid soft set models
    Ma, Xueling
    Zhan, Jianming
    Ali, Muhammad Irfan
    Mehmood, Nayyar
    ARTIFICIAL INTELLIGENCE REVIEW, 2018, 49 (04) : 511 - 529
  • [22] JACEP2P-V2: A Fully Decentralized and Fault Tolerant Environment for Executing Parallel Iterative Asynchronous Applications on Volatile Distributed Architectures
    Charr, Jean-Claude
    Couturier, Raphael
    Laiymani, David
    ADVANCES IN GRID AND PERVASIVE COMPUTING, PROCEEDINGS, 2009, 5529 : 446 - 458
  • [23] JACEP2P-V2: A fully decentralized and fault tolerant environment for executing parallel iterative asynchronous applications on volatile distributed architectures
    Charr, Jean-Claude
    Couturier, Raphael
    Laiymani, David
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2011, 27 (05): : 606 - 613
  • [24] JACEP2P-V2: A fully decentralized and fault tolerant environment for executing parallel iterative asynchronous applications on volatile distributed architectures
    Laboratory of Computer Sciences, University of Franche-Comté , IUT de Belfort-Montbéliard, Rue Engel Gros, BP 527, 90016 Belfort, France
    Future Gener Comput Syst, 5 (606-613):
  • [25] Parallel iterative solvers for finite-element methods using an OpenMP/MPI hybrid programming model on the Earth Simulator
    Nakajima, K
    PARALLEL COMPUTING, 2005, 31 (10-12) : 1048 - 1065
  • [26] Parallel implementation of hybrid direct-iterative algorithm for multibody dynamics via Krylov subspace methods on IBM 1350 cluster
    Duan, Shanzhong
    Patel, Yogesh
    IMECS 2008: INTERNATIONAL MULTICONFERENCE OF ENGINEERS AND COMPUTER SCIENTISTS, VOLS I AND II, 2008, : 1997 - 2002
  • [27] Comparison of Facial Soft Tissue Measurements on Three-Dimensional Images and Models Obtained With Different Methods
    Germec-Cakan, Derya
    Canter, Halil Ibrahim
    Nur, Burcu
    Arun, Tulin
    JOURNAL OF CRANIOFACIAL SURGERY, 2010, 21 (05) : 1393 - 1399
  • [28] Three-Phase Asynchronous Motor Fault Diagnosis Using Attention Mechanism and Hybrid CNN-MLP By Multi-Sensor Information
    Zhou, Yi
    Shang, Qianming
    Guan, Cong
    IEEE ACCESS, 2023, 11 : 98402 - 98414
  • [29] Three-level hybrid vs. flat MPI on the Earth Simulator: Parallel iterative solvers for finite-element method
    Nakajima, K
    APPLIED NUMERICAL MATHEMATICS, 2005, 54 (02) : 237 - 255
  • [30] Three-dimensional parallel frequency-domain visco-acoustic wave modelling based on a hybrid direct/iterative solver
    Sourbier, Florent
    Haidar, Azzam
    Giraud, Luc
    Ben-Hadj-Ali, Hafedh
    Operto, Stephane
    Virieux, Jean
    GEOPHYSICAL PROSPECTING, 2011, 59 (05) : 834 - 856