共 50 条
- [1] Distributed Throughput Optimization for Large-Scale Scientific Workflows Under Fault-Tolerance Constraint [J]. Journal of Grid Computing, 2013, 11 : 361 - 379
- [3] Fault tolerance in large-scale scientific computing [J]. PARALLEL PROCESSING FOR SCIENTIFIC COMPUTING, 2006, : 203 - 220
- [4] Interoperability strategies for GASPI and MPI in large-scale scientific applications [J]. INTERNATIONAL JOURNAL OF HIGH PERFORMANCE COMPUTING APPLICATIONS, 2019, 33 (03): : 554 - 568
- [5] Replication-based Fault-tolerance for Large-scale Graph Processing [J]. 2014 44TH ANNUAL IEEE/IFIP INTERNATIONAL CONFERENCE ON DEPENDABLE SYSTEMS AND NETWORKS (DSN), 2014, : 562 - 573
- [7] A portable fault-tolerance scheme for MPI [J]. INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED PROCESSING TECHNIQUES AND APPLICATIONS, VOLS I-IV, PROCEEDINGS, 1998, : 690 - 697
- [8] Low-cost fault-tolerance protocol for large-scale network monitoring [J]. COMPUTATIONAL SICENCE - ICCS 2003, PT III, PROCEEDINGS, 2003, 2659 : 504 - 513
- [9] EReinit: Scalable and efficient fault-tolerance for bulk-synchronous MPI applications [J]. CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2020, 32 (03):
- [10] Enhancing the fault-tolerance of nonmasking programs [J]. 23RD INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS, PROCEEDINGS, 2002, : 441 - 449