共 50 条
- [1] A survey of fault tolerance mechanisms and checkpoint/restart implementations for high performance computing systems [J]. JOURNAL OF SUPERCOMPUTING, 2013, 65 (03): : 1302 - 1326
- [2] Checkpoint/Restart and Beyond: Resilient High Performance Computing with FPGAs [J]. 2011 IEEE 19TH ANNUAL INTERNATIONAL SYMPOSIUM ON FIELD-PROGRAMMABLE CUSTOM COMPUTING MACHINES (FCCM), 2011, : 162 - 169
- [3] An optimal checkpoint/restart model for a large scale High Performance Computing system [J]. 2008 IEEE INTERNATIONAL SYMPOSIUM ON PARALLEL & DISTRIBUTED PROCESSING, VOLS 1-8, 2008, : 1491 - +
- [4] Methods and Tools to Increase Fault Tolerance of High-Performance Computing Systems [J]. 2016 39TH INTERNATIONAL CONVENTION ON INFORMATION AND COMMUNICATION TECHNOLOGY, ELECTRONICS AND MICROELECTRONICS (MIPRO), 2016, : 226 - 230
- [7] Survey of biological high performance computing: Algorithms, implementations and outlook research [J]. 2006 CANADIAN CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING, VOLS 1-5, 2006, : 1097 - +
- [8] Fault Tolerance in Cloud Computing - Survey [J]. 2015 11TH INTERNATIONAL COMPUTER ENGINEERING CONFERENCE (ICENCO), 2015, : 241 - 245
- [9] Algorithm-based fault tolerance for spaceborne computing: Basis and implementations [J]. 2000 IEEE AEROSPACE CONFERENCE PROCEEDINGS, VOL 4, 2000, : 411 - 420