共 50 条
- [1] An optimal checkpoint/restart model for a large scale High Performance Computing system [J]. 2008 IEEE INTERNATIONAL SYMPOSIUM ON PARALLEL & DISTRIBUTED PROCESSING, VOLS 1-8, 2008, : 1491 - +
- [2] A survey of fault tolerance mechanisms and checkpoint/restart implementations for high performance computing systems [J]. JOURNAL OF SUPERCOMPUTING, 2013, 65 (03): : 1302 - 1326
- [3] A survey of fault tolerance mechanisms and checkpoint/restart implementations for high performance computing systems [J]. The Journal of Supercomputing, 2013, 65 : 1302 - 1326
- [4] Combining XOR and Partner Checkpointing for Resilient Multilevel Checkpoint/Restart [J]. 2021 IEEE 35TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM (IPDPS), 2021, : 277 - 288
- [5] Integrating FPGAs in High-Performance Computing: Introduction [J]. FPGA 2007: FIFTEENTH ACM/SIGDA INTERNATIONAL SYMPOSIUM ON FIELD-PROGRAMMABLE GATE ARRAYS, 2007, : 131 - 131
- [6] Leveraging Near Data Processing for High-Performance Checkpoint/Restart [J]. SC'17: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE FOR HIGH PERFORMANCE COMPUTING, NETWORKING, STORAGE AND ANALYSIS, 2017,
- [7] Evaluating and Optimizing OpenCL Kernels for High Performance Computing with FPGAs [J]. SC '16: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE FOR HIGH PERFORMANCE COMPUTING, NETWORKING, STORAGE AND ANALYSIS, 2016, : 409 - 420
- [8] Multilevel Checkpoint/Restart for Large Computational Jobs on Distributed Computing Resources [J]. 2019 IEEE 38TH INTERNATIONAL SYMPOSIUM ON RELIABLE DISTRIBUTED SYSTEMS (SRDS 2019), 2019, : 143 - 152
- [9] System-level Scalable Checkpoint-Restart for Petascale Computing [J]. 2016 IEEE 22ND INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED SYSTEMS (ICPADS), 2016, : 932 - 941
- [10] Integrating FPGAs in High-Performance Computing: The Architecture and Implementation Perspective [J]. FPGA 2007: FIFTEENTH ACM/SIGDA INTERNATIONAL SYMPOSIUM ON FIELD-PROGRAMMABLE GATE ARRAYS, 2007, : 132 - 132