共 50 条
- [31] Job migration in HPC clusters by means of checkpoint/restart The Journal of Supercomputing, 2019, 75 : 6517 - 6541
- [32] An Examination of the Impact of Failure Distribution on Coordinated Checkpoint/Restart PROCEEDINGS OF THE ACM WORKSHOP ON FAULT-TOLERANCE FOR HPC AT EXTREME SCALE (FTXS'16), 2016, : 35 - 42
- [33] Job migration in HPC clusters by means of checkpoint/restart JOURNAL OF SUPERCOMPUTING, 2019, 75 (10): : 6517 - 6541
- [34] Efficient Encoding and Reconstruction of HPC Datasets for Checkpoint/Restart 2019 35TH SYMPOSIUM ON MASS STORAGE SYSTEMS AND TECHNOLOGIES (MSST 2019), 2019, : 79 - 91
- [35] Message logging optimization for wireless networks 20TH IEEE SYMPOSIUM ON RELIABLE DISTRIBUTED SYSTEMS, PROCEEDINGS, 2001, : 182 - 185
- [36] The cost of recovery in message logging protocols SEVENTEENTH IEEE SYMPOSIUM ON RELIABLE DISTRIBUTED SYSTEMS, PROCEEDINGS, 1998, : 10 - 18
- [37] Improving Message Logging Protocols Scalability through Distributed Event Logging EURO-PAR 2010 PARALLEL PROCESSING, PT I, 2010, 6271 : 511 - 522
- [39] An efficient algorithm for causal message logging SEVENTEENTH IEEE SYMPOSIUM ON RELIABLE DISTRIBUTED SYSTEMS, PROCEEDINGS, 1998, : 19 - 25
- [40] CMLOG: A common message logging system ACCELERATOR AND LARGE EXPERIMENTAL PHYSICS CONTROL SYSTEMS, 1997, : 358 - 363