Scalable Incremental Checkpointing using GPU-Accelerated De-Duplication

被引:1
|
作者
Tan, Nigel [1 ]
Luettgau, Jakob [1 ]
Marquez, Jack [1 ]
Terianishi, Keita [2 ]
Morales, Nicolas [3 ]
Bhowmick, Sanjukta [4 ]
Cappello, Franck [5 ]
Taufer, Michela [1 ]
Nicolae, Bogdan [5 ]
机构
[1] Univ Tennessee Knoxville, Knoxville, TN 37996 USA
[2] Oak Ridge Natl Lab, Oak Ridge, TN USA
[3] Sandia Natl Labs, POB 5800, Albuquerque, NM 87185 USA
[4] Univ North Texas, Denton, TX USA
[5] Argonne Natl Lab, Lemont, IL USA
基金
美国国家科学基金会;
关键词
Checkpointing; data versioning; incremental storage; deduplication; GPU parallelization;
D O I
10.1145/3605573.3605639
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Writing large amounts of data concurrently to stable storage is a typical I/O pattern of many HPCworkflows. This pattern introduces high I/O overheads and results in increased storage space utilization especially for workflows that need to capture the evolution of data structures with high frequency as checkpoints. In this context, many applications, such as graph pattern matching, perform sparse updates to large data structures between checkpoints. For these applications, incremental checkpointing techniques that save only the differences from one checkpoint to another can dramatically reduce the checkpoint sizes, I/O bottlenecks, and storage space utilization. However, such techniques are not without challenges: it is non-trivial to transparently determine what data has changed since a previous checkpoint and assemble the differences in a compact fashion that does not result in excessive metadata. State-of-art data reduction techniques (e.g., compression and de-duplication) have significant limitations when applied to modern HPC applications that leverage GPUs: slow at detecting the differences, generate a large amount of metadata to keep track of the differences, and ignore crucial spatiotemporal checkpoint data redundancy. This paper addresses these challenges by proposing a Merkle tree-based incremental checkpointing method to exploit GPUs' high memory bandwidth and massive parallelism. Experimental results at scale show a significant reduction of the I/O overhead and space utilization of checkpointing compared with state-of-the-art incremental checkpointing and compression techniques.
引用
收藏
页码:665 / 674
页数:10
相关论文
共 50 条
  • [31] GPU-accelerated registration of hyperspectral images using KAZE features
    Ordonez, Alvaro
    Arguello, Francisco
    Heras, Dora B.
    Demir, Beguem
    [J]. JOURNAL OF SUPERCOMPUTING, 2020, 76 (12): : 9478 - 9492
  • [32] GPU-Accelerated Video Background Subtraction Using Gabor Detector
    Qin, Lixia
    Sheng, Bin
    Lin, Weiyao
    Wu, Wen
    Shen, Ruimin
    [J]. JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2015, 32 : 1 - 9
  • [33] GPU-accelerated registration of hyperspectral images using KAZE features
    Álvaro Ordóñez
    Francisco Argüello
    Dora B. Heras
    Begüm Demir
    [J]. The Journal of Supercomputing, 2020, 76 : 9478 - 9492
  • [34] GPU-Accelerated Texture Analysis Using Steerable Riesz Wavelets
    Vizitiu, Anamaria
    Itu, Lucian
    Joyseeree, Ranveer
    Depeursinge, Adrien
    Muller, Henning
    Suciu, Constantin
    [J]. 2016 24TH EUROMICRO INTERNATIONAL CONFERENCE ON PARALLEL, DISTRIBUTED, AND NETWORK-BASED PROCESSING (PDP), 2016, : 431 - 434
  • [35] GPU-Accelerated Computation for Texture Features using OpenCL Framework
    Saladin, Ahmad M.
    Jiao, Licheng
    Zhang, Xiangrong
    [J]. 2014 11TH INTERNATIONAL CONFERENCE ON ELECTRICAL ENGINEERING/ELECTRONICS, COMPUTER, TELECOMMUNICATIONS AND INFORMATION TECHNOLOGY (ECTI-CON), 2014,
  • [36] Estimating the WCET of GPU-Accelerated Applications using Hybrid Analysis
    Betts, Adam
    Donaldson, Alastair
    [J]. PROCEEDINGS OF THE 2013 25TH EUROMICRO CONFERENCE ON REAL-TIME SYSTEMS (ECRTS 2013), 2013, : 193 - 202
  • [37] Vispark: GPU-Accelerated Distributed Visual Computing Using Spark
    Choi, Woohyuk
    Jeong, Won-Ki
    [J]. 2015 IEEE 5TH SYMPOSIUM ON LARGE DATA ANALYSIS AND VISUALIZATION (LDAV), 2015, : 125 - 126
  • [38] GPU-Accelerated Query by Humming Using Modified SPRING Algorithm
    Yao, Guangchao
    Zheng, Yao
    Xiao, Limin
    Ruan, Li
    Li, Yongnan
    Zhang, Zhenzhong
    [J]. 2013 IEEE 13TH INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS (ICDMW), 2013, : 654 - 663
  • [39] Trial Somaliland Voting Register De-Duplication Using Iris Recognition
    Bowyer, Kevin W.
    Ortiz, Estefan
    Sgroi, Amanda
    [J]. 2015 11TH IEEE INTERNATIONAL CONFERENCE AND WORKSHOPS ON AUTOMATIC FACE AND GESTURE RECOGNITION (FG): BIOMETRICS IN THE WILD (B-WILD 2015), VOL 2, 2015,
  • [40] ExaTN: Scalable GPU-Accelerated High-Performance Processing of General Tensor Networks at Exascale
    Lyakh, Dmitry I.
    Nguyen, Thien
    Claudino, Daniel
    Dumitrescu, Eugene
    McCaskey, Alexander J.
    [J]. FRONTIERS IN APPLIED MATHEMATICS AND STATISTICS, 2022, 8