Tuning Parallel Data Compression and I/O for Large-scale Earthquake Simulation

被引:7
|
作者
Tang, Houjun [1 ]
Byna, Suren [1 ]
Petersson, N. Anders [2 ]
McCallen, David [1 ,3 ]
机构
[1] Lawrence Berkeley Natl Lab, Berkeley, CA 94720 USA
[2] Lawrence Livermore Natl Lab, Livermore, CA 94550 USA
[3] Univ Nevada, Reno, NV 89557 USA
关键词
D O I
10.1109/BigData52589.2021.9671876
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Scientific applications, such as those simulating earthquakes, the origins of universe, etc., often produce massive amounts of data as high-performance computing (HPC) systems are moving toward exascale. The ever-increasing volumes of data are posing challenges for scientists to store, share, analyze, and visualize. Compression algorithms have become a crucial component for data management in scientific workflows. Data reduction enables simulations to output more data without worrying about exceeding storage quotas, and could capture more insights in the simulation. However, due to the complexity and poor performance of I/O and compression libraries as well as parallel file systems, the overall compression and I/O performance varies significantly. In this paper, we explore tuning parallel compression of data produced by a large-scale earthquake simulation. We show that our strategies achieve up to 13X performance improvement and a compression ratio of up to 251.
引用
收藏
页码:2992 / 2997
页数:6
相关论文
共 50 条
  • [1] A study of I/O methods for parallel visualization of large-scale data
    Yu, HF
    Ma, KL
    PARALLEL COMPUTING, 2005, 31 (02) : 167 - 183
  • [2] Parallel Tensor Compression for Large-Scale Scientific Data
    Austin, Woody
    Ballard, Grey
    Kolda, Tamara G.
    2016 IEEE 30TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM (IPDPS 2016), 2016, : 912 - 922
  • [3] Specification of a parallel I/O control agent for large-scale simulation users
    Matsuoka, Y
    Matsuyama, H
    Koganeyania, M
    Watanabe, C
    Ueshima, Y
    Joe, K
    PDPTA'03: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED PROCESSING TECHNIQUES AND APPLICATIONS, VOLS 1-4, 2003, : 1120 - 1126
  • [4] Applying Practical Parallel Grammar Compression to Large-scale Data
    Matsushita, Masaki
    Inoguchi, Yasushi
    DCC 2022: 2022 DATA COMPRESSION CONFERENCE (DCC), 2022, : 473 - 473
  • [5] Parallel simulation of large-scale parallel applications
    Bagrodia, R
    Deelman, E
    Phan, T
    INTERNATIONAL JOURNAL OF HIGH PERFORMANCE COMPUTING APPLICATIONS, 2001, 15 (01): : 3 - 12
  • [6] Parallel Compression and Indexing of Large-Scale Geospatial Raster Data with GPGPUs
    Kaligirwa, Nathalie
    Leal, Eleazar
    Gruenwald, Le
    Zhang, Jianting
    You, Simin
    2017 IEEE 6TH INTERNATIONAL CONGRESS ON BIG DATA (BIGDATA CONGRESS 2017), 2017, : 137 - 144
  • [7] Scalable I/O of large-scale molecular dynamics simulations: A data-compression algorithm
    Omeltchenko, A
    Campbell, TJ
    Kalia, RK
    Liu, XL
    Nakano, A
    Vashishta, P
    COMPUTER PHYSICS COMMUNICATIONS, 2000, 131 (1-2) : 78 - 85
  • [8] Parallel implementation of large-scale CFD data compression toward aeroacoustic analysis
    Sakai, Ryotaro
    Sasaki, Daisuke
    Nakahashi, Kazuhiro
    COMPUTERS & FLUIDS, 2013, 80 : 116 - 127
  • [9] Large-scale parallel data clustering
    Judd, D
    McKinley, PK
    Jain, AK
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1998, 20 (08) : 871 - 876
  • [10] Parallel simulation techniques for large-scale networks
    Bhatt, S
    Fujimoto, R
    Ogielski, A
    Perumalla, K
    IEEE COMMUNICATIONS MAGAZINE, 1998, 36 (08) : 42 - 47