Strategies for Fast I/O Throughput in Large-scale Climate Modeling Applications

被引:0
|
作者
Sen, Koushik [1 ]
Vadhiyar, Sathish [2 ]
Vinayachandran, P. N. [3 ]
机构
[1] Qualcomm, Hyderabad, India
[2] Indian Inst Sci, Dept Computat & Data Sci, Bangalore, Karnataka, India
[3] Indian Inst Sci, Ctr Atmospher & Ocean Sci, Bangalore, Karnataka, India
来源
2023 IEEE 30TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING, DATA, AND ANALYTICS, HIPC 2023 | 2023年
关键词
Parallel I/O; ROMS climate model; Collective I/O; Lustre striping; netCDF;
D O I
10.1109/HiPC58850.2023.00038
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Large-scale HPC applications are highly data-intensive with significant times spent in I/O operations. Many large-scale scientific applications do not adequately optimize the I/O operations, leading to overall poor performance. In this work, we have developed two main strategies for providing fast I/O throughput for an important climate modeling application, namely, Regional Ocean Modeling System (ROMS) that uses NetCDF for I/O operations. The strategies include load balancing the I/O operations and selective writing of data. We have also implemented file striping to improve I/O performance. Our experiments with up to 1440 processor cores and 5 days of simulations showed that our load balancing strategy resulted in about 27% decrease in execution times over the default executions, our selective writing strategy resulted in a further decrease of about 30% and the optimized file striping resulted in a further decrease of about 12% in execution times. All the strategies combined together improved the overall performance of the application by about 70%.
引用
收藏
页码:203 / 212
页数:10
相关论文
共 50 条
  • [41] A gossip-based reliable multicast for large-scale high-throughput applications
    Sun, QX
    Sturman, DC
    DSN 2000: INTERNATIONAL CONFERENCE ON DEPENDABLE SYSTEMS AND NETWORKS, PROCEEDINGS, 2000, : 347 - 358
  • [42] Detailed Analysis of I/O traces for large scale applications
    Nakka, N.
    Choudhary, A.
    Liao, W. K.
    Ward, L.
    Klundt, R.
    Weston, M. I.
    16TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING (HIPC), PROCEEDINGS, 2009, : 419 - 427
  • [43] Turbo-GTS: A Fast Framework of Optimizing Task Throughput for Large-Scale Mobile Crowdsourcing
    Li, Wei
    Chen, Haiquan
    Ku, Wei-Shinn
    Qin, Xiao
    ACM TRANSACTIONS ON SPATIAL ALGORITHMS AND SYSTEMS, 2020, 6 (01)
  • [44] STRATEGIES FOR LARGE-SCALE EDUCATIONAL REFORM
    GARTNER, A
    RIESSMAN, F
    TEACHERS COLLEGE RECORD, 1974, 75 (03): : 349 - 355
  • [45] Strategies for Large-scale Production of Polyhydroxyalkanoates
    Kaur, G.
    Roy, I.
    CHEMICAL AND BIOCHEMICAL ENGINEERING QUARTERLY, 2015, 29 (02) : 157 - 172
  • [46] Recent Survey of Large-Scale Systems: Architectures, Controller Strategies, and Industrial Applications
    Kordestani, Mojtaba
    Safavi, Ali Akbar
    Saif, Mehrdad
    IEEE SYSTEMS JOURNAL, 2021, 15 (04): : 5440 - 5453
  • [47] An I/O Efficient Model Checking Algorithm for Large-Scale Systems
    Wu, Lijun
    Huang, Huijia
    Su, Kaile
    Cai, Shaowei
    Zhang, Xiaosong
    IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, 2015, 23 (05) : 905 - 915
  • [48] Strategic directions in storage I/O issues in large-scale computing
    Gibson, GA
    Vitter, JS
    Wilkes, J
    ACM COMPUTING SURVEYS, 1996, 28 (04) : 779 - 793
  • [49] On Reducing I/O Overheads in Large-Scale Invariant Subspace Projections
    Aktulga, Hasan Metin
    Yang, Chao
    Catalyurek, Umit V.
    Maris, Pieter
    Vary, James P.
    Ng, Esmond G.
    EURO-PAR 2011: PARALLEL PROCESSING WORKSHOPS, PT I, 2012, 7155 : 305 - 314
  • [50] A study of I/O methods for parallel visualization of large-scale data
    Yu, HF
    Ma, KL
    PARALLEL COMPUTING, 2005, 31 (02) : 167 - 183