Strategies for Fast I/O Throughput in Large-scale Climate Modeling Applications

被引:0
|
作者
Sen, Koushik [1 ]
Vadhiyar, Sathish [2 ]
Vinayachandran, P. N. [3 ]
机构
[1] Qualcomm, Hyderabad, India
[2] Indian Inst Sci, Dept Computat & Data Sci, Bangalore, Karnataka, India
[3] Indian Inst Sci, Ctr Atmospher & Ocean Sci, Bangalore, Karnataka, India
关键词
Parallel I/O; ROMS climate model; Collective I/O; Lustre striping; netCDF;
D O I
10.1109/HiPC58850.2023.00038
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Large-scale HPC applications are highly data-intensive with significant times spent in I/O operations. Many large-scale scientific applications do not adequately optimize the I/O operations, leading to overall poor performance. In this work, we have developed two main strategies for providing fast I/O throughput for an important climate modeling application, namely, Regional Ocean Modeling System (ROMS) that uses NetCDF for I/O operations. The strategies include load balancing the I/O operations and selective writing of data. We have also implemented file striping to improve I/O performance. Our experiments with up to 1440 processor cores and 5 days of simulations showed that our load balancing strategy resulted in about 27% decrease in execution times over the default executions, our selective writing strategy resulted in a further decrease of about 30% and the optimized file striping resulted in a further decrease of about 12% in execution times. All the strategies combined together improved the overall performance of the application by about 70%.
引用
收藏
页码:203 / 212
页数:10
相关论文
共 50 条
  • [31] LARGE-SCALE APPLICATIONS OF SUPERCONDUCTIVITY
    FONER, S
    SCHWARTZ, BB
    JOURNAL OF THE ELECTROCHEMICAL SOCIETY, 1979, 126 (03) : C153 - C153
  • [32] LARGE-SCALE INTERANNUAL VARIABILITY OF CLIMATE
    PARKER, DE
    METEOROLOGICAL MAGAZINE, 1982, 111 (1321): : 193 - 208
  • [33] Climate - Large-scale warming is not urban
    Parker, DE
    NATURE, 2004, 432 (7015) : 290 - 290
  • [34] Large-Scale Control on the Patagonian Climate
    Garreaud, R.
    Lopez, P.
    Minvielle, M.
    Rojas, M.
    JOURNAL OF CLIMATE, 2013, 26 (01) : 215 - 230
  • [35] A visualized parallel network simulator for modeling large-scale distributed applications
    Lin, Siming
    Cheng, Xueqi
    Lv, Jianming
    EIGHTH INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED COMPUTING, APPLICATIONS AND TECHNOLOGIES, PROCEEDINGS, 2007, : 339 - 346
  • [36] Capsule-Based User Interface Modeling for Large-Scale Applications
    Milicev, Dragan
    Mijailovic, Zarko
    IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 2013, 39 (09) : 1190 - 1207
  • [37] Fast large-scale reionization simulations
    Thomas, Rajat M.
    Zaroubi, Saleem
    Ciardi, Benedetta
    Pawlik, Andreas H.
    Labropoulos, Panagiotis
    Jelic, Vibor
    Bernardi, Gianni
    Brentjens, Michiel A.
    de Bruyn, A. G.
    Harker, Geraint J. A.
    Koopmans, Leon V. E.
    Mellema, Garrelt
    Pandey, V. N.
    Schaye, Joop
    Yatawatta, Sarod
    MONTHLY NOTICES OF THE ROYAL ASTRONOMICAL SOCIETY, 2009, 393 (01) : 32 - 48
  • [38] Fast Large-Scale Trajectory Clustering
    Wang, Sheng
    Bao, Zhifeng
    Culpepper, J. Shane
    Sellis, Timos
    Qin, Xiaolin
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2019, 13 (01): : 29 - 42
  • [39] Large-scale, high-throughput production of lentiviral vectors for multiple disease applications
    Ashworth, Rachel C.
    Mac Reamoinn, Eoin
    Gamlen, Toby P. E.
    Pringle, Ian A.
    Gill, Deborah R.
    Hyde, Stephen C.
    HUMAN GENE THERAPY, 2018, 29 (07) : A6 - A6
  • [40] Biophysical Modeling of Large-Scale Brain Dynamics and Applications for Computational Psychiatry
    Murray, John D.
    Demirtas, Murat
    Anticevic, Alan
    BIOLOGICAL PSYCHIATRY-COGNITIVE NEUROSCIENCE AND NEUROIMAGING, 2018, 3 (09) : 777 - 787