Improving I/O Performance for Exascale Applications Through Online Data Layout Reorganization

被引:9
|
作者
Wan, Lipeng [1 ]
Huebl, Axel [2 ]
Gu, Junmin [2 ]
Poeschel, Franz [3 ]
Gainaru, Ana [1 ]
Wang, Ruonan [1 ]
Chen, Jieyang [1 ]
Liang, Xin [5 ]
Ganyushin, Dmitry [1 ]
Munson, Todd [4 ]
Foster, Ian [4 ]
Vay, Jean-Luc [2 ]
Podhorszki, Norbert [1 ]
Wu, Kesheng [2 ]
Klasky, Scott [1 ]
机构
[1] Oak Ridge Natl Lab, Comp Sci & Math Div, Oak Ridge, TN 37830 USA
[2] Lawrence Berkeley Natl Lab, Berkeley, CA 94720 USA
[3] Ctr Adv Syst Understanding CASUS, D-02826 Gorlitz, Germany
[4] Argonne Natl Lab, Lemont, IL 60439 USA
[5] Missouri Univ Sci & Technol, Rolla, MO 65409 USA
关键词
Layout; Arrays; Heuristic algorithms; Computational modeling; Performance evaluation; Optimization; Distributed databases; Parallel IO; data layout; IO performance; WarpX; data access optimization;
D O I
10.1109/TPDS.2021.3100784
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
The applications being developed within the U.S. Exascale Computing Project (ECP) to run on imminent Exascale computers will generate scientific results with unprecedented fidelity and record turn-around time. Many of these codes are based on particle-mesh methods and use advanced algorithms, especially dynamic load-balancing and mesh-refinement, to achieve high performance on Exascale machines. Yet, as such algorithms improve parallel application efficiency, they raise new challenges for I/O logic due to their irregular and dynamic data distributions. Thus, while the enormous data rates of Exascale simulations already challenge existing file system write strategies, the need for efficient read and processing of generated data introduces additional constraints on the data layout strategies that can be used when writing data to secondary storage. We review these I/O challenges and introduce two online data layout reorganization approaches for achieving good tradeoffs between read and write performance. We demonstrate the benefits of using these two approaches for the ECP particle-in-cell simulation WarpX, which serves as a motif for a large class of important Exascale applications. We show that by understanding application I/O patterns and carefully designing data layouts we can increase read performance by more than 80 percent.
引用
收藏
页码:878 / 890
页数:13
相关论文
共 50 条
  • [1] Improving cache performance in dynamic applications through data and computation reorganization at run time
    Ding, C
    Kennedy, K
    [J]. ACM SIGPLAN NOTICES, 1999, 34 (05) : 229 - 241
  • [2] Improving I/O Performance with Adaptive Data Compression for Big Data Applications
    Zou, Hongbo
    Yu, Yongen
    Tang, Wei
    Chen, Hsuanwei Michelle
    [J]. PROCEEDINGS OF 2014 IEEE INTERNATIONAL PARALLEL & DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS (IPDPSW), 2014, : 1229 - 1238
  • [3] Improving the performance of cluster applications through I/O proxy architecture
    Sanchez, Luis Miguel
    Isaila, F.
    [J]. 2006 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING, VOLS 1 AND 2, 2006, : 567 - +
  • [4] Improving Performance of Parallel I/O Systems through Selective and Layout-Aware SSD Cache
    He, Shuibing
    Wang, Yang
    Sun, Xian-He
    [J]. IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2016, 27 (10) : 2940 - 2952
  • [5] Improving I/O performance of applications through compiler-directed code restructuring
    Kandemir, Mahmut
    Son, Seung Woo
    Karakoy, Mustafa
    [J]. PROCEEDINGS OF THE 6TH USENIX CONFERENCE ON FILE AND STORAGE TECHNOLOGIES (FAST '08), 2008, : 159 - +
  • [6] Fine-Granular Computation and Data Layout Reorganization for Improving Locality
    Kandemir, Mahmut
    Tang, Xulong
    Kotra, Jagadish
    Karakoy, Mustafa
    [J]. 2022 IEEE/ACM INTERNATIONAL CONFERENCE ON COMPUTER AIDED DESIGN, ICCAD, 2022,
  • [7] Dynamic Data Layout Optimization for High Performance Parallel I/O
    Rush, Everett Neil
    Harris, Bryan
    Altiparmak, Nihat
    Tosan, Ali Saman
    [J]. PROCEEDINGS OF 2016 IEEE 23RD INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING (HIPC), 2016, : 132 - 141
  • [8] Characterizing I/O Workloads of HPC Applications Through Online Analysis
    Dong, Wenrui
    Liu, Guangming
    Yu, Jie
    Zuo, You
    [J]. 2015 IEEE 34TH INTERNATIONAL PERFORMANCE COMPUTING AND COMMUNICATIONS CONFERENCE (IPCCC), 2015,
  • [9] Improving the performance of I/O-intensive applications on clusters of workstations
    Xiao Qin
    Hong Jiang
    Yifeng Zhu
    David R. Swanson
    [J]. Cluster Computing, 2006, 9 : 297 - 311
  • [10] Improving the performance of I/O-intensive applications on clusters of workstations
    Qin, Xiao
    Jiang, Hong
    Zhu, Yifeng
    Swanson, David R.
    [J]. CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2006, 9 (03): : 297 - 311