Topology-Aware Data Aggregation for Intensive I/O on Large-Scale Supercomputers

被引:0
|
作者
Tessier, Francois [1 ]
Malakar, Preeti [1 ]
Vishwanath, Venkatram [1 ]
Jeannot, Emmanuel [2 ]
Isaila, Florin [3 ]
机构
[1] Argonne Natl Lab, Argonne Leadership Comp Facil, Lemont, IL 60439 USA
[2] Inria Bordeaux Sud Ouest, Talence, France
[3] Univ Carlos III, Madrid, Spain
关键词
D O I
10.1109/COM-HPC.2016.13
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Reading and writing data efficiently from storage systems is critical for high performance data-centric applications. These I/O systems are being increasingly characterized by complex topologies and deeper memory hierarchies. Effective parallel I/O solutions are needed to scale applications on current and future supercomputers. Data aggregation is an efficient approach consisting of electing some processes in charge of aggregating data from a set of neighbors and writing the aggregated data into storage. Thus, the bandwidth use can be optimized while the contention is reduced. In this work, we take into account the network topology for mapping aggregators and we propose an optimized buffering system in order to reduce the aggregation cost. We validate our approach using micro-benchmarks and the I/O kernel of a large-scale cosmology simulation. We show improvements up to 15x faster for I/O operations compared to a standard implementation of MPI I/O.
引用
收藏
页码:73 / 81
页数:9
相关论文
共 50 条
  • [1] TAPIOCA: An I/O Library for Optimized Topology-Aware Data Aggregation on Large-Scale Supercomputers
    Tessier, Francois
    Vishwanath, Venkatram
    Jeannot, Emmanuel
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING (CLUSTER), 2017, : 70 - 80
  • [2] Topology-aware algorithms for large-scale communication
    Rodrigues, L
    Veríssimo, P
    [J]. ADVANCES IN DISTRIBUTED SYSTEMS: ADVANCED DISTRIBUTED COMPUTING: FROM ALGORITHMS TO SYSTEMS, 2000, 1752 : 127 - 156
  • [3] Topology-aware algorithms for large-scale communication
    Rodrigues, Luís
    Veríssimo, Paulo
    [J]. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2000, 1752 : 127 - 156
  • [4] Large-Scale Experiment for Topology-Aware Resource Management
    Georgiou, Yiannis
    Mercier, Guillaume
    Villiermet, Adele
    [J]. EURO-PAR 2017: PARALLEL PROCESSING WORKSHOPS, 2018, 10659 : 179 - 186
  • [5] Topology-Aware Mappings for Large-Scale Eigenvalue Problems
    Aktulga, Hasan Metin
    Yang, Chao
    Ng, Esmond G.
    Maris, Pieter
    Vary, James P.
    [J]. EURO-PAR 2012 PARALLEL PROCESSING, 2012, 7484 : 830 - 842
  • [6] Topology-aware Sparse Allreduce for Large-scale Deep Learning
    Thao Nguyen Truong
    Wahib, Mohamed
    Takano, Ryousei
    [J]. 2019 IEEE 38TH INTERNATIONAL PERFORMANCE COMPUTING AND COMMUNICATIONS CONFERENCE (IPCCC), 2019,
  • [7] A scheduling framework for large-scale, parallel, and topology-aware applications
    Kravtsov, Valentin
    Bar, Pavel
    Carmeli, David
    Schuster, Assaf
    Swain, Martin
    [J]. JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2010, 70 (09) : 983 - 992
  • [8] Improving Large-scale Storage System Performance via Topology-aware and Balanced Data Placement
    Wang, Feiyi
    Oral, Sarp
    Gupta, Saurabh
    Tiwari, Devesh
    Vazhkudai, Sudharshan S.
    [J]. 2014 20TH IEEE INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED SYSTEMS (ICPADS), 2014, : 656 - 663
  • [9] Improving Collective MPI-IO Using Topology-Aware Stepwise Data Aggregation with I/O Throttling
    Tsujita, Yuichi
    Hori, Atsushi
    Kameyama, Toyohisa
    Uno, Atsuya
    Shoji, Fumiyoshi
    Ishikawa, Yutaka
    [J]. PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING IN ASIA-PACIFIC REGION (HPC ASIA 2018), 2018, : 12 - 23
  • [10] Communication Characterization and Optimization of Applications Using Topology-Aware Task Mapping on Large Supercomputers
    Sreepathi, Sarat
    D'Azevedo, Ed
    Philip, Bobby
    Worley, Patrick
    [J]. PROCEEDINGS OF THE 2016 ACM/SPEC INTERNATIONAL CONFERENCE ON PERFORMANCE ENGINEERING (ICPE'16), 2016, : 225 - 236