Topology-Aware Data Aggregation for Intensive I/O on Large-Scale Supercomputers

被引：0

作者：

Tessier, Francois ^{[1
]}

Malakar, Preeti ^{[1
]}

Vishwanath, Venkatram ^{[1
]}

Jeannot, Emmanuel ^{[2
]}

Isaila, Florin ^{[3
]}

机构：

[1] Argonne Natl Lab, Argonne Leadership Comp Facil, Lemont, IL 60439 USA

[2] Inria Bordeaux Sud Ouest, Talence, France

[3] Univ Carlos III, Madrid, Spain

来源：

PROCEEDINGS OF FIRST WORKSHOP ON OPTIMIZATION OF COMMUNICATION IN HPC RUNTIME SYSTEMS (COM-HPC 2016) | 2016年

关键词：

D O I：

10.1109/COM-HPC.2016.13

中图分类号：

TP301 [理论、方法];

学科分类号：

081202 ;

摘要：

Reading and writing data efficiently from storage systems is critical for high performance data-centric applications. These I/O systems are being increasingly characterized by complex topologies and deeper memory hierarchies. Effective parallel I/O solutions are needed to scale applications on current and future supercomputers. Data aggregation is an efficient approach consisting of electing some processes in charge of aggregating data from a set of neighbors and writing the aggregated data into storage. Thus, the bandwidth use can be optimized while the contention is reduced. In this work, we take into account the network topology for mapping aggregators and we propose an optimized buffering system in order to reduce the aggregation cost. We validate our approach using micro-benchmarks and the I/O kernel of a large-scale cosmology simulation. We show improvements up to 15x faster for I/O operations compared to a standard implementation of MPI I/O.

引用

页码：73 / 81

页数：9

共 50 条

[1] TAPIOCA: An I/O Library for Optimized Topology-Aware Data Aggregation on Large-Scale Supercomputers
Tessier, Francois
Vishwanath, Venkatram
Jeannot, Emmanuel
[J]. 2017 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING (CLUSTER), 2017, : 70 - 80
[2] Topology-aware algorithms for large-scale communication
Rodrigues, L
Veríssimo, P
[J]. ADVANCES IN DISTRIBUTED SYSTEMS: ADVANCED DISTRIBUTED COMPUTING: FROM ALGORITHMS TO SYSTEMS, 2000, 1752 : 127 - 156
[3] Topology-aware algorithms for large-scale communication
Rodrigues, Luís
Veríssimo, Paulo
[J]. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2000, 1752 : 127 - 156
[4] Large-Scale Experiment for Topology-Aware Resource Management
Georgiou, Yiannis
Mercier, Guillaume
Villiermet, Adele
[J]. EURO-PAR 2017: PARALLEL PROCESSING WORKSHOPS, 2018, 10659 : 179 - 186
[5] Topology-Aware Mappings for Large-Scale Eigenvalue Problems
Aktulga, Hasan Metin
Yang, Chao
Ng, Esmond G.
Maris, Pieter
Vary, James P.
[J]. EURO-PAR 2012 PARALLEL PROCESSING, 2012, 7484 : 830 - 842
[6] Topology-aware Sparse Allreduce for Large-scale Deep Learning
Thao Nguyen Truong
Wahib, Mohamed
Takano, Ryousei
[J]. 2019 IEEE 38TH INTERNATIONAL PERFORMANCE COMPUTING AND COMMUNICATIONS CONFERENCE (IPCCC), 2019,
[7] A scheduling framework for large-scale, parallel, and topology-aware applications
Kravtsov, Valentin
Bar, Pavel
Carmeli, David
Schuster, Assaf
Swain, Martin
[J]. JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2010, 70 (09) : 983 - 992
[8] Improving Large-scale Storage System Performance via Topology-aware and Balanced Data Placement
Wang, Feiyi
Oral, Sarp
Gupta, Saurabh
Tiwari, Devesh
Vazhkudai, Sudharshan S.
[J]. 2014 20TH IEEE INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED SYSTEMS (ICPADS), 2014, : 656 - 663
[9] Improving Collective MPI-IO Using Topology-Aware Stepwise Data Aggregation with I/O Throttling
Tsujita, Yuichi
Hori, Atsushi
Kameyama, Toyohisa
Uno, Atsuya
Shoji, Fumiyoshi
Ishikawa, Yutaka
[J]. PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING IN ASIA-PACIFIC REGION (HPC ASIA 2018), 2018, : 12 - 23
[10] Communication Characterization and Optimization of Applications Using Topology-Aware Task Mapping on Large Supercomputers
Sreepathi, Sarat
D'Azevedo, Ed
Philip, Bobby
Worley, Patrick
[J]. PROCEEDINGS OF THE 2016 ACM/SPEC INTERNATIONAL CONFERENCE ON PERFORMANCE ENGINEERING (ICPE'16), 2016, : 225 - 236

← 1 2 3 4 5 →