Improving collective I/O performance using threads

被引：19

作者：

Dickens, PM ^{[1
]}

Thakur, R ^{[1
]}

机构：

[1] IIT, Dept Comp Sci, Chicago, IL 60616 USA

来源：

IPPS/SPDP 1999: 13TH INTERNATIONAL PARALLEL PROCESSING SYMPOSIUM & 10TH SYMPOSIUM ON PARALLEL AND DISTRIBUTED PROCESSING, PROCEEDINGS | 1999年

关键词：

D O I：

10.1109/IPPS.1999.760432

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

Massively parallel computers are increasingly being used to solve large, I/O intensive applications in many different fields. For such applications, the I/O requirements quite often present a significant obstacle in the way of achieving good performance, and an important area of current research is the development of techniques by which these costs can be reduced. One mch approach is collective I/O, where the processors cooperatively develop an I/O strategy that reduces the number and increases the size, of I/O requests, making a much better use of the I/O subsystem. Collective I/O has been shown to significantly reduce the cost of performing I/O in many large, parallel applications, and for this reason serves as an important base upon which we can explore other mechanisms which can further reduce these costs. One promising approach is to use threads to perform the collective I/O in the background while the main thread continues with other computation in the foreground In this paper we explore the issues associated with implementing collective I/O in the background using threads. The most natural approach is to simply spawn off an I/O thread to perform the collective I/O in the background while the main thread continues with other computation. However our research demonstrates that this approach is frequently the worst implementation option, often performing much more poorly than just executing collective I/O completely in the foreground. To improve the performance of thread-based collective I/O, we developed an alternate approach where part of the collective I/O operation is performed in the background and part is performed in the foreground We demonstrate that this not technique can significantly improve the performance of thread-based collective I/O, providing Icp to art 80% improvement over sequential collective I/O (where there is no attempt to overlap computation with I/O). Also, we discuss one very important application of this research which is the implementation of the split-collective parallel I/O operations defined in MPI 2.0.

引用

页码：38 / 45

页数：8

共 50 条

[1] Improving Collective I/O Performance Using Pipelined Two-Phase I/O
Tsujita, Yuichi
Muguruma, Hidetaka
Yoshinaga, Kazumi
Hori, Atsushi
Namiki, Mitaro
Ishikawa, Yutaka
[J]. HIGH PERFORMANCE COMPUTING SYMPOSIUM 2012 (HPC 2012), 2012, 44 (06): : 34 - 41
[2] Collective buffering: Improving parallel I/O performance
Nitzberg, B
Lo, V
[J]. SIXTH IEEE INTERNATIONAL SYMPOSIUM ON HIGH PERFORMANCE DISTRIBUTED COMPUTING, PROCEEDINGS, 1997, : 148 - 157
[3] Improving Collective I/O Performance Using Non-Volatile Memory Devices
Congiu, Giuseppe
Narasimhamurthy, Sai
Suess, Tim
Brinkmann, Andre
[J]. 2016 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING (CLUSTER), 2016, : 120 - 129
[4] Improving Collective I/O Performance with Machine Learning Supported Auto-tuning
Bagbaba, Ayse
[J]. 2020 IEEE 34TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS (IPDPSW 2020), 2020, : 814 - 821
[5] Improving the Performance of HDFS by Reducing I/O Using Adaptable I/O System
Park, Jung Kyu
[J]. 2016 INTERNATIONAL CONFERENCE ON ELECTRICAL, ELECTRONICS, AND OPTIMIZATION TECHNIQUES (ICEEOT), 2016, : 3139 - 3144
[6] Improving the Average Response Time in Collective I/O
Jin, Chen
Sehrish, Saba
Liao, Wei-keng
Choudhary, Alok
Schuchardt, Karen
[J]. RECENT ADVANCES IN THE MESSAGE PASSING INTERFACE, 2011, 6960 : 71 - +
[7] Improving the performance of remote I/O using asynchronous primitives
Ali, Nawab
Lauria, Mario
[J]. HPDC-15: PROCEEDINGS OF THE 15TH IEEE INTERNATIONAL SYMPOSIUM ON HIGH PERFORMANCE DISTRIBUTED COMPUTING, 2005, : 218 - 228
[8] Transparent Asynchronous Parallel I/O Using Background Threads
Tang, Houjun
Koziol, Quincey
Ravi, John
Byna, Suren
[J]. IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2022, 33 (04) : 891 - 902
[9] Enabling Transparent Asynchronous I/O using Background Threads
Tang, Houjun
Koziol, Quincey
Byna, Suren
Mainzer, John
Li, Tonglin
[J]. PROCEEDINGS OF PDSW 2019: 2019 IEEE/ACM FOURTH INTERNATIONAL PARALLEL DATA SYSTEMS WORKSHOP (PDSW), 2019, : 11 - 19
[10] Performance Models for Communication in Collective I/O Operations
Jha, Shweta
Gabriel, Edgar
[J]. 2017 17TH IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND GRID COMPUTING (CCGRID), 2017, : 982 - 991

← 1 2 3 4 5 →