Improving collective I/O performance using threads

被引:19
|
作者
Dickens, PM [1 ]
Thakur, R [1 ]
机构
[1] IIT, Dept Comp Sci, Chicago, IL 60616 USA
关键词
D O I
10.1109/IPPS.1999.760432
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Massively parallel computers are increasingly being used to solve large, I/O intensive applications in many different fields. For such applications, the I/O requirements quite often present a significant obstacle in the way of achieving good performance, and an important area of current research is the development of techniques by which these costs can be reduced. One mch approach is collective I/O, where the processors cooperatively develop an I/O strategy that reduces the number and increases the size, of I/O requests, making a much better use of the I/O subsystem. Collective I/O has been shown to significantly reduce the cost of performing I/O in many large, parallel applications, and for this reason serves as an important base upon which we can explore other mechanisms which can further reduce these costs. One promising approach is to use threads to perform the collective I/O in the background while the main thread continues with other computation in the foreground In this paper we explore the issues associated with implementing collective I/O in the background using threads. The most natural approach is to simply spawn off an I/O thread to perform the collective I/O in the background while the main thread continues with other computation. However our research demonstrates that this approach is frequently the worst implementation option, often performing much more poorly than just executing collective I/O completely in the foreground. To improve the performance of thread-based collective I/O, we developed an alternate approach where part of the collective I/O operation is performed in the background and part is performed in the foreground We demonstrate that this not technique can significantly improve the performance of thread-based collective I/O, providing Icp to art 80% improvement over sequential collective I/O (where there is no attempt to overlap computation with I/O). Also, we discuss one very important application of this research which is the implementation of the split-collective parallel I/O operations defined in MPI 2.0.
引用
收藏
页码:38 / 45
页数:8
相关论文
共 50 条
  • [1] Improving Collective I/O Performance Using Pipelined Two-Phase I/O
    Tsujita, Yuichi
    Muguruma, Hidetaka
    Yoshinaga, Kazumi
    Hori, Atsushi
    Namiki, Mitaro
    Ishikawa, Yutaka
    [J]. HIGH PERFORMANCE COMPUTING SYMPOSIUM 2012 (HPC 2012), 2012, 44 (06): : 34 - 41
  • [2] Collective buffering: Improving parallel I/O performance
    Nitzberg, B
    Lo, V
    [J]. SIXTH IEEE INTERNATIONAL SYMPOSIUM ON HIGH PERFORMANCE DISTRIBUTED COMPUTING, PROCEEDINGS, 1997, : 148 - 157
  • [3] Improving Collective I/O Performance Using Non-Volatile Memory Devices
    Congiu, Giuseppe
    Narasimhamurthy, Sai
    Suess, Tim
    Brinkmann, Andre
    [J]. 2016 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING (CLUSTER), 2016, : 120 - 129
  • [4] Improving Collective I/O Performance with Machine Learning Supported Auto-tuning
    Bagbaba, Ayse
    [J]. 2020 IEEE 34TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS (IPDPSW 2020), 2020, : 814 - 821
  • [5] Improving the Performance of HDFS by Reducing I/O Using Adaptable I/O System
    Park, Jung Kyu
    [J]. 2016 INTERNATIONAL CONFERENCE ON ELECTRICAL, ELECTRONICS, AND OPTIMIZATION TECHNIQUES (ICEEOT), 2016, : 3139 - 3144
  • [6] Improving the Average Response Time in Collective I/O
    Jin, Chen
    Sehrish, Saba
    Liao, Wei-keng
    Choudhary, Alok
    Schuchardt, Karen
    [J]. RECENT ADVANCES IN THE MESSAGE PASSING INTERFACE, 2011, 6960 : 71 - +
  • [7] Improving the performance of remote I/O using asynchronous primitives
    Ali, Nawab
    Lauria, Mario
    [J]. HPDC-15: PROCEEDINGS OF THE 15TH IEEE INTERNATIONAL SYMPOSIUM ON HIGH PERFORMANCE DISTRIBUTED COMPUTING, 2005, : 218 - 228
  • [8] Transparent Asynchronous Parallel I/O Using Background Threads
    Tang, Houjun
    Koziol, Quincey
    Ravi, John
    Byna, Suren
    [J]. IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2022, 33 (04) : 891 - 902
  • [9] Enabling Transparent Asynchronous I/O using Background Threads
    Tang, Houjun
    Koziol, Quincey
    Byna, Suren
    Mainzer, John
    Li, Tonglin
    [J]. PROCEEDINGS OF PDSW 2019: 2019 IEEE/ACM FOURTH INTERNATIONAL PARALLEL DATA SYSTEMS WORKSHOP (PDSW), 2019, : 11 - 19
  • [10] Performance Models for Communication in Collective I/O Operations
    Jha, Shweta
    Gabriel, Edgar
    [J]. 2017 17TH IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND GRID COMPUTING (CCGRID), 2017, : 982 - 991