Hierarchical Collective I/O Scheduling for High-Performance Computing

被引:9
|
作者
Liu, Jialin [1 ]
Zhuang, Yu [1 ]
Chen, Yong [1 ]
机构
[1] Texas Tech Univ, Dept Comp Sci, Lubbock, TX 79409 USA
基金
美国国家科学基金会;
关键词
Collective I/O; Scheduling; High-performance computing; Big data; Data intensive computing;
D O I
10.1016/j.bdr.2015.01.007
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The non-contiguous access pattern of many scientific applications results in a large number of I/O requests, which can seriously limit the data-access performance. Collective I/O has been widely used to address this issue. However, the performance of collective I/O could be dramatically degraded in today's high-performance computing systems due to the increasing shuffle cost caused by highly concurrent data accesses. This situation tends to be even worse as many applications become more and more data intensive. Previous research has primarily focused on optimizing I/O access cost in collective I/O but largely ignored the shuffle cost involved. Previous works assume that the lowest average response time leads to the best QoS and performance, while that is not always true for collective requests when considering the additional shuffle cost. In this study, we propose a new hierarchical I/O scheduling (HIO) algorithm to address the increasing shuffle cost in collective I/O. The fundamental idea is to schedule applications' I/O requests based on a shuffle cost analysis to achieve the optimal overall performance, instead of achieving optimal I/O accesses only. The algorithm is currently evaluated with the MPICH3 and PVFS2. Both theoretical analysis and experimental tests show that the proposed hierarchical I/O scheduling has a potential in addressing the degraded performance issue of collective I/O with highly concurrent accesses. (C) 2015 Elsevier Inc. All rights reserved.
引用
收藏
页码:117 / 126
页数:10
相关论文
共 50 条
  • [1] Hierarchical I/O Scheduling for Collective I/O
    Liu, Jialin
    Chen, Yong
    Zhuang, Yu
    [J]. PROCEEDINGS OF THE 2013 13TH IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND GRID COMPUTING (CCGRID 2013), 2013, : 211 - 218
  • [2] A Checkpoint of Research on Parallel I/O for High-Performance Computing
    Boito, Francieli Zanon
    Inacio, Eduardo C.
    Bez, Jean Luca
    Navaux, Philippe O. A.
    Dantas, Mario A. R.
    Denneulin, Yves
    [J]. ACM COMPUTING SURVEYS, 2018, 51 (02)
  • [3] Scalable I/O Forwarding Framework for High-Performance Computing Systems
    Ali, Nawab
    Carns, Philip
    Iskra, Kamil
    Kimpe, Dries
    Lang, Samuel
    Latham, Robert
    Ross, Robert
    Ward, Lee
    Sadayappan, P.
    [J]. 2009 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING AND WORKSHOPS, 2009, : 86 - +
  • [4] An efficient adaptive scheduling policy for high-performance computing
    Abawajy, J. H.
    [J]. FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2009, 25 (03): : 364 - 370
  • [5] iTransformer: Using SSD to Improve Disk Scheduling for High-performance I/O
    Zhang, Xuechen
    Davis, Kei
    Jiang, Song
    [J]. 2012 IEEE 26TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM (IPDPS), 2012, : 715 - 726
  • [6] TECHNIQUES FOR SCHEDULING I/O IN A HIGH-PERFORMANCE MULTIMEDIA-ON-DEMAND SERVER
    JADAV, D
    SRINILTA, C
    CHOUDHARY, A
    BERRA, PB
    [J]. JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 1995, 30 (02) : 190 - 203
  • [7] IKAROS: A scalable I/O framework for high-performance computing systems.
    Filippidis, Christos
    Tsanakas, Panayiotis
    Cotronis, Yiannis
    [J]. JOURNAL OF SYSTEMS AND SOFTWARE, 2016, 118 : 277 - 287
  • [8] Probabilistic scheduling of dynamic I/O requests via application clustering for burst-buffers equipped high-performance computing
    Zha, Benbo
    Shen, Hong
    [J]. CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2024, 36 (19):
  • [9] Parallel Simulation of Tasks Scheduling and Scheduling Criteria in High-performance Computing Systems
    Skrinarova, Jarmila
    Povinsky, Michal
    [J]. JOURNAL OF INFORMATION AND ORGANIZATIONAL SCIENCES, 2019, 43 (02) : 211 - 228
  • [10] Modeling I/O performance variability in high-performance computing systems using mixture distributions
    Xu, Li
    Wang, Yueyao
    Lux, Thomas
    Chang, Tyler
    Bernard, Jon
    Li, Bo
    Hong, Yili
    Cameron, Kirk
    Watson, Layne
    [J]. JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2020, 139 : 87 - 98