A Heterogeneity-Aware Region-Level Data Layout for Hybrid Parallel File Systems

被引:13
|
作者
He, Shuibing [1 ,2 ,3 ]
Sun, Xian-He [2 ]
Wang, Yang [4 ]
Kougkas, Antonis [2 ]
Haider, Adnan [2 ]
机构
[1] Wuhan Univ, Sch Comp, Wuhan 430072, Hubei, Peoples R China
[2] IIT, Dept Comp Sci, Chicago, IL 60616 USA
[3] Natl Univ Def Technol, State Key Lab High Performance Comp, Changsha 410073, Hunan, Peoples R China
[4] Chinese Acad Sci, Shenzhen Inst Adv Technol, Shenzhen, Peoples R China
关键词
Parallel I/O System; Parallel File system; Solid State Drive; Data Layout;
D O I
10.1109/ICPP.2015.43
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Parallel file systems (PFS) are commonly used in high-end computing systems. With the emergence of solid state drives (SSD), hybrid PFSs, which consist of both HDD and SSD servers, provide a practical I/O system solution for data-intensive applications. However, most existing PFS layout schemes are inefficient for hybrid PFSs due to their lack of awareness of the performance differences between heterogeneous servers and the workload changes between different parts of a file. This lack of recognition can result in severe I/O performance degradation. In this study, we propose a heterogeneity-aware region-level (HARL) data layout scheme to improve the data distribution of a hybrid PFS. HARL first divides a file into fine-grained, varying sized regions according to the changes of an application's I/O workload, then chooses appropriate file stripe sizes on heterogeneous servers based on the server performance for each file region. Experimental results of representative benchmarks show that HARL can greatly improve the I/O system performance.
引用
收藏
页码:340 / 349
页数:10
相关论文
共 50 条
  • [41] Design and Implementation of a Criticality- and Heterogeneity-Aware Runtime System for Task-Parallel Applications
    Han, Myeonggyun
    Park, Jinsu
    Baek, Woongki
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2021, 32 (05) : 1117 - 1132
  • [42] FedDM: Data and Model Heterogeneity-Aware Federated Learning via Dynamic Weight Sharing
    Shen, Leming
    Zheng, Yuanqing
    2023 IEEE 43RD INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS, ICDCS, 2023, : 975 - 976
  • [43] On the impact of heterogeneity-aware mesh partitioning and non-contributing computation removal on parallel reservoir simulations
    Thune, Andreas
    Cai, Xing
    Rustad, Alf Birger
    JOURNAL OF MATHEMATICS IN INDUSTRY, 2021, 11 (01)
  • [44] On the impact of heterogeneity-aware mesh partitioning and non-contributing computation removal on parallel reservoir simulations
    Andreas Thune
    Xing Cai
    Alf Birger Rustad
    Journal of Mathematics in Industry, 11
  • [45] Facial Landmarks Based Region-Level Data Augmentation for Gaze Estimation
    Yang, Zhuo
    Ren, Luqian
    Zhu, Jian
    Wu, Wenyan
    Wang, Rui
    ADVANCES IN COMPUTER GRAPHICS, CGI 2022, 2022, 13443 : 107 - 116
  • [46] A Parallel File System with Application-Aware Data Layout Policies for Massive Remote Sensing Image Processing in Digital Earth
    Wang, Lizhe
    Ma, Yan
    Zomaya, Albert Y.
    Ranjan, Rajiv
    Chen, Dan
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2015, 26 (06) : 1497 - 1508
  • [47] HaaS: Cloud-based Real-time Data Analytics with Heterogeneity-aware Scheduling
    He, Jiong
    Chen, Yao
    Fu, Tom Z. J.
    Long, Xin
    Winslett, Marianne
    You, Liang
    Zhang, Zhenjie
    2018 IEEE 38TH INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS (ICDCS), 2018, : 1017 - 1028
  • [48] Multi-level heterogeneity-aware energy-efficient clustering technique for wireless sensor networks
    Sharma S.
    Bansal R.K.
    Bansal S.
    Sharma, S. (sukhwinder.sharma83@gmail.com), 1600, Begell House Inc. (79): : 903 - 917
  • [49] Toward heterogeneity-aware device-to-device data dissemination over Wi-Fi networks
    Hamidouche, Lyes
    Monnet, Sebastien
    Sens, Pierre
    Refauvelet, Dimitri
    2017 IEEE 23RD INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED SYSTEMS (ICPADS), 2017, : 105 - 112
  • [50] Efficient structured data access in parallel file systems
    Ching, A
    Choudhary, A
    Liao, WK
    Ross, R
    Gropp, W
    IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING, PROCEEDINGS, 2003, : 326 - 335