A Heterogeneity-Aware Region-Level Data Layout for Hybrid Parallel File Systems

Cited by: 13
Authors
He, Shuibing [1, 2, 3]
Sun, Xian-He [2]
Wang, Yang [4]
Kougkas, Antonis [2]
Haider, Adnan [2]
Affiliations
[1] Wuhan Univ, Sch Comp, Wuhan 430072, Hubei, Peoples R China
[2] IIT, Dept Comp Sci, Chicago, IL 60616 USA
[3] Natl Univ Def Technol, State Key Lab High Performance Comp, Changsha 410073, Hunan, Peoples R China
[4] Chinese Acad Sci, Shenzhen Inst Adv Technol, Shenzhen, Peoples R China
Keywords
Parallel I/O System; Parallel File System; Solid State Drive; Data Layout
DOI
10.1109/ICPP.2015.43
Chinese Library Classification (CLC) number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
Parallel file systems (PFSs) are commonly used in high-end computing systems. With the emergence of solid-state drives (SSDs), hybrid PFSs, which consist of both HDD and SSD servers, provide a practical I/O system solution for data-intensive applications. However, most existing PFS layout schemes are inefficient for hybrid PFSs because they are unaware of both the performance differences between heterogeneous servers and the workload changes between different parts of a file. This can result in severe I/O performance degradation. In this study, we propose a heterogeneity-aware region-level (HARL) data layout scheme to improve the data distribution of a hybrid PFS. HARL first divides a file into fine-grained, variable-sized regions according to changes in an application's I/O workload, and then chooses appropriate file stripe sizes on the heterogeneous servers for each file region based on server performance. Experimental results on representative benchmarks show that HARL can greatly improve I/O system performance.
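The two steps named in the abstract (region division, then per-region stripe sizing) can be illustrated with a minimal sketch. This record does not describe the paper's actual algorithm or cost model, so everything below is an assumption made for illustration: the `Server` class, the `split_into_regions` and `stripe_sizes` helpers, the 2x request-size threshold, the 4 KB stripe unit, and the bandwidth-proportional heuristic are all hypothetical and are not HARL's method.

```python
from dataclasses import dataclass


@dataclass
class Server:
    name: str
    bandwidth_mb_s: float  # hypothetical measured throughput, not from the paper


def split_into_regions(request_sizes, threshold=2.0):
    """Group consecutive request sizes (all > 0) into regions.

    Assumed heuristic: start a new region when a request differs from the
    first request of the current region by more than `threshold` times.
    """
    if not request_sizes:
        return []
    regions = []
    current = [request_sizes[0]]
    for size in request_sizes[1:]:
        base = current[0]
        ratio = max(size, base) / min(size, base)
        if ratio > threshold:
            regions.append(current)
            current = [size]
        else:
            current.append(size)
    regions.append(current)
    return regions


def stripe_sizes(avg_request_kb, servers, unit_kb=4):
    """Assign each server a stripe size proportional to its bandwidth.

    Sizes are rounded down to a multiple of `unit_kb`, with a floor of one
    unit, so a faster server receives a larger share of each request.
    """
    total_bw = sum(s.bandwidth_mb_s for s in servers)
    sizes = {}
    for s in servers:
        share = avg_request_kb * s.bandwidth_mb_s / total_bw
        sizes[s.name] = max(unit_kb, int(share // unit_kb) * unit_kb)
    return sizes


if __name__ == "__main__":
    servers = [Server("hdd0", 100.0), Server("ssd0", 300.0)]
    trace_kb = [64, 64, 64, 1024, 1024, 64]  # made-up request sizes in KB
    for region in split_into_regions(trace_kb):
        avg = sum(region) / len(region)
        print(region, stripe_sizes(avg, servers))
```

The point of the sketch is only the shape of the idea: a file is not given one global stripe size, and within each region the SSD server takes a larger stripe than the HDD server so that both finish their share of a request at roughly the same time.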
Pages: 340-349
Page count: 10