Cost-Aware Region-Level Data Placement in Multi-Tiered Parallel I/O Systems

被引:10
|
作者
He, Shuibing [1 ,2 ]
Wang, Yang [2 ]
Li, Zheng [3 ]
Sun, Xian-He [4 ]
Xu, Chenzhong [2 ]
机构
[1] Wuhan Univ, Comp Sch, State Key Lab Software Engn, Wuhan 430072, Hubei, Peoples R China
[2] Chinese Acad Sci, Shenzhen Inst Adv Technol, Xueyuan Blvd 1068, Shenzhen 518055, Peoples R China
[3] Western Illinois Univ, Sch Comp Sci, Macomb, IL 61455 USA
[4] IIT, Dept Comp Sci, Chicago, IL 60616 USA
基金
美国国家科学基金会;
关键词
Parallel I/O system; parallel file system; data placement; solid state drive; SCHEME; CACHE;
D O I
10.1109/TPDS.2016.2636837
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Multi-tiered Parallel I/O systems that combine traditional HDDs with emerging SSDs mitigate the cost burden of SSDs while benefiting from their superior I/O performance. While a multi-tiered parallel I/O system is promising for data-intensive applications in high-performance (HPC) domains, placing data on each tier of the system to achieve high I/O performance remains a challenge. In this paper, we propose a cost-aware region-level (CARL) data placement scheme in multi-tiered parallel I/O systems. CARL divides a large file into several small regions, and then places regions on different types of servers based on region access costs. CARL includes a static policy S-CARL and a dynamic policy D-CARL. For applications whose I/O access patterns are completely known, S-CARL calculates the region costs within the entire workload duration, and uses a static data placement scheme to selectively place regions on the proper servers. To adapt to applications whose access patterns are unknown in advance, D-CARL uses a dynamic data placement scheme which migrates data among different servers within each time window. We have implemented CARL under MPI-IO library and OrangeFS parallel file system environment. Our evaluation with representative benchmarks and an application shows that CARL is both feasible and able to improve I/O performance significantly.
引用
收藏
页码:1853 / 1865
页数:13
相关论文
共 13 条
  • [1] A Cost-Aware Region-Level Data Placement Scheme for Hybrid Parallel I/O Systems
    He, Shuibing
    Sun, Xian-He
    Feng, Bo
    Huang, Xin
    Feng, Kun
    [J]. 2013 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING (CLUSTER), 2013,
  • [2] Load balancing and data placement for multi-tiered database systems
    Li, Wen-Syan
    Zilio, Daniel C.
    Batra, Vishal S.
    Zuzarte, Calisto
    Narang, Inderpal
    [J]. DATA & KNOWLEDGE ENGINEERING, 2007, 62 (03) : 523 - 546
  • [3] A Heterogeneity-Aware Region-Level Data Layout for Hybrid Parallel File Systems
    He, Shuibing
    Sun, Xian-He
    Wang, Yang
    Kougkas, Antonis
    Haider, Adnan
    [J]. 2015 44TH INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING (ICPP), 2015, : 340 - 349
  • [4] HARL: Optimizing Parallel File Systems with Heterogeneity-Aware Region-Level Data Layout
    He, Shuibing
    Wang, Yang
    Sun, Xian-He
    Xu, Chengzhong
    [J]. IEEE TRANSACTIONS ON COMPUTERS, 2017, 66 (06) : 1048 - 1060
  • [5] Predicting file lifetimes for data placement in multi-tiered storage systems for HPC
    Thomas, Luis
    Gougeaud, Sebastien
    Rubini, Stephane
    Deniel, Philippe
    Boukhobza, Jalil
    [J]. OPERATING SYSTEMS REVIEW, 2021, 55 (01) : 99 - 107
  • [6] Predicting file lifetimes for data placement in multi-Tiered storage systems for HPC
    Thomas L.
    Gougeaud S.
    Rubini S.
    Deniel P.
    Boukhobza J.
    [J]. Operating Systems Review (ACM), 2021, 55 (01): : 99 - 107
  • [7] I/O Acceleration via Multi-Tiered Data Buffering and Prefetching
    Kougkas, Anthony
    Devarajan, Hariharan
    Sun, Xian-He
    [J]. JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2020, 35 (01) : 92 - 120
  • [8] I/O Acceleration via Multi-Tiered Data Buffering and Prefetching
    Anthony Kougkas
    Hariharan Devarajan
    Xian-He Sun
    [J]. Journal of Computer Science and Technology, 2020, 35 : 92 - 120
  • [9] Hermes: A Heterogeneous-Aware Multi-Tiered Distributed I/O Buffering System
    Kougkas, Anthony
    Devarajan, Hariharan
    Sun, Xian-He
    [J]. HPDC '18: PROCEEDINGS OF THE 27TH INTERNATIONAL SYMPOSIUM ON HIGH-PERFORMANCE PARALLEL AND DISTRIBUTED COMPUTING: POSTERS/DOCTORAL CONSORTIUM, 2018, : 7 - 8
  • [10] Hermes: A Heterogeneous-Aware Multi-Tiered Distributed I/O Buffering System
    Kougkas, Anthony
    Devarajan, Hariharan
    Sun, Xian-He
    [J]. HPDC '18: PROCEEDINGS OF THE 27TH INTERNATIONAL SYMPOSIUM ON HIGH-PERFORMANCE PARALLEL AND DISTRIBUTED COMPUTING, 2018, : 219 - 230