Combining Buffered I/O and Direct I/O in Distributed File Systems

Authors
Qian, Yingjin [1]
Vef, Marc-Andre [2]
Farrell, Patrick [3]
Dilger, Andreas [3]
Li, Xi [1]
Ihara, Shuichi [1]
Fu, Yinjin [4]
Xue, Wei [5, 6]
Brinkmann, Andre [2]
Affiliations
[1] Data Direct Networks, Chatsworth, CA 91311 USA
[2] Johannes Gutenberg Univ Mainz, Mainz, Germany
[3] Whamcloud Inc, Los Angeles, CA USA
[4] Sun Yat Sen Univ, Guangzhou, Peoples R China
[5] Tsinghua Univ, Beijing, Peoples R China
[6] Qinghai Univ, Xining, Peoples R China
Funding
EU Horizon 2020; National Key Research and Development Program
Keywords
DOI
Not available
Chinese Library Classification number
TP301 [Theory, methods]
Discipline classification code
081202
Abstract
Direct I/O allows I/O requests to bypass the Linux page cache and was introduced over 20 years ago as an alternative to the default buffered I/O mode. However, high-performance computing (HPC) applications still mostly rely on buffered I/O, even when direct I/O could perform better in a given situation. This is because users tend to use the I/O mode they are most familiar with. Moreover, with complex distributed file systems and applications, it is often unclear which I/O mode to use. In this paper, we show the conditions under which each I/O mode is beneficial and present a new transparent approach that dynamically switches between the two I/O modes within the file system. Its decision is based not only on the I/O size but also on file lock contention and memory constraints. As an example, we implemented our design in the Lustre client and server and extended it with additional features, e.g., delayed allocation. Under various conditions and real-world workloads, our approach achieved up to 3x higher throughput than the original Lustre and outperformed other distributed file systems that include varying degrees of direct I/O support by up to 13x.
Pages: 17-33
Page count: 17