Combining Buffered I/O and Direct I/O in Distributed File Systems

被引：0

作者：

Qian, Yingjin ^{[1
]}

Vef, Marc-Andre ^{[2
]}

Farrell, Patrick ^{[3
]}

Dilger, Andreas ^{[3
]}

Li, Xi ^{[1
]}

Ihara, Shuichi ^{[1
]}

Fu, Yinjin ^{[4
]}

Xue, Wei ^{[5
,6
]}

Brinkmann, Andre ^{[2
]}

机构：

[1] Data Direct Networks, Chatsworth, CA 91311 USA

[2] Johannes Gutenberg Univ Mainz, Mainz, Germany

[3] Whamcloud Inc, Los Angeles, CA USA

[4] Sun Yat Sen Univ, Guangzhou, Peoples R China

[5] Tsinghua Univ, Beijing, Peoples R China

[6] Qinghai Univ, Xining, Peoples R China

来源：

PROCEEDINGS OF THE 21ST USENIX SYMPOSIUM ON NETWORKED SYSTEMS DESIGN AND IMPLEMENTATION, NSDI 24 | 2024年

基金：

欧盟地平线“2020”; 国家重点研发计划;

关键词：

D O I：

暂无

中图分类号：

TP301 [理论、方法];

学科分类号：

081202 ;

摘要：

Direct I/O allows I/O requests to bypass the Linux page cache and was introduced over 20 years ago as an alternative to the default buffered I/O mode. However, high-performance computing (HPC) applications still mostly rely on buffered I/O, even if direct I/O could perform better in a given situation. This is because users tend to use the I/O mode they are most familiar with. Moreover, with complex distributed file systems and applications, it is often unclear which I/O mode to use. In this paper, we show under which conditions both I/O modes are beneficial and present a new transparent approach that dynamically switches to each I/O mode within the file system. Its decision is based not only on the I/O size but also on file lock contention and memory constraints. We exemplary implemented our design into the Lustre client and server and extended it with additional features, e.g., delayed allocation. Under various conditions and real-world workloads, our approach achieved up to 3x higher throughput than the original Lustre and outperformed other distributed file systems that include varying degrees of direct I/O support by up to 13x.

引用

页码：17 / 33

页数：17

共 50 条

[21] I/O profiling for distributed IP storage systems
Han, JZ
Zhou, D
He, XB
Gao, JZ
ICESS 2005: SECOND INTERNATIONAL CONFERENCE ON EMBEDDED SOFTWARE AND SYSTEMS, 2005, : 581 - 586
[22] Parallel I/O for distributed systems: Issues and implementation
Sunderam, VS
Moyer, SA
FUTURE GENERATION COMPUTER SYSTEMS, 1996, 12 (01) : 25 - 38
[23] Scheduling Distributed I/O Resources in HPC Systems
Bandet, Alexis
Boito, Francieli
Pallez, Guillaume
EURO-PAR 2024: PARALLEL PROCESSING, PT I, EURO-PAR 2024, 2024, 14801 : 137 - 151
[24] Parallel I/O Prefetching Using MPI File Caching and I/O Signatures
Byna, Surendra
Chen, Yong
Sun, Xian-He
Thakur, Rajeev
Gropp, William
INTERNATIONAL CONFERENCE FOR HIGH PERFORMANCE COMPUTING, NETWORKING, STORAGE AND ANALYSIS, 2008, : 350 - +
[25] Automatic I/O scheduling algorithm selection for parallel file systems
Boito, Francieli Zanon
Kassick, Rodrigo Virote
Navaux, Philippe O. A.
Denneulin, Yves
CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2016, 28 (08): : 2457 - 2472
[26] The dispatch time aligning I/O scheduling for parallel file systems
Yonggang Liu
Jing Qin
Renato Figueiredo
Cluster Computing, 2015, 18 : 1025 - 1039
[27] Optimizing I/O Operations in File Systems for Fast Storage Devices
Son, Yongseok
Yeom, Heon Young
Han, Hyuck
IEEE TRANSACTIONS ON COMPUTERS, 2017, 66 (06) : 1071 - 1084
[28] The dispatch time aligning I/O scheduling for parallel file systems
Liu, Yonggang
Qin, Jing
Figueiredo, Renato
CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2015, 18 (03): : 1025 - 1039
[29] Reevaluation of Programmed I/O with Write-Combining Buffers to Improve I/O Performance on Cluster Systems
Larsen, Steen
Lee, Ben
PROCEEDINGS OF THE 2015 IEEE INTERNATIONAL CONFERENCE ON NETWORKING, ARCHITECTURE AND STORAGE (NAS), 2015, : 345 - 346
[30] OPTIMIZATION OF I/O SYSTEMS BY CACHE DISKS AND FILE MIGRATION - A SUMMARY
SMITH, AJ
PERFORMANCE EVALUATION, 1981, 1 (03) : 249 - 262

← 1 2 3 4 5 →