Combining Buffered I/O and Direct I/O in Distributed File Systems

被引:0
|
作者
Qian, Yingjin [1 ]
Vef, Marc-Andre [2 ]
Farrell, Patrick [3 ]
Dilger, Andreas [3 ]
Li, Xi [1 ]
Ihara, Shuichi [1 ]
Fu, Yinjin [4 ]
Xue, Wei [5 ,6 ]
Brinkmann, Andre [2 ]
机构
[1] Data Direct Networks, Chatsworth, CA 91311 USA
[2] Johannes Gutenberg Univ Mainz, Mainz, Germany
[3] Whamcloud Inc, Los Angeles, CA USA
[4] Sun Yat Sen Univ, Guangzhou, Peoples R China
[5] Tsinghua Univ, Beijing, Peoples R China
[6] Qinghai Univ, Xining, Peoples R China
基金
欧盟地平线“2020”; 国家重点研发计划;
关键词
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Direct I/O allows I/O requests to bypass the Linux page cache and was introduced over 20 years ago as an alternative to the default buffered I/O mode. However, high-performance computing (HPC) applications still mostly rely on buffered I/O, even if direct I/O could perform better in a given situation. This is because users tend to use the I/O mode they are most familiar with. Moreover, with complex distributed file systems and applications, it is often unclear which I/O mode to use. In this paper, we show under which conditions both I/O modes are beneficial and present a new transparent approach that dynamically switches to each I/O mode within the file system. Its decision is based not only on the I/O size but also on file lock contention and memory constraints. We exemplary implemented our design into the Lustre client and server and extended it with additional features, e.g., delayed allocation. Under various conditions and real-world workloads, our approach achieved up to 3x higher throughput than the original Lustre and outperformed other distributed file systems that include varying degrees of direct I/O support by up to 13x.
引用
收藏
页码:17 / 33
页数:17
相关论文
共 50 条
  • [1] Combining Buffered I/O and Direct I/O in Distributed File Systems
    Qian, Yingjin
    Vef, Marc-Andre
    Farrell, Patrick
    Dilger, Andreas
    Li, Xi
    Ihara, Shuichi
    Fu, Yinjin
    Xue, Wei
    Brinkmann, Andre
    PROCEEDINGS OF THE 22ND USENIX CONFERENCE ON FILE AND STORAGE TECHNOLOGIES, FAST 24, 2024, : 17 - 33
  • [2] Efficient I/O Merging Scheme for Distributed File Systems
    An, Byoung Chul
    Sung, Hanul
    SYMMETRY-BASEL, 2023, 15 (02):
  • [3] Block I/O Scheduling on Storage Servers of Distributed File Systems
    Liao, Jianwei
    Yin, Dong
    Peng, Xiaoning
    JOURNAL OF GRID COMPUTING, 2018, 16 (02) : 299 - 316
  • [4] Block I/O Scheduling on Storage Servers of Distributed File Systems
    Jianwei Liao
    Dong Yin
    Xiaoning Peng
    Journal of Grid Computing, 2018, 16 : 299 - 316
  • [5] Streamlining distributed Deep Learning I/O with ad hoc file systems
    Schimmelpfennig, Frederic
    Vef, Marc-Andre
    Salkhordeh, Reza
    Miranda, Alberto
    Nou, Ramon
    Brinkmann, Andre
    2021 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING (CLUSTER 2021), 2021, : 169 - 180
  • [6] BUFFERED DISK I/O
    BLUM, R
    DR DOBBS JOURNAL, 1985, 10 (02): : 98 - &
  • [7] DFS-Container: Achieving Containerized Block I/O for Distributed File Systems
    Huang, Dan
    Wang, Jun
    Liu, Qing
    Zhang, Xuhong
    Chen, Xunchao
    Zhou, Jian
    PROCEEDINGS OF THE 2017 SYMPOSIUM ON CLOUD COMPUTING (SOCC '17), 2017, : 660 - 660
  • [8] Efficient distributed file I/O for visualization in grid environments
    Benger, W
    Hege, HC
    Merzky, A
    Radke, T
    Seidel, E
    SIMULATION AND VISUALIZATION ON THE GRID, PROCEEDINGS, 2000, 13 : 1 - 16
  • [9] Revitalizing Buffered I/O: Optimizing Page Reclaim and I/O Throttling
    Kim, Jongseok
    Yu, Chanu
    Seo, Euiseong
    2023 IEEE 41ST INTERNATIONAL CONFERENCE ON COMPUTER DESIGN, ICCD, 2023, : 475 - 482
  • [10] DCA-IO: A Dynamic I/O Control Scheme for Parallel and Distributed File Systems
    Kim, Sunggon
    Sim, Alex
    Wu, Kesheng
    Byna, Suren
    Wang, Teng
    Son, Yongseok
    Eom, Hyeonsang
    2019 19TH IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND GRID COMPUTING (CCGRID), 2019, : 351 - 360