Efficient structured data access in parallel file systems

被引:0
|
作者
Ching, A [1 ]
Choudhary, A [1 ]
Liao, WK [1 ]
Ross, R [1 ]
Gropp, W [1 ]
机构
[1] Northwestern Univ, Dept Elect & Comp Engn, Evanston, IL 60208 USA
关键词
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Parallel scientific applications store and retrieve very large, structured datasets. Directly supporting these structured accesses is an important step in providing high-performance I/O solutions for these applications. High-level interfaces such as HDF5 and Parallel netCDF provide convenient APIs for accessing structured datasets, and the MPI-IO interface also supports efficient access to structured data. However, parallel file systems do not traditionally support such access. In this work, we present an implementation of structured data access support in the context of the Parallel Virtual File System (PVFS). We call this support "datatype I/O" because of its similarity to MPI datatypes. This support is built by using a reusable datatype-processing component from the MPICH2 MPI implementation. We describe how this component is leveraged to efficiently process structured data representations resulting from MPI-IO operations. We quantitatively assess the solution using three test applications. We also point to further optimizations in the processing path that could be leveraged for even more efficient operation.
引用
收藏
页码:326 / 335
页数:10
相关论文
共 50 条
  • [1] Small-File Access in Parallel File Systems
    Carns, Philip
    Lang, Sam
    Ross, Robert
    Vilayannur, Murali
    Kunkel, Julian
    Ludwig, Thomas
    [J]. 2009 IEEE INTERNATIONAL SYMPOSIUM ON PARALLEL & DISTRIBUTED PROCESSING, VOLS 1-5, 2009, : 524 - +
  • [2] Opass: Analysis and Optimization of Parallel Data Access on Distributed File Systems
    Yin, Jiangling
    Wang, Jun
    Zhou, Jian
    Lukasiewicz, Tyler
    Huang, Dan
    Zhang, Junyao
    [J]. 2015 IEEE 29TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM (IPDPS), 2015, : 623 - 632
  • [3] Achieving Load Balance for Parallel Data Access on Distributed File Systems
    Huang, Dan
    Han, Dezhi
    Wang, Jun
    Yin, Jiangling
    Chen, Xunchao
    Zhang, Xuhong
    Zhou, Jian
    Ye, Mao
    [J]. IEEE TRANSACTIONS ON COMPUTERS, 2018, 67 (03) : 388 - 402
  • [4] Efficient access control for distributed hierarchical file systems
    Pollack, KT
    Brandt, SA
    [J]. TWENTY-SECOND IEEE/THIRTEENTH NASA GODDARD CONFERENCE ON MASS STORAGE SYSTEMS AND TECHNOLOGIES, PROCEEDINGS: INFORMATION RETRIEVAL FROM VERY LARGE STORAGE SYSTEMS, 2005, : 253 - 260
  • [5] Structured Data Access Annotations for Massively Parallel Computations
    Aldinucci, Marco
    Campa, Sonia
    Kilpatrick, Peter
    Torquati, Massimo
    [J]. EURO-PAR 2012: PARALLEL PROCESSING WORKSHOPS, 2013, 7640 : 381 - 390
  • [6] PABIRS: A Data Access Middleware for Distributed File Systems
    Wu, Sai
    Chen, Gang
    Zhou, Xianke
    Zhang, Zhenjie
    Tung, Anthony K. H.
    Winslett, Marianne
    [J]. 2015 IEEE 31ST INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE), 2015, : 113 - 124
  • [7] Efficient and Effective File Replication in Structured P2P File Sharing Systems
    Shen, Haiying
    [J]. 2009 IEEE NINTH INTERNATIONAL CONFERENCE ON PEER-TO-PEER COMPUTING (P2P 2009), 2009, : 159 - 162
  • [8] Parallel file systems
    Kuhn M.
    [J]. Informatik Spektrum, 2019, 42 (5) : 360 - 364
  • [9] SeqDLM: A Sequencer-Based Distributed Lock Manager for Efficient Shared File Access in a Parallel File System
    Chen, Qi
    Ma, Shaonan
    Chen, Kang
    Ma, Teng
    Liu, Xin
    Chen, Dexun
    Wu, Yongwei
    Chen, Zuoning
    [J]. SC22: INTERNATIONAL CONFERENCE FOR HIGH PERFORMANCE COMPUTING, NETWORKING, STORAGE AND ANALYSIS, 2022,
  • [10] The use of locality information on data intensive parallel file systems
    Sugawara Junior, Ricardo Ryoiti
    Sato, Liria Matsumoto
    [J]. 2013 IEEE 16TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND ENGINEERING (CSE 2013), 2013, : 167 - 173