Research of Massive Small Files Reading Optimization Based on Parallel Network File System

被引:1
|
作者
Yang, Hongzhang [1 ,2 ]
Zhang, Junwei [1 ]
Zeng, Xiangchao [1 ,2 ]
Dong, Huanqing [1 ]
Xu, Lu [1 ]
机构
[1] Chinese Acad Sci, Inst Comp Technol, Beijing, Peoples R China
[2] Univ Chinese Acad Sci, Beijing, Peoples R China
关键词
Small files; pre-read; pNFS; read optimization;
D O I
10.1109/HPCC-CSS-ICESS.2015.97
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
With the rapid development of cloud computing and big data, there are more and more small files. How to manage those massive small files efficiently and provide low-latency service is becoming a hot topic in Parallel Network File System (pNFS). When reading massive small files in pNFS, because metadata access frequency is fairly high, and disk efficiency is rather low, massive small file access performance is far lower than large file access performance. This paper presents an optimization mechanism for reading small files, including extended read dir delegation, radically metadata pre-read technology and large IO data pre-read technology between small files. These optimizations could significantly reduce the reading access latency and make full use of the client cache. The effectiveness of this optimization is proved with intensive experiments, when reading massive small files, compared with pNFS, the performance of metadata reading is 1959% higher, sequential data reading is 2436% higher, the random data reading performance is 1675% higher, and the overall performance is 1767% higher.
引用
收藏
页码:204 / 212
页数:9
相关论文
共 50 条
  • [31] Optimization Method for Storing Massive Small Files in Multi-modal Medical Data
    Zeng M.
    Zou B.-J.
    Zhang W.-S.
    Yang X.-B.
    Zhu C.-Z.
    Ruan Jian Xue Bao/Journal of Software, 2023, 34 (03): : 1451 - 1469
  • [32] Optimization of Small Sized File Access Efficiency in Hadoop Distributed File System by Integrating Virtual File System Layer
    Alange, Neeta
    Mathur, Anjali
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2022, 13 (06) : 204 - 210
  • [33] Enhancing HDFS with a full-text search system for massive small files
    Wentao Xu
    Xin Zhao
    Bin Lao
    Ge Nong
    The Journal of Supercomputing, 2021, 77 : 7149 - 7170
  • [34] Dynamic file prefetching scheme based on file access patterns in VIA-based parallel file system
    Lee, YY
    Kim, CY
    Seo, DW
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2002, E85D (04) : 714 - 721
  • [35] Enhancing HDFS with a full-text search system for massive small files
    Xu, Wentao
    Zhao, Xin
    Lao, Bin
    Nong, Ge
    JOURNAL OF SUPERCOMPUTING, 2021, 77 (07): : 7149 - 7170
  • [36] A parallel file system based on spatial information object
    Huang, KY
    Li, GQ
    Liu, DS
    Zhang, WY
    NETWORK AND PARALLEL COMPUTING, PROCEEDINGS, 2005, 3779 : 153 - 162
  • [37] PASS - A MULTIUSER PARALLEL FILE SYSTEM BASED ON MICROCOMPUTERS
    MILLER, LL
    INGLETT, SR
    HURSON, AR
    JOURNAL OF SYSTEMS AND SOFTWARE, 1992, 19 (01) : 75 - 83
  • [39] The Application and Research of parallel file system in electric power enterprise portal system
    Zhang Jingxin
    Zhao Yongbin
    Yu Liangliang
    2013 25TH CHINESE CONTROL AND DECISION CONFERENCE (CCDC), 2013, : 3451 - 3455
  • [40] Efficient Prefetching Technique for Storage of Heterogeneous small files in Hadoop Distributed File System Federation
    Aishwarya, K.
    Ram, Arvind A.
    Sreevatson, M. C.
    Babu, Chitra
    Prabavathy, B.
    2013 FIFTH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTING (ICOAC), 2013, : 523 - 530