Research of Massive Small Files Reading Optimization Based on Parallel Network File System

被引:1
|
作者
Yang, Hongzhang [1 ,2 ]
Zhang, Junwei [1 ]
Zeng, Xiangchao [1 ,2 ]
Dong, Huanqing [1 ]
Xu, Lu [1 ]
机构
[1] Chinese Acad Sci, Inst Comp Technol, Beijing, Peoples R China
[2] Univ Chinese Acad Sci, Beijing, Peoples R China
关键词
Small files; pre-read; pNFS; read optimization;
D O I
10.1109/HPCC-CSS-ICESS.2015.97
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
With the rapid development of cloud computing and big data, there are more and more small files. How to manage those massive small files efficiently and provide low-latency service is becoming a hot topic in Parallel Network File System (pNFS). When reading massive small files in pNFS, because metadata access frequency is fairly high, and disk efficiency is rather low, massive small file access performance is far lower than large file access performance. This paper presents an optimization mechanism for reading small files, including extended read dir delegation, radically metadata pre-read technology and large IO data pre-read technology between small files. These optimizations could significantly reduce the reading access latency and make full use of the client cache. The effectiveness of this optimization is proved with intensive experiments, when reading massive small files, compared with pNFS, the performance of metadata reading is 1959% higher, sequential data reading is 2436% higher, the random data reading performance is 1675% higher, and the overall performance is 1767% higher.
引用
收藏
页码:204 / 212
页数:9
相关论文
共 50 条
  • [1] A KIND OF DISTRIBUTED FILE SYSTEM BASED ON MASSIVE SMALL FILES STORAGE
    Liu, Di
    Kuang, Shi-Jie
    2012 INTERNATIONAL CONFERENCE ON WAVELET ACTIVE MEDIA TECHNOLOGY AND INFORMATION PROCESSING (LCWAMTIP), 2012, : 394 - 397
  • [2] Performance Optimization for Managing Massive Numbers of Small Files in Distributed File Systems
    Fu, Songling
    He, Ligang
    Huang, Chenlin
    Liao, Xiangke
    Li, Kenli
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2015, 26 (12) : 3433 - 3448
  • [3] FlatLFS: A lightweight file system for optimizing the performance of accessing massive small files
    Fu, Songling
    Liao, Xiangke
    Huang, Chenlin
    Wang, Lei
    Li, Shanshan
    Guofang Keji Daxue Xuebao/Journal of National University of Defense Technology, 2013, 35 (02): : 120 - 126
  • [4] Research of Distributed File System Based on Massive Resources and Application in the Network Teaching System
    Chen, Ping
    Li, Jianwei
    Gou, Xuerong
    PROCEEDINGS OF 2011 INTERNATIONAL CONFERENCE ON ADVANCED INTELLIGENCE AND AWARENESS INTERNET, IET AIAI2011, 2011, : 154 - 158
  • [5] iFlatLFS: Performance Optimization for Accessing Massive Small Files
    Fu, Songling
    Huang, Chenlin
    He, Ligang
    Chaudhary, Nadeem
    Liao, Xiangke
    Yang, Shazhou
    Wang, Xiaochuan
    Li, Bao
    2013 20TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING (HIPC), 2013, : 10 - 19
  • [6] The Optimization Scheme Research of Small Files Storage Based on HDFS
    Mu, Qi
    Jia, Yikai
    Luo, Bibo
    2015 8TH INTERNATIONAL SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND DESIGN (ISCID), VOL 1, 2015, : 431 - 434
  • [7] A network file system over HTTP:: remote access and modification of files and files
    Kiselyov, O
    PROCEEDINGS OF THE FREENIX TRACK: 1999 USENIX ANNUAL TECHNICAL CONFERENCE, 1999, : 75 - 80
  • [8] Dealing with Small Files Problem in Hadoop Distributed File System
    Bende, Sachin
    Shedge, Ashree
    PROCEEDINGS OF INTERNATIONAL CONFERENCE ON COMMUNICATION, COMPUTING AND VIRTUALIZATION (ICCCV) 2016, 2016, 79 : 1001 - 1012
  • [9] Storage-Optimization Method for Massive Small Files of Agricultural Resources Based on Hadoop
    Liu, Jun
    JOURNAL OF ADVANCED COMPUTATIONAL INTELLIGENCE AND INTELLIGENT INFORMATICS, 2019, 23 (04) : 634 - 640
  • [10] Hadoop Massive Small File Merging Technology Based on Visiting Hot-Spot and Associated File Optimization
    Peng, Jian-Feng
    Wei, Wen-Guo
    Zhao, Hui-Min
    Dai, Qing-Yun
    Xie, Gui-Yuan
    Cai, Jun
    He, Ke-Jing
    ADVANCES IN BRAIN INSPIRED COGNITIVE SYSTEMS, BICS 2018, 2018, 10989 : 517 - 524