Erasure Code of Small File in a Distributed File System

被引:0
|
作者
Chen, Xinhai [1 ]
Liu, Jie [1 ]
Xie, Peizhen [1 ]
机构
[1] Natl Univ Def Technol, Sci & Technol Parallel & Distributed Proc Lab, Changsha, Hunan, Peoples R China
基金
中国博士后科学基金; 国家高技术研究发展计划(863计划); 中国国家自然科学基金;
关键词
Small file; index file; erasure code; block; distributed storage system;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
With the development of Internet applications, the data storage of many applications has some new characters. There always exists a huge amount of pictures in distributed file systems, and the size of these pictures is usually no more than 1M bytes. To meet the demands in small file storage system which contains so many pictures, we provide a kind of distributed file system to store large amounts of small files. In our system, small files (actual data files) are merged into large files (default 64M) which we call block, and we use two-indexed way to reduce the pressure of nameserver, which maintains the information of correspondence between blocks and dataservers. The result shows that the memory usage of block information in nameserver is less than 2G when the system capacity is 1PB. It shows good scalability. Meanwhile, to reduce the cost of storage, we introduce the technique of erasure code which is an alternative offers the same data protection but reduces significantly the storage consumption. After grouping, the cost of storage dropped by 25% compared with 2 replications.
引用
收藏
页码:2549 / 2554
页数:6
相关论文
共 50 条
  • [1] Decentralised erasure code for Hadoop distributed cloud file systems
    Mohana Prasad, K.
    Kiriti, S.
    Reddy, V.T. Sudharshan
    John, Albert Mayan
    [J]. International Journal of Cloud Computing, 2022, 11 (5-6) : 552 - 559
  • [2] ROVER: Robust and Verifiable Erasure Code for Hadoop Distributed File Systems
    Wang, Teng
    Nam Son Nguyen
    Wang, Jiayin
    Li, Tengpeng
    Zhang, Xiaoqian
    Mi, Ningfang
    Zhao, Bin
    Sheng, Bo
    [J]. 2018 27TH INTERNATIONAL CONFERENCE ON COMPUTER COMMUNICATION AND NETWORKS (ICCCN), 2018,
  • [3] A Model of Erasure-Coding-Based Distributed File System
    Han, Yuanbo
    Li, Hui
    Hou, Hanxu
    [J]. IEEE ASIA PACIFIC CLOUD COMPUTING CONGRESS 2012, 2012, : 90 - 93
  • [4] An effective strategy for improving small file problem in distributed file system
    Wang, Tao
    Yao, Shihong
    Xu, Zhengquan
    Xiong, Lian
    Gu, Xin
    Yang, Xiping
    [J]. 2015 2ND INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE AND CONTROL ENGINEERING ICISCE 2015, 2015, : 122 - 126
  • [5] Performance Study on Indexing and Accessing of Small File in Hadoop Distributed File System
    Rodrigues, Anisha P.
    Fernandes, Roshan
    Vijaya, P.
    Chander, Satish
    [J]. JOURNAL OF INFORMATION & KNOWLEDGE MANAGEMENT, 2021, 20 (04)
  • [6] Optimization of Small Sized File Access Efficiency in Hadoop Distributed File System by Integrating Virtual File System Layer
    Alange, Neeta
    Mathur, Anjali
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2022, 13 (06) : 204 - 210
  • [7] Write Bandwidth Optimization of Online Erasure Code Based Cluster File System
    Yan, Lin
    Xing, Jing
    Wang, Tian
    Huo, Zhigang
    Ma, Jie
    Zhang, Peiheng
    [J]. 2013 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING (CLUSTER), 2013,
  • [8] A Small File Performance Optimization Algorithm on P2P Distributed File System
    Zhang, Yuchang
    Zhang, Qifei
    Chen, Yingzhuang
    Tu, Chaofan
    Liu, Erteng
    Ren, Jie
    Xue, Yinchao
    [J]. 2015 IEEE 16TH INTERNATIONAL CONFERENCE ON COMMUNICATION TECHNOLOGY (ICCT), 2015, : 487 - 492
  • [9] Optimizing file availability in a secure serverless distributed file system
    Douceur, JR
    Wattenhofer, RP
    [J]. 20TH IEEE SYMPOSIUM ON RELIABLE DISTRIBUTED SYSTEMS, PROCEEDINGS, 2001, : 4 - 13
  • [10] A Distributed File System for Frequency Reading of Various File Sizes
    Ma, Pengfei
    Yin, Yanshen
    Lan, Chao
    Zhang, Yong
    Xing, Chunxiao
    [J]. 2013 10TH WEB INFORMATION SYSTEM AND APPLICATION CONFERENCE (WISA 2013), 2013, : 339 - +