HDFS Copy Storage Improvement Strategy

被引:0
|
作者
Zhang, Jing [1 ]
Sun, Hongbo [1 ]
Yuan, ShiJing [1 ]
机构
[1] Nanjing Univ Posts & Telecommun, 9 Wenyuan Rd, Nanjing, Jiangsu, Peoples R China
关键词
Distributed file system; Copy policy; Response time; Cluster equalization;
D O I
10.1145/3358505.3358520
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
To improve the reliability of data storage, HDFS adopts the multicopy storage strategy of Rack-Aware. However, it does not consider the data block's access heat during the replication process, resulting in that in the case of high concurrent access, the client's request for the hot file cannot be responded quickly, and the storage cluster's I/O throughput rate is significantly reduced. Aiming at the defects of HDFS default replica replication strategy, this paper proposes a replica replication strategy based on block heat The results show that the improved strategy is better than the HDFS default strategy in load balancing performance, the average response time of read requests of the improved strategy is 28.46% lower than the default strategy, and the response speed is significantly better than the HDFS default strategy.
引用
收藏
页码:71 / 77
页数:7
相关论文
共 50 条
  • [1] HDFS efficiency storage strategy for big data in smart city
    Xiang, Min
    Jiang, Yuzhou
    Xia, Zhong
    Xu, Longzhang
    Huang, Chunmei
    [J]. 2019 CHINESE AUTOMATION CONGRESS (CAC2019), 2019, : 2394 - 2398
  • [2] Optimized storage strategy research of HDFS based on vandermonde code
    Song, Bao-Yan
    Wang, Jun-Lu
    Wang, Yan
    [J]. Jisuanji Xuebao/Chinese Journal of Computers, 2015, 38 (09): : 1825 - 1837
  • [3] HDFS Optimization Strategy Based On Hierarchical Storage of Hot and Cold Data
    Guan, Yuxin
    Ma, Zhiqiang
    Li, Leixiao
    [J]. 11TH CIRP CONFERENCE ON INDUSTRIAL PRODUCT-SERVICE SYSTEMS, 2019, 83 : 415 - 418
  • [4] RESEARCH AND IMPROVEMENT OF HDFS
    Tang, Xiaolong
    Tao, Zhongyu
    Tang, Panshi
    Li, Jianping
    [J]. 2014 11TH INTERNATIONAL COMPUTER CONFERENCE ON WAVELET ACTIVE MEDIA TECHNOLOGY AND INFORMATION PROCESSING (ICCWAMTIP), 2014, : 427 - 429
  • [5] A Virtual Shared Metadata Storage for HDFS
    Zhou, Jiang
    Chen, Yong
    Gu, Xiaoyan
    Wang, Weiping
    Meng, Dan
    [J]. PROCEEDINGS OF THE 2015 IEEE INTERNATIONAL CONFERENCE ON NETWORKING, ARCHITECTURE AND STORAGE (NAS), 2015, : 265 - 274
  • [6] Survey on Storage and Optimization Techniques of HDFS
    Jin, Guo-Dong
    Bian, Hao-Qiong
    Chen, Yue-Guo
    Du, Xiao-Yong
    [J]. Ruan Jian Xue Bao/Journal of Software, 2020, 31 (01): : 137 - 161
  • [7] A HDFS dynamic load balancing strategy using improved niche PSO algorithm in cloud storage
    Jian, Zhiyu
    Jian, Yiwei
    [J]. INTERNATIONAL JOURNAL OF AUTONOMOUS AND ADAPTIVE COMMUNICATIONS SYSTEMS, 2021, 14 (1-2) : 163 - 178
  • [8] HDFS Improvement Using Shortest Path Algorithms
    Eddoujaji, Mohamed
    Samadi, Hassan
    Bouhorma, Mohammed
    [J]. EMERGING TRENDS IN INTELLIGENT SYSTEMS & NETWORK SECURITY, 2023, 147 : 253 - 269
  • [9] Storage and Accessing Small Files Based on HDFS
    Mao, Yingchi
    Min, Wei
    [J]. PROCEEDINGS OF INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND INFORMATION TECHNOLOGY (CSAIT 2013), 2014, 255 : 565 - 573
  • [10] Optimizing the Storage of Massive Electronic Pedigrees in HDFS
    Zhang, Yin
    Han, Weili
    Wang, Wei
    Lei, Chang
    [J]. PROCEEDINGS OF 2012 INTERNATIONAL CONFERENCE ON THE INTERNET OF THINGS, 2012, : 68 - 75