An Improved data placement strategy in a heterogeneous hadoop cluster

被引:0
|
作者
Zhao, Wentao [1 ,2 ]
Meng, Lingjun [1 ]
Sun, Jiangfeng [1 ,2 ]
Ding, Yang [1 ]
Zhao, Haohao [1 ]
Wang, Lina [1 ,2 ]
机构
[1] School of Computer Science and Technology, Henan Polytechnic University, Jiaozuo, China
[2] Opening Project of Key Laboratory of Mine Informatization, Henan Polytechnic University, Jiaozuo,Henan, China
来源
关键词
Big data;
D O I
10.2174/1874110X01509010792
中图分类号
学科分类号
摘要
Hadoop Distributed File System (HDFS) is designed to store big data reliably, and to stream these data at high bandwidth to user applications. However, the default HDFS block placement policy assumes that all nodes in the cluster are homogeneous, and randomly place blocks without considering any nodes’ resource characteristics, which decreases self-adaptability of the system. In this paper, we take account nodes heterogeneities, such as utilization of nodes’ disk space, and put forward an improved blocks placement strategy for solving some drawbacks in the default HDFS. The simulation experiments indicate that our improved strategy performs much better not only in the data distribution but also significantly saves more time than the default blocks placement. © Zhao et al.; Licensee Bentham Open.
引用
收藏
页码:792 / 798
相关论文
共 50 条
  • [1] An improved data placement strategy in a heterogeneous Hadoop cluster
    Zhao, Wentao
    Meng, Lingjun
    Sun, Jiangfeng
    Ding, Yang
    Zhao, Haohao
    Wang, Lina
    [J]. Open Cybernetics and Systemics Journal, 2014, 8 (01): : 957 - 963
  • [2] An improved data placement strategy for hadoop
    Lin, Wei-Wei
    [J]. Huanan Ligong Daxue Xuebao/Journal of South China University of Technology (Natural Science), 2012, 40 (01): : 152 - 158
  • [3] A Dynamic Data Placement Policy for Heterogeneous Hadoop Cluster
    Shithil, Santa Maria
    Saha, Tushar Kanti
    Sharma, Tanusree
    [J]. 2017 4TH INTERNATIONAL CONFERENCE ON ADVANCES IN ELECTRICAL ENGINEERING (ICAEE), 2017, : 302 - 307
  • [4] On a Dynamic Data Placement Strategy for Heterogeneous Hadoop Clusters
    Liu, Yang
    Wu, Chase Q.
    Wang, Meng
    Hou, Aiqin
    Wang, Yongqiang
    [J]. 2018 INTERNATIONAL SYMPOSIUM ON NETWORKS, COMPUTERS AND COMMUNICATIONS (ISNCC 2018), 2018,
  • [5] A Dynamic Data Placement Strategy for Hadoop in Heterogeneous Environments
    Lee, Chia-Wei
    Hsieh, Kuang-Yu
    Hsieh, Sun-Yuan
    Hsiao, Hung-Chang
    [J]. BIG DATA RESEARCH, 2014, 1 : 14 - 22
  • [6] Data Prefetching for Heterogeneous Hadoop Cluster
    Vinutha, D. C.
    Raju, G. T.
    [J]. 2019 5TH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTING & COMMUNICATION SYSTEMS (ICACCS), 2019, : 554 - 558
  • [7] Optimizing data placement in heterogeneous Hadoop clusters
    Runqun Xiong
    Junzhou Luo
    Fang Dong
    [J]. Cluster Computing, 2015, 18 : 1465 - 1480
  • [8] Optimizing data placement in heterogeneous Hadoop clusters
    Xiong, Runqun
    Luo, Junzhou
    Dong, Fang
    [J]. CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2015, 18 (04): : 1465 - 1480
  • [9] New Data Placement Strategy in the HADOOP Framework
    Elomari, Akram
    Hassouni, Larbi
    Maizate, Abderrahim
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2021, 12 (07) : 676 - 684
  • [10] HaDaap: A hotness-aware data placement strategy for improving storage efficiency in heterogeneous Hadoop clusters
    Xiong, Runqun
    Du, Yao
    Jin, Jiahui
    Luo, Junzhou
    [J]. CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2018, 30 (20):