How Fast Can One Scale Down a Distributed File System?

被引:0
|
作者
Cheriere, Nathanael [1 ]
Antoniu, Gabriel [2 ]
机构
[1] ENS Rennes, IRISA, Rennes, France
[2] INRIA, Rennes Bretagne Atlantique Res Ctr, Rennes, France
关键词
Elastic Storage; Distributed File System; Malleable File System; Model; Decommission;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
For efficient Big Data processing, efficient resource utilization becomes a major concern as large-scale computing infrastructures such as supercomputers or clouds keep growing in size. Naturally, energy and cost savings can be obtained by reducing idle resources. Malleability, which is the possibility for resource managers to dynamically increase or reduce the resources of jobs, appears as a promising means to progress towards this goal. However, state-of-the-art parallel and distributed file systems have not been designed with malleability in mind. This is mainly due to the supposedly high cost of storage decommission, which is considered to involve expensive data transfers. Nevertheless, as network and storage technologies evolve, old assumptions on potential bottlenecks can be revisited. In this study, we evaluate the viability of malleability as a design principle for a distributed file system. We specifically model the duration of the decommission operation, for which we obtain a theoretical lower bound. Then we consider HDFS as a use case and we show that our model can explain the measured decommission times. The existing decommission mechanism of HDFS is good when the network is the bottleneck, but could be accelerated by up to a factor 3 when the storage is the limiting factor. With the highlights provided by our model, we suggest improvements to speed up decommission in HDFS and we discuss open perspectives for the design of efficient malleable distributed file systems.
引用
收藏
页码:141 / 150
页数:10
相关论文
共 50 条
  • [41] VSFS: A Searchable Distributed File System
    Xu, Lei
    Huang, Ziling
    Jiang, Hong
    Tian, Lei
    Swanson, David
    2014 9TH PARALLEL DATA STORAGE WORKSHOP (PDSW), 2014, : 25 - 30
  • [42] A Distributed File System Based on HDFS
    Liu J.
    Leng F.-L.
    Li S.-Q.
    Bao Y.-B.
    Dongbei Daxue Xuebao/Journal of Northeastern University, 2019, 40 (06): : 795 - 800
  • [43] SingularFS: A Billion-Scale Distributed File System Using a Single Metadata Server
    Guo, Hao
    Lu, Youyou
    Lv, Wenhao
    Liao, Xiaojian
    Zeng, Shaoxun
    Shu, Jiwu
    PROCEEDINGS OF THE 2023 USENIX ANNUAL TECHNICAL CONFERENCE, 2023, : 915 - 928
  • [44] LPPFS: a Scale-out Distributed File System for Post-Petascale Systems
    Takatsu, Fuyumasa
    Hiraga, Kohei
    Tatebe, Osamu
    PROCEEDINGS OF 2016 IEEE 18TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS; IEEE 14TH INTERNATIONAL CONFERENCE ON SMART CITY; IEEE 2ND INTERNATIONAL CONFERENCE ON DATA SCIENCE AND SYSTEMS (HPCC/SMARTCITY/DSS), 2016, : 1477 - 1484
  • [45] Speculative execution in a distributed file system
    Nightingale, Edmund B.
    Chen, Peter M.
    Flinn, Jason
    ACM TRANSACTIONS ON COMPUTER SYSTEMS, 2006, 24 (04): : 361 - 392
  • [46] A Virtualized Hybrid Distributed File System
    Zhou, Xingyu
    He, Liang-yu
    2013 INTERNATIONAL CONFERENCE ON CYBER-ENABLED DISTRIBUTED COMPUTING AND KNOWLEDGE DISCOVERY (CYBERC), 2013, : 202 - 205
  • [47] Research on Distributed File System with Hadoop
    Xu, JunWu
    Liang, JunLing
    NETWORK COMPUTING AND INFORMATION SECURITY, 2012, 345 : 148 - +
  • [48] RECOVERABILITY ASPECTS OF A DISTRIBUTED FILE SYSTEM
    JEGADO, M
    SOFTWARE-PRACTICE & EXPERIENCE, 1983, 13 (01): : 33 - 44
  • [49] A Distributed File System for Intermittent Power
    Sharma, Navin
    Irwin, David
    Shenoy, Prashant
    2013 INTERNATIONAL GREEN COMPUTING CONFERENCE (IGCC), 2013,
  • [50] THE EDGE NODE FILE SYSTEM: A DISTRIBUTED FILE SYSTEM FOR HIGH PERFORMANCE COMPUTING
    Ponnavaikko, Kovendhan
    Janakiram, D.
    SCALABLE COMPUTING-PRACTICE AND EXPERIENCE, 2009, 10 (01): : 115 - 130