How Fast Can One Scale Down a Distributed File System?

被引：0

作者：

Cheriere, Nathanael ^{[1
]}

Antoniu, Gabriel ^{[2
]}

机构：

[1] ENS Rennes, IRISA, Rennes, France

[2] INRIA, Rennes Bretagne Atlantique Res Ctr, Rennes, France

来源：

2017 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA) | 2017年

关键词：

Elastic Storage; Distributed File System; Malleable File System; Model; Decommission;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

For efficient Big Data processing, efficient resource utilization becomes a major concern as large-scale computing infrastructures such as supercomputers or clouds keep growing in size. Naturally, energy and cost savings can be obtained by reducing idle resources. Malleability, which is the possibility for resource managers to dynamically increase or reduce the resources of jobs, appears as a promising means to progress towards this goal. However, state-of-the-art parallel and distributed file systems have not been designed with malleability in mind. This is mainly due to the supposedly high cost of storage decommission, which is considered to involve expensive data transfers. Nevertheless, as network and storage technologies evolve, old assumptions on potential bottlenecks can be revisited. In this study, we evaluate the viability of malleability as a design principle for a distributed file system. We specifically model the duration of the decommission operation, for which we obtain a theoretical lower bound. Then we consider HDFS as a use case and we show that our model can explain the measured decommission times. The existing decommission mechanism of HDFS is good when the network is the bottleneck, but could be accelerated by up to a factor 3 when the storage is the limiting factor. With the highlights provided by our model, we suggest improvements to speed up decommission in HDFS and we discuss open perspectives for the design of efficient malleable distributed file systems.

引用

页码：141 / 150

页数：10

共 50 条

[41] VSFS: A Searchable Distributed File System
Xu, Lei
Huang, Ziling
Jiang, Hong
Tian, Lei
Swanson, David
2014 9TH PARALLEL DATA STORAGE WORKSHOP (PDSW), 2014, : 25 - 30
[42] A Distributed File System Based on HDFS
Liu J.
Leng F.-L.
Li S.-Q.
Bao Y.-B.
Dongbei Daxue Xuebao/Journal of Northeastern University, 2019, 40 (06): : 795 - 800
[43] SingularFS: A Billion-Scale Distributed File System Using a Single Metadata Server
Guo, Hao
Lu, Youyou
Lv, Wenhao
Liao, Xiaojian
Zeng, Shaoxun
Shu, Jiwu
PROCEEDINGS OF THE 2023 USENIX ANNUAL TECHNICAL CONFERENCE, 2023, : 915 - 928
[44] LPPFS: a Scale-out Distributed File System for Post-Petascale Systems
Takatsu, Fuyumasa
Hiraga, Kohei
Tatebe, Osamu
PROCEEDINGS OF 2016 IEEE 18TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS; IEEE 14TH INTERNATIONAL CONFERENCE ON SMART CITY; IEEE 2ND INTERNATIONAL CONFERENCE ON DATA SCIENCE AND SYSTEMS (HPCC/SMARTCITY/DSS), 2016, : 1477 - 1484
[45] Speculative execution in a distributed file system
Nightingale, Edmund B.
Chen, Peter M.
Flinn, Jason
ACM TRANSACTIONS ON COMPUTER SYSTEMS, 2006, 24 (04): : 361 - 392
[46] A Virtualized Hybrid Distributed File System
Zhou, Xingyu
He, Liang-yu
2013 INTERNATIONAL CONFERENCE ON CYBER-ENABLED DISTRIBUTED COMPUTING AND KNOWLEDGE DISCOVERY (CYBERC), 2013, : 202 - 205
[47] Research on Distributed File System with Hadoop
Xu, JunWu
Liang, JunLing
NETWORK COMPUTING AND INFORMATION SECURITY, 2012, 345 : 148 - +
[48] RECOVERABILITY ASPECTS OF A DISTRIBUTED FILE SYSTEM
JEGADO, M
SOFTWARE-PRACTICE & EXPERIENCE, 1983, 13 (01): : 33 - 44
[49] A Distributed File System for Intermittent Power
Sharma, Navin
Irwin, David
Shenoy, Prashant
2013 INTERNATIONAL GREEN COMPUTING CONFERENCE (IGCC), 2013,
[50] THE EDGE NODE FILE SYSTEM: A DISTRIBUTED FILE SYSTEM FOR HIGH PERFORMANCE COMPUTING
Ponnavaikko, Kovendhan
Janakiram, D.
SCALABLE COMPUTING-PRACTICE AND EXPERIENCE, 2009, 10 (01): : 115 - 130

← 1 2 3 4 5 →