Clover: A distributed file system of expandable metadata service derived from HDFS

被引:12
|
作者
Wang, Youwei [1 ]
Zhou, Jiang [1 ]
Ma, Can [2 ]
Wang, Weiping [1 ]
Meng, Dan [2 ]
Kei, Jason [3 ]
机构
[1] Chinese Acad Sci, Inst Comp Technol, Integrat Applicat Ctr, Beijing, Peoples R China
[2] Chinese Acad Sci, Inst Informat Engn, Beijing, Peoples R China
[3] Tencent Corp, Shenzhen, Peoples R China
关键词
Distributed file system; Metadata service; Expansibility;
D O I
10.1109/CLUSTER.2012.54
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
To store and manage data efficiently is the critical issue which modern information infrastructures confront with. To accommodate the massive scale of data in the Internet environment, most common solutions utilize distributed file systems. However there still exist disadvantages preventing these systems from delivering satisfying performance. In this paper, we present a NameNode cluster file system based on HDFS, which is named Clover. This file system exploits two critical features: an improved 2PC protocol which ensures consistent metadata update on multiple metadata servers and a shared storage pool which provides robust persistent metadata storage and supports the operation of distributed transactions. Clover is compared with HDFS and its key virtues are shown. Further experimental results show our system can achieve better metadata expandability ranging from 10% to 90% by quantized metrics when each extra server is added, while preserving similar I/O performance.
引用
下载
收藏
页码:126 / 134
页数:9
相关论文
共 50 条
  • [41] Low-Latency and Scalable Full-path Indexing Metadata Service for Distributed File Systems
    Dong, Chao
    Wang, Fang
    Yang, Yuxin
    Lei, Mengya
    Zhang, Jianshun
    Feng, Dan
    2023 IEEE 41ST INTERNATIONAL CONFERENCE ON COMPUTER DESIGN, ICCD, 2023, : 283 - 290
  • [43] Storage Service Reliability and Availability Predictions of Hadoop Distributed File System
    Chattaraj, Durbadal
    Bhagat, Sumit
    Sarma, Monalisa
    RELIABILITY, SAFETY AND HAZARD ASSESSMENT FOR RISK-BASED TECHNOLOGIES, 2020, : 617 - 626
  • [44] The Netwarp service on UNIX system and its application to a high speed distributed file system
    Yoshikawa, T
    Onoda, T
    Tsujioka, T
    DIGITAL CONVERGENCE FOR CREATIVE DIVERGENCE, VOL I: TECHNICAL SPEECH SESSIONS, 1999, : 378 - 384
  • [45] Enabling Prioritized Cloud I/O Service in Hadoop Distributed File System
    Yeh, Tsozen
    Sun, Yifeng
    2014 IEEE INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS, 2014 IEEE 6TH INTL SYMP ON CYBERSPACE SAFETY AND SECURITY, 2014 IEEE 11TH INTL CONF ON EMBEDDED SOFTWARE AND SYST (HPCC,CSS,ICESS), 2014, : 256 - 259
  • [46] QMDS: a file system metadata management service supporting a graph data model-based query language
    Ames, Sasha
    Gokhale, Maya
    Maltzahn, Carlos
    INTERNATIONAL JOURNAL OF PARALLEL EMERGENT AND DISTRIBUTED SYSTEMS, 2013, 28 (02) : 159 - 183
  • [47] Ubiquitous service finder discovery of services semantically derived from metadata in ubiquitous computing
    Kawamura, T
    Ueno, K
    Nagano, S
    Hasegawa, T
    Ohsuga, A
    SEMANTIC WEB - ISWC 2005, PROCEEDINGS, 2005, 3729 : 902 - 915
  • [48] SDFS: Secure Distributed File System for Data-at-Rest Security for Hadoop-as-a-Service
    Zerfos, Petros
    Yeo, Hangu
    Paulovicks, Brent D.
    Sheinin, Vadim
    PROCEEDINGS 2015 IEEE INTERNATIONAL CONFERENCE ON BIG DATA, 2015, : 1262 - 1271
  • [49] Reclaiming space from duplicate files in a serverless distributed file system
    Douceur, JR
    Adya, A
    Bolosky, WJ
    Simon, D
    Theimer, M
    22ND INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS, PROCEEDINGS, 2002, : 617 - 624
  • [50] MISS-D: A fast and scalable framework of medical image storage service based on distributed file system
    Li, Wei
    Feng, Chaolu
    Yu, Kun
    Zhao, Dazhe
    COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2020, 186