MetaFlow: A Scalable Metadata Lookup Service for Distributed File Systems in Data Centers

Cited by: 2
Authors
Sun, Peng [1 ]
Wen, Yonggang [2 ]
Duong Nguyen Binh Ta [2 ]
Xie, Haiyong [3 ]
Affiliations
[1] Nanyang Technol Univ, Energy Res Inst, Interdisciplinary Grad Sch, Singapore 639798, Singapore
[2] Nanyang Technol Univ, Sch Comp Sci & Engn, Singapore 639798, Singapore
[3] China Acad Elect & Informat Technol, Beijing 100041, Peoples R China
Keywords
Metadata management; software-defined networking; B-tree; big data; MANAGEMENT;
DOI
10.1109/TBDATA.2016.2612241
Chinese Library Classification number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
In large-scale distributed file systems, efficient metadata operations are critical because most file operations must first interact with metadata servers. In existing distributed hash table (DHT) based metadata management systems, the lookup service can become a performance bottleneck due to its significant CPU overhead. Our investigations showed that the lookup service can reduce system throughput by up to 70 percent and increase system latency by a factor of up to 8 compared to ideal scenarios. In this paper, we present MetaFlow, a scalable metadata lookup service that uses software-defined networking (SDN) techniques to distribute the lookup workload over network components. MetaFlow tackles the lookup bottleneck by leveraging a B-tree, constructed over the physical topology, to manage flow tables for SDN-enabled switches. As a result, metadata requests can be forwarded to the appropriate servers using only switches. Extensive performance evaluations in both simulations and a testbed showed that MetaFlow increases system throughput by a factor of up to 3.2 and reduces system latency by a factor of up to 5 compared to DHT-based systems. We also deployed MetaFlow in a distributed file system and demonstrated significant performance improvement.
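The abstract's central idea, switches that forward a metadata request by matching its key against range entries arranged like a B-tree over the topology, can be illustrated with a small sketch. This is not the authors' implementation: the two-level core/edge topology, the even split of the key space, the CRC32-based `metadata_id`, and every name below are assumptions made purely for illustration.

```python
import bisect
import zlib
from dataclasses import dataclass, field
from typing import List, Union

KEY_SPACE = 2 ** 32  # hypothetical metadata ID space (matches CRC32 output)


@dataclass
class Server:
    name: str


@dataclass
class Switch:
    name: str
    # upper_bounds[i] is the exclusive upper bound of the key range
    # forwarded to children[i]; together they act as a toy flow table.
    upper_bounds: List[int] = field(default_factory=list)
    children: List[Union["Switch", Server]] = field(default_factory=list)

    def next_hop(self, key: int) -> Union["Switch", Server]:
        # Same step as a B-tree node search: pick the first range
        # whose upper bound is greater than the key.
        return self.children[bisect.bisect_right(self.upper_bounds, key)]


def build_topology(num_edge: int, servers_per_edge: int) -> Switch:
    """Build a core switch over edge switches, splitting keys evenly (assumed)."""
    total = num_edge * servers_per_edge
    step = KEY_SPACE // total
    root = Switch("core")
    count = 0
    for e in range(num_edge):
        edge = Switch(f"edge{e}")
        for _ in range(servers_per_edge):
            count += 1
            # The last server absorbs any remainder of the key space.
            bound = KEY_SPACE if count == total else count * step
            edge.upper_bounds.append(bound)
            edge.children.append(Server(f"mds{count - 1}"))
        # The core only needs the largest key reachable through each edge switch.
        root.upper_bounds.append(edge.upper_bounds[-1])
        root.children.append(edge)
    return root


def metadata_id(path: str) -> int:
    """Map a file path to a key in [0, KEY_SPACE); the hash choice is an assumption."""
    return zlib.crc32(path.encode("utf-8"))


def route(root: Switch, key: int) -> List[str]:
    """Return the names of the hops a request with this key traverses."""
    hops = []
    node: Union[Switch, Server] = root
    while isinstance(node, Switch):
        hops.append(node.name)
        node = node.next_hop(key)
    hops.append(node.name)
    return hops


if __name__ == "__main__":
    topo = build_topology(num_edge=2, servers_per_edge=2)
    print(route(topo, metadata_id("/data/logs/part-0001")))
```

In this sketch no metadata server takes part in the lookup: every forwarding decision is a range match at a switch, which is the property the abstract attributes to MetaFlow. A real deployment would express the ranges as flow-table rules installed by an SDN controller rather than as Python lists.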
Pages: 203-216
Number of pages: 14