MetaFlow: A Scalable Metadata Lookup Service for Distributed File Systems in Data Centers

被引:2
|
作者
Sun, Peng [1 ]
Wen, Yonggang [2 ]
Duong Nguyen Binh Ta [2 ]
Xie, Haiyong [3 ]
机构
[1] Nanyang Technol Univ, Energy Res Inst, Interdisciplinary Grad Sch, Singapore 639798, Singapore
[2] Nanyang Technol Univ, Sch Comp Sci & Engn, Singapore 639798, Singapore
[3] China Acad Elect & Informat Technol, Beijing 100041, Peoples R China
关键词
Metadata management; software-defined networking; B-tree; big data; MANAGEMENT;
D O I
10.1109/TBDATA.2016.2612241
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In large-scale distributed file systems, efficient metadata operations are critical since most file operations have to interact with metadata servers first. In existing distributed hash table (DHT) based metadata management systems, the lookup service could be a performance bottleneck due to its significant CPU overhead. Our investigations showed that the lookup service could reduce system throughput by up to 70 percent, and increase system latency by a factor of up to 8 compared to ideal scenarios. In this paper, we present MetaFlow, a scalable metadata lookup service utilizing software-defined networking (SDN) techniques to distribute lookup workload over network components. MetaFlow tackles the lookup bottleneck problem by leveraging B-tree, which is constructed over the physical topology, to manage flow tables for SDN-enabled switches. Therefore, metadata requests can be forwarded to appropriate servers using only switches. Extensive performance evaluations in both simulations and testbed showed that MetaFlow increases system throughput by a factor of up to 3.2, and reduce system latency by a factor of up to 5 compared to DHT-based systems. We also deployed MetaFlow in a distributed file system, and demonstrated significant performance improvement.
引用
收藏
页码:203 / 216
页数:14
相关论文
共 50 条
  • [41] Two-level Hash/Table approach for metadata management in distributed file systems
    Diaz, Antonio F.
    Anguita, Mancia
    Camacho, Hugo E.
    Nieto, Erik
    Ortega, Julio
    [J]. JOURNAL OF SUPERCOMPUTING, 2013, 64 (01): : 144 - 155
  • [42] Scalable Data Management in Distributed Information Systems
    Remedios Pallardo-Lozoya, M.
    Esparza-Peidro, Javier
    Garcia-Escriva, Jose-Ramon
    Decker, Hendrik
    Munoz-Escoi, Francesc D.
    [J]. ON THE MOVE TO MEANINGFUL INTERNET SYSTEMS: OTM 2011 WORKSHOPS, 2011, 7046 : 208 - 217
  • [43] PABIRS: A Data Access Middleware for Distributed File Systems
    Wu, Sai
    Chen, Gang
    Zhou, Xianke
    Zhang, Zhenjie
    Tung, Anthony K. H.
    Winslett, Marianne
    [J]. 2015 IEEE 31ST INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE), 2015, : 113 - 124
  • [44] Datacast: A Scalable and Efficient Reliable Group Data Delivery Service for Data Centers
    Cao, Jiaxin
    Guo, Chuanxiong
    Lu, Guohan
    Xiong, Yongqiang
    Zheng, Yixin
    Zhang, Yongguang
    Zhu, Yibo
    Chen, Chen
    Tian, Ye
    [J]. IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 2013, 31 (12) : 2632 - 2645
  • [45] MetaWBC: POSIX-Compliant Metadata Write-Back Caching for Distributed File Systems
    Qian, Yingjin
    Cheng, Wen
    Zeng, Lingfang
    Vef, Marc-Andre
    Drokin, Oleg
    Dilger, Andreas
    Ihara, Shuichi
    Zhang, Wusheng
    Wang, Yang
    Brinkmann, Andre
    [J]. SC22: INTERNATIONAL CONFERENCE FOR HIGH PERFORMANCE COMPUTING, NETWORKING, STORAGE AND ANALYSIS, 2022,
  • [46] An End-to-End Learning-Based Metadata Management Approach for Distributed File Systems
    Gao, Yuanning
    Gao, Xiaofeng
    Zhang, Ruisi
    Chen, Guihai
    [J]. IEEE TRANSACTIONS ON COMPUTERS, 2022, 71 (05) : 1021 - 1034
  • [47] Design and Implementation of a Non-Shared Metadata Server Cluster for Large Distributed File Systems
    Yun, Jong-Hyeon
    Park, Yong-Hun
    Lee, Seok-Jae
    Jang, Su-Min
    Yoo, Jae-Soo
    [J]. CSA 2008: INTERNATIONAL SYMPOSIUM ON COMPUTER SCIENCE AND ITS APPLICATIONS, PROCEEDINGS, 2008, : 343 - 346
  • [48] Scalable Distributed Cloud Data Storage Service for Internet of Things
    Shwe, Hnin Yu
    Chong, Peter Han Joo
    [J]. 2016 INT IEEE CONFERENCES ON UBIQUITOUS INTELLIGENCE & COMPUTING, ADVANCED & TRUSTED COMPUTING, SCALABLE COMPUTING AND COMMUNICATIONS, CLOUD AND BIG DATA COMPUTING, INTERNET OF PEOPLE, AND SMART WORLD CONGRESS (UIC/ATC/SCALCOM/CBDCOM/IOP/SMARTWORLD), 2016, : 869 - 873
  • [49] NuMessage: Providing Scalable and Reliable Messaging Service in Distributed Systems
    Liu, Lubin
    Liu, Tong
    Wang, Xinglang
    Xiao, Tao
    Fang, Wei
    Chen, HongYue
    [J]. WEB ENGINEERING, ICWE 2020, 2020, 12128 : 102 - 110
  • [50] BitDew: A data management and distribution service with multi-protocol file transfer and metadata abstraction
    Fedak, Gilles
    He, Haiwu
    Cappello, Franck
    [J]. JOURNAL OF NETWORK AND COMPUTER APPLICATIONS, 2009, 32 (05) : 961 - 975