HBA: Distributed metadata management for large cluster-based storage systems

被引:49
|
作者
Zhu, Yifeng [1 ]
Jiang, Hong [2 ]
Wang, Jun [3 ]
Xian, Feng [2 ]
机构
[1] Univ Maine, Dept Elect & Comp Engn, Orono, ME 04473 USA
[2] Univ Nebraska, Dept Comp Sci & Engn, Lincoln, NE 68588 USA
[3] Univ Cent Florida, Sch Elect Engn & Comp Sci, Orlando, FL 32816 USA
基金
美国国家航空航天局; 美国国家科学基金会;
关键词
distributed file systems; file system management; metadata management; Bloom filter;
D O I
10.1109/TPDS.2007.70788
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
An efficient and distributed scheme for file mapping or file lookup is critical in decentralizing metadata management within a group of metadata servers. This paper presents a novel technique called Hierarchical Bloom Filter Arrays (HBA) to map filenames to the metadata servers holding their metadata. Two levels of probabilistic arrays, namely, the Bloom filter arrays with different levels of accuracies, are used on each metadata server. One array, with lower accuracy and representing the distribution of the entire metadata, trades accuracy for significantly reduced memory overhead, whereas the other array, with higher accuracy, caches partial distribution information and exploits the temporal locality of file access patterns. Both arrays are replicated to all metadata servers to support fast local lookups. We evaluate HBA through extensive trace-driven simulations and implementation in Linux. Simulation results show our HBA design to be highly effective and efficient in improving the performance and scalability of file systems in clusters with 1,000 to 10,000 nodes (or superclusters) and with the amount of data in the petabyte scale or higher. Our implementation indicates that HBA can reduce the metadata operation time of a single-metadata-server architecture by a factor of up to 43.9 when the system is configured with 16 metadata servers.
引用
收藏
页码:750 / 763
页数:14
相关论文
共 50 条
  • [41] An Adaptive Metadata Management Scheme Based on Deep Reinforcement Learning for Large-Scale Distributed File Systems
    Huang, Xiuqi
    Gao, Yuanning
    Zhou, Xinyi
    Gao, Xiaofeng
    Chen, Guihai
    [J]. IEEE-ACM TRANSACTIONS ON NETWORKING, 2023, 31 (06) : 2840 - 2853
  • [42] Large-Scale Data Storage and Management Scheme Based on Distributed Database Systems
    Sun, Qiao
    Deng, Bu-qiao
    Fu, Lan-mei
    Wang, Zhi-qiang
    Pei, Xu-bin
    Sun, Jia-Song
    [J]. PROCEEDINGS OF THE 2017 INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY AND INTELLIGENT MANUFACTURING (ITIM 2017), 2017, 142 : 14 - 17
  • [43] Key Management for Large-Scale Distributed Storage Systems
    Lim, Hoon Wei
    [J]. PUBLIC KEY INFRASTRUCTURES, SERVICES AND APPLICATIONS, 2010, 6391 : 99 - 113
  • [44] Multi-channel visualization and management on mobile devices of a cluster-based distributed application
    Gardiol, W
    Monge, F
    Giacominetto, GF
    Lamberti, F
    Montrucchio, B
    [J]. 8TH WORLD MULTI-CONFERENCE ON SYSTEMICS, CYBERNETICS AND INFORMATICS, VOL II, PROCEEDINGS: COMPUTING TECHNIQUES, 2004, : 28 - 32
  • [45] A cluster-based framework for interface analysis in large-scale aerospace systems
    Goldschmid, Josh
    Corns, Steven
    [J]. SYSTEMS ENGINEERING, 2021, 24 (05) : 339 - 351
  • [46] A Cluster-based Approach to Consensus Based Distributed Task Allocation
    Smith, Darren
    Wetherall, Jodie
    Woodhead, Steve
    Adekunle, Andrew
    [J]. 2014 22ND EUROMICRO INTERNATIONAL CONFERENCE ON PARALLEL, DISTRIBUTED, AND NETWORK-BASED PROCESSING (PDP 2014), 2014, : 428 - 431
  • [47] LaMeta: An efficient locality aware metadata management technique for an ultra-large distributed storage system
    Singh, Harcharan Jit
    Bawa, Seema
    [J]. JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2022, 34 (10) : 8323 - 8335
  • [48] Using Provenance to Boost the Metadata Prefetching in Distributed Storage Systems
    Wu, Guojin
    Deng, Yuhui
    Qin, Xiao
    [J]. PROCEEDINGS OF THE 34TH IEEE INTERNATIONAL CONFERENCE ON COMPUTER DESIGN (ICCD), 2016, : 80 - 87
  • [49] A Distributed Cluster-Based algorithm in Wireless Sensor Networks
    Wang, Xiaomin
    Song, Chao
    [J]. FCST 2009: PROCEEDINGS OF THE 4TH INTERNATIONAL CONFERENCE ON FRONTIER OF COMPUTER SCIENCE AND TECHNOLOGY, 2009, : 733 - 737
  • [50] Cluster-Based Distributed Face Tracking in Camera Networks
    Yoder, Josiah
    Medeiros, Henry
    Park, Johnny
    Kak, Avinash C.
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2010, 19 (10) : 2551 - 2563