Scalable Metadata Management Techniques for Ultra-Large Distributed Storage Systems - A Systematic Review

Cited by: 9
Authors
Singh, Harcharan Jit [1 ]
Bawa, Seema [1 ]
Affiliation
[1] Thapar Univ, Comp Sci & Engn Dept, Patiala 147004, Punjab, India
Keywords
Distributed computing; big data storage systems; distributed file system; data-intensive computing; metadata management; scalability; load balancing; locality; namespace; FILE-SYSTEMS; PERFORMANCE; SERVICE; OVERLAY; SCALE; CODA;
DOI
10.1145/3212686
Chinese Library Classification (CLC) number
TP301 [Theory, Methods];
Discipline classification code
081202
Abstract
The provisioning of an efficient ultra-large scalable distributed storage system for expanding cloud applications has been a challenging job for researchers in academia and industry. In such an ultra-large-scale storage system, data are distributed on multiple storage nodes for performance, scalability, and availability. The access to this distributed data is through its metadata, maintained by multiple metadata servers. The metadata carries information about the physical address of data and access privileges. The efficiency of a storage system highly depends on effective metadata management. This research presents an extensive systematic literature analysis of metadata management techniques in storage systems. This research work will help researchers to find the significance of metadata management and important parameters of metadata management techniques for storage systems. Methodical examination of metadata management techniques developed by various industry and research groups is described. The different metadata distribution techniques lead to various taxonomies. Furthermore, the article investigates techniques based on distribution structures and key parameters of metadata management. It also presents strengths and weaknesses of individual existing techniques that will help researchers to select the most appropriate technique for specific applications. Finally, it discusses existing challenges and significant research directions in metadata management for researchers.
Pages: 37
Related papers
50 in total
  • [11] Scalable management - Technologies for management of large-scale, distributed systems
    Adams, R
    Brett, P
    Iyer, S
    Milojicic, D
    Rafaeli, S
    Talwar, V
    [J]. ICAC 2005: SECOND INTERNATIONAL CONFERENCE ON AUTONOMIC COMPUTING, PROCEEDINGS, 2005, : 159 - 170
  • [12] Metadata management in global distributed storage system
    Yi, CJ
    Jin, H
    Jia, YJ
    [J]. CURRENT TRENDS IN HIGH PERFORMANCE COMPUTING AND ITS APPLICATIONS, PROCEEDINGS, 2005, : 175 - 184
  • [13] INNOVATED SCALABLE EFFICIENT ESTIMATION IN ULTRA-LARGE GAUSSIAN GRAPHICAL MODELS
    Fan, Yingying
    Lv, Jinchi
    [J]. ANNALS OF STATISTICS, 2016, 44 (05): : 2098 - 2126
  • [14] Metadata Management for Distributed Multimedia Storage System
    Zhan, Ling
    Wan, Jiguang
    Gu, Peng
    [J]. PROCEEDINGS OF THE INTERNATIONAL SYMPOSIUM ON ELECTRONIC COMMERCE AND SECURITY, 2008, : 443 - +
  • [15] A scalable switch architecture for ultra-large IP and lambda switch routers
    Hirano, M
    Aoki, M
    Matsuura, N
    Kurimoto, T
    Miyamura, T
    Goshima, M
    Urushidani, S
    [J]. ICT'2003: 10TH INTERNATIONAL CONFERENCE ON TELECOMMUNICATIONS, VOLS I AND II, CONFERENCE PROCEEDINGS, 2003, : 1656 - 1661
  • [16] DROP: Facilitating Distributed Metadata Management in EB-scale Storage Systems
    Xu, Quanqing
    Arumugam, Rajesh Vellore
    Yong, Khai Leong
    Mahadevan, Sridhar
    [J]. 2013 IEEE 29TH SYMPOSIUM ON MASS STORAGE SYSTEMS AND TECHNOLOGIES (MSST), 2013,
  • [17] Bristrita: Namespace and Metadata Distribution in Large-Scale Distributed Cloud Storage Systems
    Dewan, Hrishikesh
    Hansdah, R. C.
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON SMART CLOUD (SMARTCLOUD), 2018, : 116 - 124
  • [18] Workshop on software technologies for ultra-large scale systems
    Gabriel, Richard
    Kazman, Rick
    Northrop, Linda
    Schmidt, Douglas
    Sullivan, Kevin
    [J]. 29th International Conference on Software Engineering: ICSE 2007 Companion Volume, Proceedings, 2007, : 140 - 141
  • [19] Scalable PGAS Metadata Management on Extreme Scale Systems
    Chavarria-Miranda, Daniel
    Agarwal, Khushbu
    Straatsma, T. P.
    [J]. PROCEEDINGS OF THE 2013 13TH IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND GRID COMPUTING (CCGRID 2013), 2013, : 103 - 111
  • [20] Dynamic metadata management for scalable stream processing systems
    Cammert, Michael
    Kraemer, Juergen
    Seeger, Bernhard
    [J]. 2007 IEEE 23RD INTERNATIONAL CONFERENCE ON DATA ENGINEERING WORKSHOP, VOLS 1-2, 2007, : 644 - 653