An Efficient and Metadata-Aware Big Data Storage Architecture

被引:0
|
作者
Jin, Rize [1 ]
Paik, Joon-Young [1 ]
Biadgie, Yenewondim [2 ]
机构
[1] Tiangong Univ, Sch Comp Sci & Technol, Tianjin 300160, Peoples R China
[2] Ajou Univ, Dept Software & Comp Engn, Suwon 16499, South Korea
基金
中国国家自然科学基金;
关键词
Big data; Small file; File compaction; Metadata management;
D O I
10.1007/978-3-030-59413-8_12
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper introduces a hash partitioning-based file compaction design to improve the efficiency of storing and accessing small files in big data storage systems. The proposed approach consists of a file compaction tool and an access interface. The compaction tool merges a group (usually a directory) of small files into a set of "big files" to reduce the metadata required to be maintained in the on-chip memory. The data locality and tree structure of those small files are preserved. The access interface is designed to provide transparent access to the small files in the big files. Experimental results confirm that the proposed approach lead to a significantly enhancement in terms of namespace usage and access speed.
引用
收藏
页码:146 / 152
页数:7
相关论文
共 50 条
  • [1] Metadata-Aware End-to-End Keyword Spotting
    Liu, Hongyi
    Abhyankar, Apurva
    Mishchenko, Yuriy
    Senechal, Thibaud
    Fu, Gengshen
    Kulis, Brian
    Stein, Noah
    Shah, Anish
    Vitaladevuni, Shiv Naga Prasad
    [J]. INTERSPEECH 2020, 2020, : 2282 - 2286
  • [2] MATCH: Metadata-Aware Text Classification in A Large Hierarchy
    Zhang, Yu
    Shen, Zhihong
    Dong, Yuxiao
    Wang, Kuansan
    Han, Jiawei
    [J]. PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE 2021 (WWW 2021), 2021, : 3246 - 3257
  • [3] An Efficient and Performance-Aware Big Data Storage System
    Li, Yang
    Guo, Li
    Guo, Yike
    [J]. CLOUD COMPUTING AND SERVICES SCIENCE, CLOSER 2012, 2013, 367 : 102 - 116
  • [4] Supporting Source Code Annotations with Metadata-Aware Development Fnvironment
    Juhar, Jan
    [J]. PROCEEDINGS OF THE 2019 FEDERATED CONFERENCE ON COMPUTER SCIENCE AND INFORMATION SYSTEMS (FEDCSIS), 2019, : 411 - 420
  • [5] Metadata-Aware Measures for Answer Summarization in Community Question Answering
    Tomasoni, Mattia
    Huang, Minlie
    [J]. ACL 2010: 48TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, 2010, : 760 - 769
  • [6] Hierarchical Metadata-Aware Document Categorization under Weak Supervision
    Zhang, Yu
    Chen, Xiusi
    Meng, Yu
    Han, Jiawei
    [J]. WSDM '21: PROCEEDINGS OF THE 14TH ACM INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING, 2021, : 770 - 778
  • [7] A metadata-aware application for remote scoring and exchange of tissue microarray images
    Morris, Lorna
    Tsui, Andrew
    Crichton, Charles
    Harris, Steve
    Maccallum, Peter H.
    Howat, William J.
    Davies, Jim
    Brenton, James D.
    Caldas, Carlos
    [J]. BMC BIOINFORMATICS, 2013, 14
  • [8] A metadata-aware application for remote scoring and exchange of tissue microarray images
    Lorna Morris
    Andrew Tsui
    Charles Crichton
    Steve Harris
    Peter H Maccallum
    William J Howat
    Jim Davies
    James D Brenton
    Carlos Caldas
    [J]. BMC Bioinformatics, 14
  • [9] Sentence Alignment of Bilingual Survey Texts Applying a Metadata-Aware Strategy
    Sorato, Danielly
    Zavala-Rojas, Diana
    [J]. NATURAL LANGUAGE PROCESSING AND INFORMATION SYSTEMS (NLDB 2022), 2022, 13286 : 469 - 476
  • [10] TBF: An efficient data architecture for metadata server in the object-based storage network
    Hua, Yu
    Feng, Dan
    Xiao, Bin
    [J]. ICON: 2006 IEEE INTERNATIONAL CONFERENCE ON NETWORKS, VOLS 1 AND 2, PROCEEDINGS: NETWORKING -CHALLENGES AND FRONTIERS, 2006, : 27 - +