Distributed file system for rewriting Big Data files using a local-write protocol

Cited by: 1
Authors
da Silva, Erico Correia [1]
Sato, Liria Matsumoto [1]
Midorikawa, Edson Toshimi [1]
Affiliations
[1] Univ Sao Paulo, Escola Politecn, Sao Paulo, Brazil
Keywords
Distributed file systems; Hadoop; Big Data; Distributed lock management;
DOI
10.1109/BigData52589.2021.9671741
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
With the exponential volume growth of the data available for scientific and commercial use, more and more Big Data technologies are gaining focus and importance. Directly related to the efficiency of these techniques is the distributed file system used for data persistence, generally based on low-cost computer clusters. However, the environments used today for Big Data are based on file systems restricted to the WORM pattern (write once, read many) lacking POSIX compatibility. This work uses distributed lock management techniques to create a file system that allows random writing for both HPC and Big Data tools. A local write protocol is implemented to leverage the use of local copies of the data during the write process. Experiments were carried out to evaluate the performance of the proposed write protocol and the scalability of the developed file system. From the experimental results, it is possible to conclude that the achieved performance and scalability improvements were obtained by eliminating limitations imposed by HDFS and leveraging local writes.
Pages: 3646-3655
Page count: 10
Related papers
50 in total
  • [41] An enhancement of data locality in Hadoop distributed file system
    Reddy, A. Siva Krishna
    Sujatha, Pothula
    Koti, Prasad
    Dhavachelvan, P.
    Amudhavel, J.
    [J]. BIOSCIENCE BIOTECHNOLOGY RESEARCH COMMUNICATIONS, 2018, 11 (01): : 123 - 133
  • [42] Σ-Tree: Design of a Data Structure for Storing File Data Allocation Map in a Distributed File System
    Dewan, Hrishikesh
    Hansdah, R. C.
    Singh, Prashant
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON SMART CLOUD (SMARTCLOUD), 2018, : 90 - 98
  • [43] Development of distributed file system for storing weather data
    Sherstnev, V. S.
    Botygin, I. A.
    Zenzin, A. S.
    Sherstneva, A. I.
    Galanova, N. Y.
    [J]. 22ND INTERNATIONAL SYMPOSIUM ON ATMOSPHERIC AND OCEAN OPTICS: ATMOSPHERIC PHYSICS, 2016, 10035
  • [44] Distributed File System to Leverage Data Locality for Large-File Processing
    da Silva, Erico Correia
    Sato, Liria Matsumoto
    Midorikawa, Edson Toshimi
    [J]. ELECTRONICS, 2024, 13 (01)
  • [45] THE FRAMEWORK OF A DISTRIBUTED FILE SYSTEM FOR GEOSPATIAL DATA MANAGEMENT
    Cui, Jifeng
    Li, Chao
    Xing, Chunxiao
    Zhang, Yong
    [J]. 2011 IEEE INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND INTELLIGENCE SYSTEMS, 2011, : 183 - 187
  • [46] Performance Evaluations of Distributed File Systems for Scientific Big Data in FUSE Environment
    Lee, Jun-Yeong
    Kim, Moon-Hyun
    Shah, Syed Asif Raza
    Ahn, Sang-Un
    Yoon, Heejun
    Noh, Seo-Young
    [J]. ELECTRONICS, 2021, 10 (12)
  • [47] XPMFS: A New NVM File System for Vehicle Big Data
    Niu, Dejiao
    He, Qingjian
    Cai, Tao
    Chen, Bo
    Zhan, Yongzhao
    Liang, Jun
    [J]. IEEE ACCESS, 2018, 6 : 34863 - 34873
  • [48] Efficient Prefetching Technique for Storage of Heterogeneous small files in Hadoop Distributed File System Federation
    Aishwarya, K.
    Ram, Arvind A.
    Sreevatson, M. C.
    Babu, Chitra
    Prabavathy, B.
    [J]. 2013 FIFTH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTING (ICOAC), 2013, : 523 - 530
  • [49] Efficient and Secure Distributed Data Storage and Retrieval Using Interplanetary File System and Blockchain
    Bin Saif, Muhammad
    Migliorini, Sara
    Spoto, Fausto
    [J]. FUTURE INTERNET, 2024, 16 (03)
  • [50] Semantic Distributed Data for Vehicular Networks Using the Inter-Planetary File System
    Ortega, Victor
    Monserrat, Jose F.
    [J]. SENSORS, 2020, 20 (22) : 1 - 21