ROVER: Robust and Verifiable Erasure Code for Hadoop Distributed File Systems

被引:0
|
作者
Wang, Teng [1 ]
Nam Son Nguyen [1 ]
Wang, Jiayin [2 ]
Li, Tengpeng [1 ]
Zhang, Xiaoqian [1 ]
Mi, Ningfang [3 ]
Zhao, Bin [4 ]
Sheng, Bo [1 ]
机构
[1] Univ Massachusetts, Dept Comp Sci, 100 Morrissey Blvd, Boston, MA 02125 USA
[2] Montclair State Univ, Dept Comp Sci, 1 Normal Ave, Montclair, NJ 07043 USA
[3] Northeastern Univ, Dept Elect & Comp Engn, 360 Huntington Ave, Boston, MA 02115 USA
[4] Nanjing Normal Univ, Sch Comp Sci & Technol, Nanjing, Jiangsu, Peoples R China
基金
美国国家科学基金会;
关键词
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Erasure Coding based Storage (ECS) is replacing replica-based systems because of its low storage overhead. In an ECS, however, every task needs to fetch remote pieces of data for its execution, and data verification is missing in the current framework. As security issues keep rising and there have been security incidents occurred in big data platforms, the compromised nodes in a computing cluster may manipulate its hosted data fed for other nodes yielding misleading results. Without replicas, it is quite challenging to efficiently verify the data integrity in ECS. In this paper, we develop ROVER, which is an efficient and verifiable ECS for big data platforms. In ROVER, every piece of data is monitored by its checksums stored on a set of witnesses. Bloom filter technique is used on each witness to efficiently keep the records of the checksums. The data verification is based on the majority voting. ROVER also supports a quick reconstruction of Bloom Filter when a node recovers from a failure. We present a complete system framework, security analysis, and a guideline for setting the parameters. The implementation and evaluation show that ROVER is robust and efficient against the attack from the compromised nodes.
引用
收藏
页数:9
相关论文
共 50 条
  • [1] Decentralised erasure code for Hadoop distributed cloud file systems
    Mohana Prasad, K.
    Kiriti, S.
    Reddy, V.T. Sudharshan
    John, Albert Mayan
    [J]. International Journal of Cloud Computing, 2022, 11 (5-6) : 552 - 559
  • [2] Erasure Code of Small File in a Distributed File System
    Chen, Xinhai
    Liu, Jie
    Xie, Peizhen
    [J]. PROCEEDINGS OF 2017 3RD IEEE INTERNATIONAL CONFERENCE ON COMPUTER AND COMMUNICATIONS (ICCC), 2017, : 2549 - 2554
  • [3] An Efficient Binary Locally Repairable Code for Hadoop Distributed File System
    Shahabinejad, Mostafa
    Khabbazian, Majid
    Ardakani, Masoud
    [J]. IEEE COMMUNICATIONS LETTERS, 2014, 18 (08) : 1287 - 1290
  • [4] The Hadoop Distributed File System
    Shvachko, Konstantin
    Kuang, Hairong
    Radia, Sanjay
    Chansler, Robert
    [J]. 2010 IEEE 26TH SYMPOSIUM ON MASS STORAGE SYSTEMS AND TECHNOLOGIES (MSST), 2010,
  • [5] A NOVEL APPROACH FOR REPLICA SYNCHRONIZATION IN HADOOP DISTRIBUTED FILE SYSTEMS
    Vini, Miss. J.
    Nallathamby, Rachel
    Robin, C. R. Rene
    [J]. BIG DATA, CLOUD AND COMPUTING CHALLENGES, 2015, 50 : 590 - 595
  • [6] Adaptive erasure code based distributed storage systems
    Rai, Brijesh Kumar
    [J]. 2015 IEEE 14TH CANADIAN WORKSHOP ON INFORMATION THEORY (CWIT), 2015, : 174 - 177
  • [7] Hadoop Distributed File System for the Grid
    Attebury, Garhan
    Baranovski, Andrew
    Bloom, Ken
    Bockelman, Brian
    Kcira, Dorian
    Letts, James
    Levshina, Tanya
    Lundestedt, Carl
    Martin, Terrence
    Maier, Will
    Pi, Haifeng
    Rana, Abhishek
    Sfiligoi, Igor
    Sim, Alexander
    Thomas, Michael
    Wuerthwein, Frank
    [J]. 2009 IEEE NUCLEAR SCIENCE SYMPOSIUM CONFERENCE RECORD, VOLS 1-5, 2009, : 1056 - +
  • [8] Research on Distributed File System with Hadoop
    Xu, JunWu
    Liang, JunLing
    [J]. NETWORK COMPUTING AND INFORMATION SECURITY, 2012, 345 : 148 - +
  • [9] The Evolution of the Hadoop Distributed File System
    Maneas, Stathis
    Schroeder, Bianca
    [J]. 2018 32ND INTERNATIONAL CONFERENCE ON ADVANCED INFORMATION NETWORKING AND APPLICATIONS WORKSHOPS (WAINA), 2018, : 67 - 74
  • [10] An Erasure Code With Reduced Average Locality for Distributed Storage Systems
    Shahabinejad, Mostafa
    Ardakani, Masoud
    Khabbazian, Majid
    [J]. 2017 INTERNATIONAL CONFERENCE ON COMPUTING, NETWORKING AND COMMUNICATIONS (ICNC), 2016, : 427 - 431