Sandbox security model for Hadoop file system

被引:5
|
作者
Begum, Gousiya [1 ,4 ]
Ul Huq, S. Zahoor [2 ]
Kumar, A. P. Siva [3 ]
机构
[1] Mahatma Gandhi Inst Technol, Dept Comp Sci & Engn, Hyderabad, Telangana, India
[2] GPREC, Dept Comp Sci & Engn, Kurnool, Andhra Pradesh, India
[3] JNTUA, Dept Comp Sci & Engn, Anantapuramu, Andhra Pradesh, India
[4] JNTU Anantapur, Anantapuramu, Andhra Pradesh, India
关键词
HDFS; MapReduce; Fsimage; Hadoop; Kerberos;
D O I
10.1186/s40537-020-00356-z
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Extensive usage of Internet based applications in day to day life has led to generation of huge amounts of data every minute. Apart from humans, data is generated by machines like sensors, satellite, CCTV etc. This huge collection of heterogeneous data is often referred as Big Data which can be processed to draw useful insights. Apache Hadoop has emerged has widely used open source software framework for Big Data Processing and it is a cluster of cooperative computers enabling distributed parallel processing. Hadoop Distributed File System is used to store data blocks replicated and spanned across different nodes. HDFS uses an AES based cryptographic techniques at block level which is transparent and end to end in nature. However cryptography provides security from unauthorized access to the data blocks, but a legitimate user can still harm the data. One such example was execution of malicious map reduce jar files by legitimate user which can harm the data in the HDFS. We developed a mechanism where every map reduce jar will be tested by our sandbox security to ensure the jar is not malicious and suspicious jar files are not allowed to process the data in the HDFS. This feature is not present in the existing Apache Hadoop framework and our work is made available in github for consideration and inclusion in the future versions of Apache Hadoop.
引用
收藏
页数:10
相关论文
共 50 条
  • [21] An enhancement of data locality in Hadoop distributed file system
    Reddy, A. Siva Krishna
    Sujatha, Pothula
    Koti, Prasad
    Dhavachelvan, P.
    Amudhavel, J.
    BIOSCIENCE BIOTECHNOLOGY RESEARCH COMMUNICATIONS, 2018, 11 (01): : 123 - 133
  • [22] Developing Architectural Documentation for the Hadoop Distributed File System
    Bass, Len
    Kazman, Rick
    Ozkaya, Ipek
    OPEN SOURCE SYSTEMS: GROUNDING RESEARCH, 2011, 365 : 50 - 61
  • [23] Hadoop Distributed File System for Big data analysis
    Almansouri, Hatim Talal
    Masmoudi, Youssef
    PROCEEDINGS OF 2019 IEEE 4TH WORLD CONFERENCE ON COMPLEX SYSTEMS (WCCS' 19), 2019, : 257 - 261
  • [24] Dynamic Preclusion of Encroachment in Hadoop Distributed File System
    Saranya, S.
    Sarumathi, M.
    Swathi, B.
    Paul, P. Victer
    Kumar, S. Sampath
    Vengattaraman, T.
    BIG DATA, CLOUD AND COMPUTING CHALLENGES, 2015, 50 : 531 - 536
  • [25] Enhancing Replica Synchronization in Hadoop Distributed File System
    Kumari, Jyoti
    Biswas, Tarun
    Vuppala, Satyanarayana
    2018 9TH INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATION AND NETWORKING TECHNOLOGIES (ICCCNT), 2018,
  • [26] Optimization of Small Sized File Access Efficiency in Hadoop Distributed File System by Integrating Virtual File System Layer
    Alange, Neeta
    Mathur, Anjali
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2022, 13 (06) : 204 - 210
  • [27] A token-based authentication security scheme for Hadoop distributed file system using elliptic curve cryptography
    Jeong, Yoon-Su
    Kim, Yong-Tae
    JOURNAL OF COMPUTER VIROLOGY AND HACKING TECHNIQUES, 2015, 11 (03): : 137 - 142
  • [28] An Efficient Data Duplication System based on Hadoop Distributed File System
    Veeraiah, D.
    Rao, J. Nageswara
    PROCEEDINGS OF THE 5TH INTERNATIONAL CONFERENCE ON INVENTIVE COMPUTATION TECHNOLOGIES (ICICT-2020), 2020, : 197 - 200
  • [29] Data Adaptively Storing Approach for Hadoop Distributed File System
    Fu, Yingxun
    Wen, Shilin
    Ma, Li
    2017 2ND IEEE INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND APPLICATIONS (ICCIA), 2017, : 20 - 24
  • [30] A CKAN Plugin for Data Harvesting to the Hadoop Distributed File System
    Scholz, Robert
    Tcholtchev, Nikolay
    Laemmel, Philipp
    Schieferdecker, Ina
    CLOSER: PROCEEDINGS OF THE 7TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND SERVICES SCIENCE, 2017, : 19 - 28