Sandbox security model for Hadoop file system

被引:5
|
作者
Begum, Gousiya [1 ,4 ]
Ul Huq, S. Zahoor [2 ]
Kumar, A. P. Siva [3 ]
机构
[1] Mahatma Gandhi Inst Technol, Dept Comp Sci & Engn, Hyderabad, Telangana, India
[2] GPREC, Dept Comp Sci & Engn, Kurnool, Andhra Pradesh, India
[3] JNTUA, Dept Comp Sci & Engn, Anantapuramu, Andhra Pradesh, India
[4] JNTU Anantapur, Anantapuramu, Andhra Pradesh, India
关键词
HDFS; MapReduce; Fsimage; Hadoop; Kerberos;
D O I
10.1186/s40537-020-00356-z
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Extensive usage of Internet based applications in day to day life has led to generation of huge amounts of data every minute. Apart from humans, data is generated by machines like sensors, satellite, CCTV etc. This huge collection of heterogeneous data is often referred as Big Data which can be processed to draw useful insights. Apache Hadoop has emerged has widely used open source software framework for Big Data Processing and it is a cluster of cooperative computers enabling distributed parallel processing. Hadoop Distributed File System is used to store data blocks replicated and spanned across different nodes. HDFS uses an AES based cryptographic techniques at block level which is transparent and end to end in nature. However cryptography provides security from unauthorized access to the data blocks, but a legitimate user can still harm the data. One such example was execution of malicious map reduce jar files by legitimate user which can harm the data in the HDFS. We developed a mechanism where every map reduce jar will be tested by our sandbox security to ensure the jar is not malicious and suspicious jar files are not allowed to process the data in the HDFS. This feature is not present in the existing Apache Hadoop framework and our work is made available in github for consideration and inclusion in the future versions of Apache Hadoop.
引用
收藏
页数:10
相关论文
共 50 条
  • [31] Formation of Single and Multinode Clusters in Hadoop Distributed File System
    Begum, A. Aasha
    Chitra, K.
    2017 2ND WORLD CONGRESS ON COMPUTING AND COMMUNICATION TECHNOLOGIES (WCCCT), 2017, : 162 - 164
  • [32] On the Power of In-Network Caching in the Hadoop Distributed File System
    Newberry, Eric
    Zhang, Beichuan
    PROCEEDINGS OF THE 2019 CONFERENCE ON INFORMATION-CENTRIC NETWORKING (ICN '19), 2019, : 89 - 99
  • [33] Customized Web User Interface for Hadoop Distributed File System
    Krishna, T. Lakshmi Siva Rama
    Ragunathan, T.
    Battula, Sudheer Kumar
    PROCEEDINGS OF THE SECOND INTERNATIONAL CONFERENCE ON COMPUTER AND COMMUNICATION TECHNOLOGIES, IC3T 2015, VOL 2, 2016, 380 : 567 - 576
  • [34] Complete Data Deletion Based on Hadoop Distributed File System
    Wang, Fulin
    Wu, Shunxiang
    Cai, Jianhuai
    Zhao, Longze
    Liao, Zhendong
    Ming, Daodong
    PROCEEDINGS OF THE THIRD INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND APPLICATION ENGINEERING (CSAE2019), 2019,
  • [35] A New Replica Placement Policy for Hadoop Distributed File System
    Dai, Wei
    Ibrahim, Ibrahim
    Bassiouni, Mostafa
    2016 IEEE 2ND INTERNATIONAL CONFERENCE ON BIG DATA SECURITY ON CLOUD (BIGDATASECURITY), IEEE INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE AND SMART COMPUTING (HPSC), AND IEEE INTERNATIONAL CONFERENCE ON INTELLIGENT DATA AND SECURITY (IDS), 2016, : 262 - 267
  • [36] Research of Cloud Storage Based on Hadoop Distributed File System
    Han, Yongqi
    Zhang, Yun
    Yu, Shui
    APPLIED SCIENCE, MATERIALS SCIENCE AND INFORMATION TECHNOLOGIES IN INDUSTRY, 2014, 513-517 : 2472 - 2475
  • [37] Optimization of Hadoop Small File Storage using Priority Model
    Nivedita, V
    Geetha, J.
    2017 2ND IEEE INTERNATIONAL CONFERENCE ON RECENT TRENDS IN ELECTRONICS, INFORMATION & COMMUNICATION TECHNOLOGY (RTEICT), 2017, : 1785 - 1789
  • [38] A Load-Balancing Algorithm for Hadoop Distributed File System
    Lin, Chi-Yi
    Lin, Ying-Chen
    PROCEEDINGS 2015 18TH INTERNATIONAL CONFERENCE ON NETWORK-BASED INFORMATION SYSTEMS (NBIS 2015), 2015, : 173 - 179
  • [39] Dealing with Small Files Problem in Hadoop Distributed File System
    Bende, Sachin
    Shedge, Ashree
    PROCEEDINGS OF INTERNATIONAL CONFERENCE ON COMMUNICATION, COMPUTING AND VIRTUALIZATION (ICCCV) 2016, 2016, 79 : 1001 - 1012
  • [40] Towards a Better Replica Management for Hadoop Distributed File System
    Ciritoglu, Hilmi Egemen
    Saber, Takfarinas
    Buda, Teodora Sandra
    Murphy, John
    Thorpe, Christina
    2018 IEEE INTERNATIONAL CONGRESS ON BIG DATA (IEEE BIGDATA CONGRESS), 2018, : 104 - 111