Efficient Handling of Heterogeneous File Formats in HDFS

Cited by: 0
Authors
Prashant, More Vaishali [1]
Raut, Suhas D. [1]
Affiliations
[1] NK Orchid Coll Engn & Tech, Dept Comp Sci & Engn, Solapur, Maharashtra, India
Keywords
Big Data; Hadoop; HDFS
DOI
Not available
Chinese Library Classification number
TP301 [Theory, Methods]
Discipline classification code
081202
Abstract
The amount of data in our industry and the world is exploding. Big data is a popular term used to describe the exponential growth and availability of data, both structured and unstructured. In an organization, multiple types of documents are collected from different sources. These include documents that need to be accessible immediately, documents that need to be accessed within a few seconds or minutes, and documents that are accessed infrequently. While these types of documents play different roles within an organization, each is valuable. These different types of documents require different kinds of storage solutions. To handle such heterogeneous file formats, we use Hadoop. In Hadoop, storage of the different documents is provided by HDFS (Hadoop Distributed File System). In educational organizations, document categorization is also one of the most important tasks. The need to keep documents available and to assign a category to each document motivated the implementation of this project.
Pages: 6