Research on Small File Processing Technology Based on HDFS

被引:0
|
作者
Gu, Rui
机构
关键词
HDFS; cloud storage; small files; file merge; insert;
D O I
暂无
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
With the rapid development of the Internet and the rapid growth of Internet users, the Internet data is also a sharp expansion. The emergence of cloud computing is a good solution to the large data computing and storage problems, massive data storage and analysis has become a very popular research field. HDFS uses a single NameNode to manage the metadata of the entire system, and stores metadata in memory in order to improve access efficiency, but when the system stores a large number of small files, it generates a lot of metadata, occupies larger NameNode memory. In addition, a large number of small file access need to frequently send a request to the NameNode, resulting in the NameNode overload. In view of this problem, this paper analyzes some of the previous research and improvement programs, and on this basis to do a corresponding improvement. On the basis of the original distributed file system, an independent small file processing module was added. The small file processing module merged the small files, created the index of the file, and passed the file cache to HDFS for data processing.
引用
收藏
页码:286 / 289
页数:4
相关论文
共 50 条
  • [41] Research of Massive Small Files Reading Optimization Based on Parallel Network File System
    Yang, Hongzhang
    Zhang, Junwei
    Zeng, Xiangchao
    Dong, Huanqing
    Xu, Lu
    2015 IEEE 17TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS, 2015 IEEE 7TH INTERNATIONAL SYMPOSIUM ON CYBERSPACE SAFETY AND SECURITY, AND 2015 IEEE 12TH INTERNATIONAL CONFERENCE ON EMBEDDED SOFTWARE AND SYSTEMS (ICESS), 2015, : 204 - 212
  • [42] Cloud Distributed File Systems: a Benchmark of HDFS, Ceph, GlusterFS, and XtremeFS
    Acquaviva, Luca
    Bellavista, Paolo
    Corradi, Antonio
    Foschini, Luca
    Gioia, Leo
    Picone, Pasquale Carlo Maiorano
    2018 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2018,
  • [43] SD-HDFS: Secure Deletion in Hadoop Distributed File System
    Agrawal, Bikash
    Hansen, Raymond
    Rong, Chunming
    Wiktorski, Tomasz
    2016 IEEE INTERNATIONAL CONGRESS ON BIG DATA - BIGDATA CONGRESS 2016, 2016, : 181 - 189
  • [44] Excel File Processing Based on NPOI Package
    Ma, Li-hua
    Li, Sheng-ming
    Wang, Xiao-lan
    Pang, Feng
    2018 3RD INTERNATIONAL CONFERENCE ON COMPUTATIONAL MODELING, SIMULATION AND APPLIED MATHEMATICS (CMSAM 2018), 2018, 310 : 338 - 341
  • [45] File-based data processing on MESSENGER
    Krupiarz, Christopher J.
    Artis, David A.
    Calloway, Andrew B.
    Frangos, Constantine M.
    Heggestad, Brian K.
    Holland, Douglas B.
    Stratton, William C.
    ACTA ASTRONAUTICA, 2006, 59 (8-11) : 1071 - 1078
  • [46] File-based data processing on MESSENGER
    Krupiarz, CJ
    Artis, DA
    Calloway, AB
    Frangos, CM
    Heggestad, BK
    Holland, DB
    Stratton, WC
    PROCEEDINGS OF THE FIFTH IAA INTERNATIONAL CONFERENCE ON LOW-COST PLANETARY MISSIONS, 2003, 542 : 435 - 442
  • [47] The Research of Electrical equipment identification technology based on image processing
    Shi Guiming
    Liu Zhengjun
    Su Hang
    Wei Qingtao
    APPLIED SCIENCE, MATERIALS SCIENCE AND INFORMATION TECHNOLOGIES IN INDUSTRY, 2014, 513-517 : 2816 - 2819
  • [48] Research on Decoding QR Code based on Image Processing Technology
    Guo, Jianmin
    Feng, Lijie
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON ADVANCES IN MECHANICAL ENGINEERING AND INDUSTRIAL INFORMATICS, 2015, 15 : 1614 - 1620
  • [49] An Algorithm of Merging Small Files in HDFS
    Ren, Xianzhen
    Geng, Xiuhua
    Zhu, Yi
    2019 2ND INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND BIG DATA (ICAIBD 2019), 2019, : 24 - 27
  • [50] Research on Plant State Detection Based on Image Processing Technology
    Xu, Xiaoliang
    Han, Fangfang
    Tian, Yifan
    Zhu, Minfeng
    Zhang, Taiping
    Zhang, Tingting
    PROCEEDINGS OF 2022 IEEE INTERNATIONAL CONFERENCE ON MECHATRONICS AND AUTOMATION (IEEE ICMA 2022), 2022, : 400 - 406