Block Storage Optimization and Parallel Data Processing and Analysis of Product Big Data Based on the Hadoop Platform

被引:1
|
作者
Wang, Yajun [1 ]
Cheng, Shengming [1 ]
Zhang, Xinchen [1 ]
Leng, Junyu [1 ]
Liu, Jun [1 ]
机构
[1] Dalian Polytech Univ, Sch Mech Engn & Automat, Dalian 116034, Peoples R China
关键词
ENTROPY;
D O I
10.1155/2021/3839800
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
The traditional distributed database storage architecture has the problems of low efficiency and storage capacity in managing data resources of seafood products. We reviewed various storage and retrieval technologies for the big data resources. A block storage layout optimization method based on the Hadoop platform and a parallel data processing and analysis method based on the MapReduce model are proposed. A multireplica consistent hashing algorithm based on data correlation and spatial and temporal properties is used in the parallel data processing and analysis method. The data distribution strategy and block size adjustment are studied based on the Hadoop platform. A multidata source parallel join query algorithm and a multi-channel data fusion feature extraction algorithm based on data-optimized storage are designed for the big data resources of seafood products according to the MapReduce parallel frame work. Practical verification shows that the storage optimization and data-retrieval methods provide supports for constructing a big data resource-management platform for seafood products and realize efficient organization and management of the big data resources of seafood products. The execution time of multidata source parallel retrieval is only 32% of the time of the standard Hadoop scheme, and the execution time of the multichannel data fusion feature extraction algorithm is only 35% of the time of the standard Hadoop scheme.
引用
收藏
页数:14
相关论文
共 50 条
  • [41] EMM: Extended matching market based scheduling for big data platform hadoop
    Singh, Balraj
    Verma, Harsh K.
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (24) : 34823 - 34847
  • [42] An Enhanced Apriori Algorithm Using Hybrid Data Layout Based on Hadoop for Big Data Processing
    Rochd, Yassir
    Hafidi, Imad
    [J]. INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2018, 18 (06): : 161 - 167
  • [43] Towards a framework for large-scale multimedia data storage and processing on Hadoop platform
    Lai, Wei Kuang
    Chen, Yi-Uan
    Wu, Tin-Yu
    Obaidat, Mohammad S.
    [J]. JOURNAL OF SUPERCOMPUTING, 2014, 68 (01): : 488 - 507
  • [44] Towards a framework for large-scale multimedia data storage and processing on Hadoop platform
    Wei Kuang Lai
    Yi-Uan Chen
    Tin-Yu Wu
    Mohammad S. Obaidat
    [J]. The Journal of Supercomputing, 2014, 68 : 488 - 507
  • [45] The Research of Massive Data Analysis and Processing Based on Hadoop
    Yi, Julan
    [J]. PROCEEDINGS OF THE 2015 3RD INTERNATIONAL CONFERENCE ON MACHINERY, MATERIALS AND INFORMATION TECHNOLOGY APPLICATIONS, 2015, 35 : 273 - 277
  • [46] Distributed Case-based Reasoning System Based on Big Data Platform Hadoop
    Wang, Chong-Yang
    Wang, Hong-Bing
    Liang, Yan-Rui
    [J]. 2015 INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING AND INFORMATION SYSTEM (SEIS 2015), 2015, : 629 - 634
  • [47] Hadoop Paradigm for Satellite Environmental Big Data Processing
    Semlali, Badr-Eddine Boudriki
    El Amrani, Chaker
    Ortiz, Guadalupe
    [J]. INTERNATIONAL JOURNAL OF AGRICULTURAL AND ENVIRONMENTAL INFORMATION SYSTEMS, 2020, 11 (01) : 23 - 47
  • [48] A Time Based Analysis of Data Processing on Hadoop Cluster
    Pal, Amrit
    Agrawal, Sanjay
    [J]. 2014 6TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND COMMUNICATION NETWORKS, 2014, : 608 - 612
  • [49] Big data processing and analysis platform based on deep neural network model
    Huang, Sheng
    [J]. SYSTEMS AND SOFT COMPUTING, 2024, 6
  • [50] Extensions to the Pig Data Processing Platform for Scalable RDF Data Processing Using Hadoop
    Tanimura, Yusuke
    Matono, Akiyoshi
    Lynden, Steven
    Kojima, Isao
    [J]. 2010 IEEE 26TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING WORKSHOPS (ICDE 2010), 2010, : 251 - 256