Block Storage Optimization and Parallel Data Processing and Analysis of Product Big Data Based on the Hadoop Platform

被引:1
|
作者
Wang, Yajun [1 ]
Cheng, Shengming [1 ]
Zhang, Xinchen [1 ]
Leng, Junyu [1 ]
Liu, Jun [1 ]
机构
[1] Dalian Polytech Univ, Sch Mech Engn & Automat, Dalian 116034, Peoples R China
关键词
ENTROPY;
D O I
10.1155/2021/3839800
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
The traditional distributed database storage architecture has the problems of low efficiency and storage capacity in managing data resources of seafood products. We reviewed various storage and retrieval technologies for the big data resources. A block storage layout optimization method based on the Hadoop platform and a parallel data processing and analysis method based on the MapReduce model are proposed. A multireplica consistent hashing algorithm based on data correlation and spatial and temporal properties is used in the parallel data processing and analysis method. The data distribution strategy and block size adjustment are studied based on the Hadoop platform. A multidata source parallel join query algorithm and a multi-channel data fusion feature extraction algorithm based on data-optimized storage are designed for the big data resources of seafood products according to the MapReduce parallel frame work. Practical verification shows that the storage optimization and data-retrieval methods provide supports for constructing a big data resource-management platform for seafood products and realize efficient organization and management of the big data resources of seafood products. The execution time of multidata source parallel retrieval is only 32% of the time of the standard Hadoop scheme, and the execution time of the multichannel data fusion feature extraction algorithm is only 35% of the time of the standard Hadoop scheme.
引用
收藏
页数:14
相关论文
共 50 条
  • [31] Performance Challenges and Solutions in Big Data Platform Hadoop
    Singh, Balraj
    Verma, Harsh K.
    Madaan, Vishu
    [J]. Recent Advances in Computer Science and Communications, 2023, 16 (09)
  • [32] Performance Modeling and Analysis of a Hadoop Cluster for Efficient Big Data Processing
    Lim, JongBeom
    Ahnh, Jong-Suk
    Lee, Kang-Woo
    [J]. ADVANCED SCIENCE LETTERS, 2016, 22 (09) : 2314 - 2319
  • [33] Design and Development of a Medical Big Data Processing System Based on Hadoop
    Yao, Qin
    Tian, Yu
    Li, Peng-Fei
    Tian, Li-Li
    Qian, Yang-Ming
    Li, Jing-Song
    [J]. JOURNAL OF MEDICAL SYSTEMS, 2015, 39 (03)
  • [34] Design and Development of a Medical Big Data Processing System Based on Hadoop
    Qin Yao
    Yu Tian
    Peng-Fei Li
    Li-Li Tian
    Yang-Ming Qian
    Jing-Song Li
    [J]. Journal of Medical Systems, 2015, 39
  • [35] Design and Implementation of Sensory Data Collection and Storage Based on Hadoop Platform
    Bai, Zhen
    Cui, Shaohua
    Zhao, Chenglin
    [J]. COMMUNICATIONS, SIGNAL PROCESSING, AND SYSTEMS, CSPS 2018, VOL III: SYSTEMS, 2020, 517 : 870 - 874
  • [36] A Design of Processing Platform for Smartphone Based on Big Data
    Zhang, Jie
    Zhang, Ke
    Zhou, Hengxin
    [J]. 2015 8TH INTERNATIONAL SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND DESIGN (ISCID), VOL 1, 2015, : 162 - 165
  • [37] A Sensor Data Processing and Access Platform based on Hadoop for Smart Environments
    Lin, Chi-Yi
    Li, Chia-Chen
    Huang, Wei-Hsun
    Liao, Wei-Che
    Chen, Wei-Ming
    [J]. 2014 17TH INTERNATIONAL CONFERENCE ON NETWORK-BASED INFORMATION SYSTEMS (NBIS 2014), 2014, : 453 - 458
  • [38] Big Data Processing Using Hadoop and Spark: The Case of Meteorology Data
    Hussein, Eslam
    Sadiki, Ronewa
    Jafta, Yahlieel
    Sungay, Muhammad Mujahid
    Ajayi, Olasupo
    Bagula, Antoine
    [J]. E-INFRASTRUCTURE AND E-SERVICES FOR DEVELOPING COUNTRIES (AFRICOMM 2019), 2020, 311 : 180 - 185
  • [39] Research on adaptive recommendation algorithm for big data mining based on Hadoop platform
    Zhang, Jinming
    [J]. INTERNATIONAL JOURNAL OF INTERNET PROTOCOL TECHNOLOGY, 2019, 12 (04) : 213 - 220
  • [40] EMM: Extended matching market based scheduling for big data platform hadoop
    Balraj Singh
    Harsh K Verma
    [J]. Multimedia Tools and Applications, 2022, 81 : 34823 - 34847