Study on the Performance Optimization and Application of Big Model in Big Data Processing

Authors
Wen, Zebin [1]
Wang, Ping [1]
Zhang, Jiuyang [1]
Xiong, Ping [1]
Affiliations
[1] Guangdong Univ Sci & Technol, Dongguan, Guangdong, Peoples R China
Keywords
Big Data Processing; Data Mining; Parallel Computing; Feature Engineering; Data Preprocessing
DOI
10.1109/DOCS63458.2024.10704388
Chinese Library Classification
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
As data volumes surge, big data processing faces unprecedented challenges, and large models have become an active research topic because of their strong data processing capabilities. This paper examines the performance bottlenecks of large models in big data processing and proposes a set of performance optimization strategies. Drawing on a review of existing data processing technologies and large model architectures, together with optimization theory and practice, the study introduces a comprehensive optimization mechanism covering resource scheduling, model computational efficiency, and storage and I/O. Experiments in a cloud computing environment were used to validate these strategies. The results indicate that the optimization strategies significantly improved performance across data of different scales, improved load balancing and resource utilization, and increased system stability. This research enriches the theoretical study of big data processing and provides effective optimization approaches for the practical application of large models in fields such as data mining and parallel computing. It also offers guidance for feature engineering and data preprocessing and points to directions for future research.
Pages: 650-657
Page count: 8
Related papers
  • [1] Cloud computing model for big data processing and performance optimization of multimedia communication
    Zhou, Zhicheng
    Zhao, Liang
    COMPUTER COMMUNICATIONS, 2020, 160 : 326 - 332
  • [2] The Performance Optimization of Big Data Processing by Adaptive MapReduce Workflow
    Li, Wei
    Tang, Maolin
    IEEE ACCESS, 2022, 10 : 79004 - 79020
  • [3] Modeling and Simulation in Performance Optimization of Big Data Processing Frameworks
    Ranjan, Rajiv
    IEEE CLOUD COMPUTING, 2014, 1 (04) : 14 - 19
  • [4] Performance Evaluation and Optimization of Join Operation in Spark for Big Data Processing
    Qiu, Deyang
    Zhou, Wenli
    Liu, Jun
    PROCEEDINGS OF 2017 3RD IEEE INTERNATIONAL CONFERENCE ON COMPUTER AND COMMUNICATIONS (ICCC), 2017, : 2295 - 2298
  • [5] Performance Factor Analysis and Scope of Optimization for Big Data Processing on Cluster
    Godara, Hanuman
    Govil, M. C.
    Pilli, E. S.
    2018 FIFTH INTERNATIONAL CONFERENCE ON PARALLEL, DISTRIBUTED AND GRID COMPUTING (IEEE PDGC), 2018, : 418 - 423
  • [6] The Application of Big Data Technology in the Optimization of Preschool Education Model
    Huang, Jun
    Wu, Guangzhi
    International Journal of Data Warehousing and Mining, 2024, 20 (01)
  • [7] A Novel Performance Evaluation and Optimization Model for Big Data System
    Xu, Jungang
    Wang, Guolu
    Liu, Shengyuan
    Liu, Renfeng
    2016 15TH INTERNATIONAL SYMPOSIUM ON PARALLEL AND DISTRIBUTED COMPUTING (ISPDC), 2016, : 121 - 130
  • [8] Optimization Study of Multidimensional Big Data Matrix Model in Enterprise Performance Evaluation System
    Fu, Honglin
    WIRELESS COMMUNICATIONS & MOBILE COMPUTING, 2021, 2021
  • [9] Big Data application on signal processing systems
    Tikhonyuk, A. I.
    Erokhin, S. D.
    Chadov, T. A.
    2018 SYSTEMS OF SIGNAL SYNCHRONIZATION, GENERATING AND PROCESSING IN TELECOMMUNICATIONS (SYNCHROINFO), 2018,
  • [10] On the Use of Hyperparameter Optimization in Big Data Processing Pipelines: A Case Study
    Dhaouadi, Jasser
    Aktas, Mehmet S.
    Kalipsiz, Oya
    Balcik, Erman
    2019 INNOVATIONS IN INTELLIGENT SYSTEMS AND APPLICATIONS CONFERENCE (ASYU), 2019, : 496 - 500