Study on the Performance Optimization and Application of Big Model in Big Data Processing

被引:0
|
作者
Wen, Zebin [1 ]
Wang, Ping [1 ]
Zhang, Jiuyang [1 ]
Xiong, Ping [1 ]
机构
[1] Guangdong Univ Sci & Technol, Dongguan, Guangdong, Peoples R China
关键词
Big Data Processing; Data Mining; Parallel Computing; Feature Engineering; Data Preprocessing;
D O I
10.1109/DOCS63458.2024.10704388
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
With the surge in data volume, big data processing faces unprecedented challenges, among which large models have become a hot research topic due to their powerful data processing capabilities. This paper delves into the performance bottlenecks of large models in big data processing and proposes a series of performance optimization strategies. Through a review of existing data processing technologies and large model architectures, combined with optimization theory and practice, this study introduces a comprehensive optimization mechanism that includes resource scheduling, model computational efficiency, and storage and IO. Experiments were conducted in a cloud computing environment to validate these strategies. The results indicate that the optimization strategies significantly enhanced performance when processing different scales of data, improved load balancing and resource utilization, and increased system stability. This research enriches the theoretical study of big data processing and provides effective optimization avenues for the practical application of large models in fields such as data mining and parallel computing. It offers guidance for feature engineering and data preprocessing and paves the way for future research directions.
引用
收藏
页码:650 / 657
页数:8
相关论文
共 50 条
  • [31] Boosting Heapsort Performance of Processing Big Data Streams
    Algemili, Usamah
    Alhudhaif, Adi
    SOUTHEASTCON 2016, 2016,
  • [32] Big data processing tools: An experimental performance evaluation
    Rodrigues, Mario
    Santos, Maribel Yasmina
    Bernardino, Jorge
    WILEY INTERDISCIPLINARY REVIEWS-DATA MINING AND KNOWLEDGE DISCOVERY, 2019, 9 (02)
  • [33] The research and application of a big data storage model
    Liu, Na
    Zhou, Jianfei
    International Journal of Database Theory and Application, 2015, 8 (04): : 319 - 330
  • [34] Optimization of Management and Processing of Big Data on a Platform for Distributed Data Storage
    Neric, Vedrana
    Sarajlic, Nermin
    Hadzic, Dulaga
    ELEKTROTEHNISKI VESTNIK, 2024, 91 (05): : 272 - 283
  • [35] Optimization of Management and Processing of Big Data on a Platform for Distributed Data Storage
    Nerić, Vedrana
    Sarajlić, Nermin
    Hadžić, Đulaga
    Elektrotehniski Vestnik/Electrotechnical Review, 2024, 91 (05): : 272 - 283
  • [36] Big Mechanisms for Processing Big Data in Medical Informatics
    Harabagiu, Sanda
    2015 IEEE SIGNAL PROCESSING IN MEDICINE AND BIOLOGY SYMPOSIUM (SPMB), 2015,
  • [37] A learned cost model for big data query processing
    Li, Yan
    Wang, Liwei
    Wang, Sheng
    Sun, Yuan
    Zheng, Bolong
    Peng, Zhiyong
    INFORMATION SCIENCES, 2024, 670
  • [38] Parameter tuning of big data platforms for performance optimization
    Pattanshetti, Tanuja
    Attar, Vahida
    JOURNAL OF INFORMATION & OPTIMIZATION SCIENCES, 2020, 41 (02): : 403 - 410
  • [39] Cloud Computing Model for Big Geological Data Processing
    Song, Miaomiao
    Li, Zhe
    Zhou, Bin
    Li, Chaoling
    SENSORS, MEASUREMENT AND INTELLIGENT MATERIALS II, PTS 1 AND 2, 2014, 475-476 : 306 - +
  • [40] Clustering Model of Cloud Centers for Big Data Processing
    Klymash, Mykhailo
    Pelch, Nazar
    Shpur, Olga
    Lutsiuk, Iryna
    2018 14TH INTERNATIONAL CONFERENCE ON ADVANCED TRENDS IN RADIOELECTRONICS, TELECOMMUNICATIONS AND COMPUTER ENGINEERING (TCSET), 2018, : 268 - 271