Cluster optimization algorithm based on CPU and GPU hybrid architecture

被引:0
|
作者
Fei Yin
Feng Shi
机构
[1] Beijing Institute of Technology,College of Computer Science and Technology
来源
Cluster Computing | 2022年 / 25卷
关键词
CPU/GPU heterogeneous system; Performance optimization; Load balancing; Parallel computing model;
D O I
暂无
中图分类号
学科分类号
摘要
With the rapid development of network technology and parallel computing, clusters formed by connecting a large number of PCs with high-speed networks have gradually replaced the status of supercomputers in scientific research and production and high-performance computing with cost-effective advantages. The research purpose of this paper is to integrate the Kriging proxy model method and energy efficiency modeling method into a cluster optimization algorithm of CPU and GPU hybrid architecture. This paper proposes a parallel computing model for large-scale CPU/GPU heterogeneous high-performance computing systems, which can effectively describe the computing capabilities and various communication behaviors of CPU/GPU heterogeneous systems, and finally provide algorithm optimization for CPU/GPU heterogeneous clusters. According to the GPU architecture, an efficient method of constructing a Kriging proxy model and an optimized search algorithm are designed. The experimental results in this paper show that the construction of the Kriging proxy model can obtain a 220 times speedup ratio, and the search algorithm can reach an 8 times speedup ratio. It can be seen that this heterogeneous cluster optimization algorithm has high feasibility.
引用
收藏
页码:2601 / 2611
页数:10
相关论文
共 50 条
  • [41] Architecture for Fast Object Detection Supporting CPU-GPU Hybrid and Distributed Computing
    Bae, Yuseok
    Park, Jongyoul
    2017 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS (ICCE), 2017,
  • [42] Optimization of Parallel Algorithm for Kalman Filter on CPU-GPU Heterogeneous System
    Xu, Dandan
    Xiao, Zheng
    Li, Dapu
    Wu, Fan
    2016 12TH INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION, FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY (ICNC-FSKD), 2016, : 2165 - 2172
  • [43] A hybrid northern goshawk optimization algorithm based on cluster collaboration
    Wu, Changjun
    Li, Qingzhen
    Wang, Qiaohua
    Zhang, Huanlong
    Song, Xiaohui
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2024, 27 (09): : 13203 - 13237
  • [44] Implementation and Analysis of the Histograms of Oriented Gradients Algorithm on a Heterogeneous Multicore CPU/GPU Architecture
    Arndt, Oliver Jakob
    Linde, Tobias
    Blume, Holger
    2015 IEEE GLOBAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING (GLOBALSIP), 2015, : 1402 - 1406
  • [45] Vidushi: Parallel Implementation of Alpha Miner Algorithm and Performance Analysis on CPU and GPU Architecture
    Kundra, Divya
    Juneja, Prerna
    Sureka, Ashish
    BUSINESS PROCESS MANAGEMENT WORKSHOPS, (BPM 2015), 2016, 256 : 230 - 241
  • [46] ASW: Accelerating Smith-Waterman Algorithm on Coupled CPU-GPU Architecture
    Zou, Huihui
    Tang, Shanjiang
    Yu, Ce
    Fu, Hao
    Li, Yusen
    Tang, Wenjie
    INTERNATIONAL JOURNAL OF PARALLEL PROGRAMMING, 2019, 47 (03) : 388 - 402
  • [47] Column-Stored System Join Optimization on Coupled CPU-GPU Architecture
    Ding, Xiangwu
    Li, Zitong
    PROCEEDINGS OF 2015 4TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND NETWORK TECHNOLOGY (ICCSNT 2015), 2015, : 184 - 191
  • [48] Learning Based Performance and Power Efficient Cluster Resource Manager for CPU-GPU Cluster
    Das, Soumen Kumar
    Sudhakaran, G.
    Ashok, V.
    2014 FOURTH INTERNATIONAL CONFERENCE OF EMERGING APPLICATIONS OF INFORMATION TECHNOLOGY (EAIT), 2014, : 161 - 166
  • [49] Fast hybrid CPU- and GPU-based CT reconstruction algorithm using air skipping technique
    Lee, Byeonghun
    Lee, Ho
    Shin, Yeong Gil
    JOURNAL OF X-RAY SCIENCE AND TECHNOLOGY, 2010, 18 (03) : 221 - 234
  • [50] Fast Snippet Generation Based On CPU-GPU Hybrid System
    Liu, Ding
    Li, Ruixuan
    Gu, Xiwu
    Wen, Kunmei
    He, Heng
    Gao, Guoqiang
    2011 IEEE 17TH INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED SYSTEMS (ICPADS), 2011, : 252 - 259