Exploiting Parallelism in the Simulation of General Purpose Graphics Processing Unit Program

被引:0
|
作者
赵夏 [1 ,2 ]
马胜 [1 ,2 ]
陈微 [1 ,2 ]
王志英 [1 ,2 ]
机构
[1] State Key Laboratory of High Performance Computing
[2] College of Computer,National University of Defense Technology
基金
国家教育部博士点专项基金资助; 高等学校博士学科点专项科研基金; 中国国家自然科学基金;
关键词
general purpose graphics processing unit(GPGPU); multicore; intra-kernel; inter-kernel; parallel;
D O I
暂无
中图分类号
TP391.41 [];
学科分类号
080203 ;
摘要
The simulation is an important means of performance evaluation of the computer architecture. Nowadays, the serial simulation of general purpose graphics processing unit(GPGPU) architecture is the main bottleneck for the simulation speed. To address this issue, we propose the intra-kernel parallelization on a multicore processor and the inter-kernel parallelization on a multiple-machine platform. We apply these two methods to the GPGPU-sim simulator. The intra-kernel parallelization method firstly parallelizes the serial simulation of multiple compute units in one cycle. Then it parallelizes the timing and functional simulation to reduce the performance loss caused by the synchronization between different compute units. The inter-kernel parallelization method divides multiple kernels of a CUDA program into several groups and distributes these groups across multiple simulation hosts to perform the simulation. Experimental results show that the intra-kernel parallelization method achieves a speed-up of up to 12 with a maximum error rate of 0.009 4% on a 32-core machine, and the inter-kernel parallelization method can accelerate the simulation by a factor of up to 3.9 with a maximum error rate of 0.11% on four simulation hosts. The orthogonality between these two methods allows us to combine them together on multiple multi-core hosts to get further performance improvements.
引用
收藏
页码:280 / 288
页数:9
相关论文
共 50 条
  • [31] FINDING AND EXPLOITING PARALLELISM IN AN OCEAN SIMULATION PROGRAM - EXPERIENCE, RESULTS, AND IMPLICATIONS
    SINGH, JP
    HENNESSY, JL
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 1992, 15 (01) : 27 - 48
  • [32] Special issue on evolutionary computation on general purpose graphics processing units
    José L. Risco-Martín
    Juan Lanchares
    Carlos A. Coello-Coello
    Soft Computing, 2012, 16 : 185 - 186
  • [33] Stochastic Speculative Computation Method on general purpose graphics processing units
    Suzuki, Yosuke
    Hamada, Akira
    Iizuka, Yasuki
    2017 6TH IIAI INTERNATIONAL CONGRESS ON ADVANCED APPLIED INFORMATICS (IIAI-AAI), 2017, : 1049 - 1050
  • [34] Special issue on evolutionary computation on general purpose graphics processing units
    Risco-Martin, Jose L.
    Lanchares, Juan
    Coello-Coello, Carlos A.
    SOFT COMPUTING, 2012, 16 (02) : 185 - 186
  • [35] Accelerating in-memory transaction processing using general purpose graphics processing units
    Gao, Lan
    Xu, Yunlong
    Wang, Rui
    Yang, Hailong
    Luan, Zhongzhi
    Qian, Depei
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2019, 97 : 836 - 848
  • [36] Accelerator: Using data parallelism to program GPUs for general-purpose uses
    Tarditi, David
    Puri, Sidd
    Oglesby, Jose
    ACM SIGPLAN NOTICES, 2006, 41 (11) : 325 - 335
  • [37] Neural Networks Training on Graphics Processing Unit (GPU) Using Dynamic Parallelism (DP)
    Hall, Will
    Tian, Yun
    INTELLIGENT SYSTEMS AND APPLICATIONS, VOL 2, 2023, 543 : 811 - 818
  • [38] Image processing as an exemplar of parallelism applied to graphics
    Smith, Harry F.
    Plusnick, Patrick
    Sarojak, Mark
    Seitz, William
    SIGCSE Bulletin (Association for Computing Machinery, Special Interest Group on Computer Science Education), 1996, : 363 - 367
  • [39] General purpose molecular dynamics simulations fully implemented on graphics processing units
    Anderson, Joshua A.
    Lorenz, Chris D.
    Travesset, A.
    JOURNAL OF COMPUTATIONAL PHYSICS, 2008, 227 (10) : 5342 - 5359
  • [40] Passive Radar Parallel Processing Using General-Purpose Computing on Graphics Processing Units
    Szczepankiewicz, Karolina
    Malanowski, Mateusz
    Szczepankiewicz, Michal
    INTERNATIONAL JOURNAL OF ELECTRONICS AND TELECOMMUNICATIONS, 2015, 61 (04) : 357 - 363